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PREFACE 


Nobel Prize-winning physicist Richard Feynman once noted that nature 
has a far, far better imagination than our own. Few things in the universe 
illustrate this observation better than the cell. A tiny sac of molecules 
capable of self-replication, this marvelous structure constitutes the fun- 
damental building block of life. We are made of cells. Cells provide all 
the nutrients we consume. And the continuous activity of cells makes 
our planet habitable. To understand ourselves—and the world of which 
we are a part—we need to know something of the life of cells. Armed 
with such knowledge, we—as citizens and stewards of the global 
community—will be better equipped to make well-informed decisions 
about increasingly sophisticated issues, from climate change and food 
security to biomedical technologies and emerging epidemics. 


In Essential Cell Biology we introduce readers to the fundamentals of 
cell biology. The Fifth Edition introduces powerful new techniques that 
allow us to examine cells and their components with unprecedented 
precision—such as super-resolution fluorescence microsocopy and 
cryoelectron microscopy—as well as the latest methods for DNA 
sequencing and gene editing. We discuss new thinking about how cells 
organize and encourage the chemical reactions that make life possible, 
and we review recent insights into human origins and genetics. 


With each edition of Essential Cell Biology, its authors re-experience the 
joy of learning something new and surprising about cells. We are also 
reminded of how much we still don’t know. Many of the most fascinat- 
ing questions in cell biology remain unanswered. How did cells arise on 
the early Earth, multiplying and diversifying through billions of years of 
evolution to fill every possible niche—from steaming vents on the ocean 
floor to frozen mountaintops—and, in doing so, transform our planet's 
entire environment? How is it possible for billions of cells to seamlessly 
cooperate and form large, multicellular organisms like ourselves? These 
are among the many challenges that remain for the next generation of 
cell biologists, some of whom will begin a wonderful, lifelong journey 
with this textbook. 


Readers interested in learning how scientific inquisitiveness can fuel break- 
throughs in our understanding of cell biology will enjoy the stories of dis- 
covery presented in each chapter’s “How We Know” feature. Packed with 
experimental data and design, these narratives illustrate how biologists 
tackle important questions and how experimental results shape future 
ideas. In this edition, a new “How We Know” recounts the discoveries that 
first revealed how cells transform the energy locked in food molecules into 
the forms used to power the metabolic reactions on which life depends. 


As in previous editions, the questions in the margins and at the end of 
each chapter not only test comprehension but also encourage careful 
thought and the application of newly acquired information to a broader 
biological context. Some of these questions have more than one valid 
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answer and others invite speculation. Answers to all of the questions 
are included at the back of the book, and many provide additional 
information or an alternative perspective on material presented in the 
main text. 


More than 160 video clips, animations, atomic structures, and high- 
resolution micrographs complement the book and are available online. 
The movies are correlated with each chapter and callouts are highlighted 
in color. This supplemental material, created to clarify complex and critical 
concepts, highlights the intrinsic beauty of living cells. 


For those who wish to probe even more deeply, Molecular Biology of 
the Cell, now in its sixth edition, offers a detailed account of the life of 
the cell. In addition, Molecular Biology of the Cell, Sixth Edition: A Prob- 
lems Approach, by John Wilson and Tim Hunt, provides a gold mine of 
thought-provoking questions at all levels of difficulty. We have drawn 
upon this tour-de-force of experimental reasoning for some of the ques- 
tions in Essential Cell Biology, and we are very grateful to its authors. 


Every chapter of Essential Cell Biology is the product of a communal effort: 
both text and figures were revised and refined as drafts circulated from 
one author to another—many times over and back again! The numer- 
ous other individuals who have helped bring this project to fruition are 
credited in the Acknowledgments that follow. Despite our best efforts, it 
is inevitable that errors will have crept into the book, and we encourage 
eagle-eyed readers who find mistakes to let us know, so that we can 
correct them in the next printing. 
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Test Bank 
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D. Vaughan, Harvard University Division of Continuing Education, 
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level, learning objective, book section, and difficulty level, allowing in- 
structors to easily create meaningful exams. The Test Bank is available in 
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Animations and Videos 


Streaming links give access to more than 130 videos and animations, 
bringing the concepts of cell biology to life. The movies are correlated 
with each chapter and callouts are highlighted in color. 
Figure-integrated Lecture Outlines 


All of the figures are integrated in PowerPoint, along with the section 
and concept headings from the text, to give instructors a head start 
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Image Files 


Every figure and photograph in the book is available for download in 
PowerPoint and JPG formats from wwnorton.com/instructors. 


STUDENT RESOURCES 


digital wwnorton.com/ecb5 


Animations and Videos 


Streaming links give access to more than 130 videos and animations, 
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Cells: The Fundamental 
Units of Life 


What does it mean to be living? Petunias, people, and pond scum are all 
alive; stones, sand, and summer breezes are not. But what are the fun- 
damental properties that characterize living things and distinguish them 
from nonliving matter? 


The answer hinges on a basic fact that is taken for granted now but 
marked a revolution in thinking when first established more than 175 
years ago. All living things (or organisms) are built from cells: small, 
membrane-enclosed units filled with a concentrated aqueous solution of 
chemicals and endowed with the extraordinary ability to create copies of 
themselves by growing and then dividing in two. The simplest forms of 
life are solitary cells. Higher organisms, including ourselves, are commu- 
nities of cells derived by growth and division from a single founder cell. 
Every animal or plant is a vast colony of individual cells, each of which 
performs a specialized function that is integrated by intricate systems of 
cell-to-cell communication. 


Cells, therefore, are the fundamental units of life. Thus it is to cell biol- 
ogy—the study of cells and their structure, function, and behavior—that 
we look for an answer to the question of what life is and how it works. 
With a deeper understanding of cells, we can begin to tackle the grand 
historical problems of life on Earth: its mysterious origins, its stunning 
diversity produced by billions of years of evolution, and its invasion of 
every conceivable habitat on the planet. At the same time, cell biology 
can provide us with answers to the questions we have about ourselves: 
Where did we come from? How do we develop from a single fertilized egg 
cell? How is each of us similar to—yet different from—everyone else on 
Earth? Why do we get sick, grow old, and die? 
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CHAPTER 1 


Cells: The Fundamental Units of Life 


In this chapter, we introduce the concept of cells: what they are, where 
they come from, and how we have learned so much about them. We 
begin by looking at the great variety of forms that cells can adopt, and 
we take a preliminary glimpse at the chemical machinery that all cells 
have in common. We then consider how cells are made visible under 
the microscope and what we see when we peer inside them. Finally, we 
discuss how we can exploit the similarities of living things to achieve 
a coherent understanding of all forms of life on Earth—from the tiniest 
bacterium to the mightiest oak. 


UNITY AND DIVERSITY OF CELLS 


Biologists estimate that there may be up to 100 million distinct species 
of living things on our planet—organisms as different as a dolphin and 
a rose or a bacterium and a butterfly. Cells, too, differ vastly in form and 
function. Animal cells differ from those in a plant, and even cells within a 
single multicellular organism can differ wildly in appearance and activity. 
Yet despite these differences, all cells share a fundamental chemistry and 
other common features. 


In this section, we take stock of some of the similarities and differences 
among cells, and we discuss how all present-day cells appear to have 
evolved from a common ancestor. 


Cells Vary Enormously in Appearance and Function 


When comparing one cell and another, one of the most obvious places 
to start is with size. A bacterial cell—say a Lactobacillus in a piece of 
cheese—is a few micrometers, or um, in length. That’s about 25 times 
smaller than the width of a human hair. At the other extreme, a frog 
egg—which is also a single cell—has a diameter of about 1 millimeter 
(mm). If we scaled them up to make the Lactobacillus the size of a person, 
the frog egg would be half a mile high. 


Cells vary just as widely in their shape (Figure 1-1). A typical nerve cell in 
your brain, for example, is enormously extended: it sends out its electri- 
cal signals along a single, fine protrusion (an axon) that is 10,000 times 
longer than it is thick, and the cell receives signals from other nerve cells 
through a collection of shorter extensions that sprout from its body like 
the branches of a tree (see Figure 1-1A). A pond-dwelling Paramecium, 
on the other hand, is shaped like a submarine and is covered with thou- 
sands of cilia—hairlike projections whose sinuous, coordinated beating 
sweeps the cell forward, rotating as it goes (Figure 1—-1B). A cell in the 
surface layer of a plant is squat and immobile, surrounded by a rigid box 
of cellulose with an outer waterproof coating of wax (Figure 1-1C). A 
macrophage in the body of an animal, by contrast, crawls through tis- 
sues, constantly pouring itself into new shapes, as it searches for and 
engulfs debris, foreign microorganisms, and dead or dying cells (Figure 
1-1D). A fission yeast is shaped like a rod (Figure 1-1E), whereas a bud- 
ding yeast is delightfully spherical (see Figure 1-14). And so on. 


Cells are also enormously diverse in their chemical requirements. Some 
require oxygen to live; for others the gas is deadly. Some cells consume 
little more than carbon dioxide (CO2), sunlight, and water as their raw 
materials; others need a complex mixture of molecules produced by 
other cells. 


These differences in size, shape, and chemical requirements often reflect 
differences in cell function. Some cells are specialized factories for the 
production of particular substances, such as hormones, starch, fat, latex, 
or pigments. Others, like muscle cells, are engines that burn fuel to do 


mechanical work. Still others are electricity generators, like the modified 
muscle cells in the electric eel. 


Some modifications specialize a cell so much that the cell ceases to pro- 
liferate, thus producing no descendants. Such specialization would be 
senseless for a cell that lived a solitary life. In a multicellular organism, 
however, there is a division of labor among cells, allowing some cells to 
become specialized to an extreme degree for particular tasks and leaving 
them dependent on their fellow cells for many basic requirements. Even 
the most basic need of all, that of passing on the genetic instructions of 
the organism to the next generation, is delegated to specialists—the egg 
and the sperm. 


Living Cells All Have a Similar Basic Chemistry 


Despite the extraordinary diversity of plants and animals, people have 
recognized from time immemorial that these organisms have something 
in common, something that entitles them all to be called living things. 
But while it seemed easy enough to recognize life, it was remarkably dif- 
ficult to say in what sense all living things were alike. Textbooks had to 
settle for defining life in abstract general terms related to growth, repro- 
duction, and an ability to actively alter their behavior in response to the 
environment. 


The discoveries of biochemists and molecular biologists have provided 
an elegant solution to this awkward situation. Although the cells of all 
living things are enormously varied when viewed from the outside, they 
are fundamentally similar inside. We now know that cells resemble one 
another to an astonishing degree in the details of their chemistry. They are 
composed of the same sorts of molecules, which participate in the same 
types of chemical reactions (discussed in Chapter 2). In all organisms, 
genetic information—in the form of genes—is carried in DNA molecules. 
This information is written in the same chemical code, constructed out 
of the same chemical building blocks, interpreted by essentially the same 
chemical machinery, and replicated in the same way when a cell or 


100 um 


25 um 


Unity and Diversity of Cells 


QUESTION 1-1 


“Life” is easy to recognize but 
difficult to define. According to one 
popular biology text, living things: 
1. Are highly organized compared 
to natural inanimate objects. 

2. Display homeostasis, maintaining 
a relatively constant internal 
environment. 

3. Reproduce themselves. 

4. Grow and develop from simple 
beginnings. 

5. Take energy and matter from the 
environment and transform it. 

6. Respond to stimuli. 

7. Show adaptation to their 
environment. 

Score a person, a vacuum cleaner, 
and a potato with respect to these 
characteristics. 


Ld 
5 um 


Figure 1-1 Cells come in a variety of shapes and sizes. Note the very different scales of these micrographs. (A) Drawing of a single 
nerve cell from a mammalian brain. This cell has a single, unbranched extension (axon), projecting toward the top of the image, through 
which it sends electrical signals to other nerve cells, and it possesses a huge branching tree of projections (dendrites) through which it 
receives signals from as many as 100,000 other nerve cells. (B) Paramecium. This protozoan—a single giant cell—swims by means of the 
beating cilia that cover its surface. (C) The surface of a snapdragon flower petal displays an orderly array of tightly packed cells. 

(D) A macrophage spreads itself out as it patrols animal tissues in search of invading microorganisms. (E) A fission yeast is caught in the 
act of dividing in two. The medial septum (stained red with a fluorescent dye) is forming a wall between the two nuclei (also stained red) 
that have been separated into the two daughter cells; in this image, the cells’ membranes are stained with a green fluorescent dye. 

(A, Herederos de Santiago Ramon y Cajal, 1899; B, courtesy of Anne Aubusson Fleury, Michel Laurent, and André Adoutte; C, courtesy 
of Kim Findlay; D, from P.J. Hanley et al., Proc. Natl Acad. Sci. USA 107:12145-12150, 2010. With permission from National Academy of 


Sciences; E, courtesy of Janos Demeter and Shelley Sazer.) 
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Figure 1-2 In all living cells, genetic 
information flows from DNA to RNA 
(transcription) and from RNA to protein 
(translation)—an arrangement known 

as the central dogma. The sequence of 
nucleotides in a particular segment of 

DNA (a gene) is transcribed into an RNA 
molecule, which can then be translated into 
the linear sequence of amino acids of a 
protein. Only a small part of the gene, RNA, 
and protein is shown. 
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organism reproduces. Thus, in every cell, long polymer chains of DNA 
are made from the same set of four monomers, called nucleotides, strung 
together in different sequences like the letters of an alphabet. The infor- 
mation encoded in these DNA molecules is read out, or transcribed, into 
a related set of polynucleotides called RNA. Although some of these RNA 
molecules have their own regulatory, structural, or chemical activities, 
most are translated into a different type of polymer called a protein. This 
flow of information—from DNA to RNA to protein—is so fundamental to 
life that it is referred to as the central dogma (Figure 1-2). 


The appearance and behavior of a cell are dictated largely by its pro- 
tein molecules, which serve as structural supports, chemical catalysts, 
molecular motors, and much more. Proteins are built from amino acids, 
and all organisms use the same set of 20 amino acids to make their pro- 
teins. But the amino acids are linked in different sequences, giving each 
type of protein molecule a different three-dimensional shape, or confor- 
mation, just as different sequences of letters spell different words. In this 
way, the same basic biochemical machinery has served to generate the 
whole gamut of life on Earth (Figure 1-3). 


Living Cells Are Self-Replicating Collections of Catalysts 


One of the most commonly cited properties of living things is their abil- 
ity to reproduce. For cells, the process involves duplicating their genetic 
material and other components and then dividing in two—producing 
a pair of daughter cells that are themselves capable of undergoing the 
same cycle of replication. 


It is the special relationship between DNA, RNA, and proteins—as 
outlined in the central dogma (see Figure 1-2)—that makes this self- 
replication possible. DNA encodes information that ultimately directs 
the assembly of proteins: the sequence of nucleotides in a molecule of 
DNA dictates the sequence of amino acids in a protein. Proteins, in turn, 
catalyze the replication of DNA and the transcription of RNA, and they 
participate in the translation of RNA into proteins. This feedback loop 
between proteins and polynucleotides underlies the self-reproducing 
behavior of living things (Figure 1-4). We discuss this complex inter- 
dependence between DNA, RNA, and proteins in detail in Chapters 5 
through 8. 


In addition to their roles in polynucleotide and protein synthesis, proteins 
also catalyze the many other chemical reactions that keep the self-repli- 
cating system shown in Figure 1-4 running. A living cell can break down 
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Figure 1-3 All living organisms are constructed from cells. (A) A colony of bacteria, (B) a butterfly, (C) a rose, and (D) a dolphin 
are all made of cells that have a fundamentally similar chemistry and operate according to the same basic principles. (A, courtesy 
of Janice Carr; D, courtesy of Jonathan Gordon, IFAW.) 
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nutrients and use the products to both make the building blocks needed 
to produce polynucleotides, proteins, and other cell constituents and to 
generate the energy needed to power these biosynthetic processes. We 
discuss these vital metabolic reactions in detail in Chapters 3 and 13. 


Only living cells can perform these astonishing feats of self-replication. 
Viruses also contain information in the form of DNA or RNA, but they do 
not have the ability to reproduce by their own efforts. Instead, they para- 
sitize the reproductive machinery of the cells that they invade to make 
copies of themselves. Thus, viruses are not truly considered living. They 
are merely chemical zombies: inert and inactive outside their host cells 
but able to exert a malign control once they gain entry. We review the life 
cycle of viruses in Chapter 9. 


All Living Cells Have Apparently Evolved from the Same 
Ancestral Cell 


When a cell replicates its DNA in preparation for cell division, the copy- 
ing is not always perfect. On occasion, the instructions are corrupted by 
mutations that change the sequence of nucleotides in the DNA. For this 
reason, daughter cells are not necessarily exact replicas of their parent. 


Mutations can create offspring that are changed for the worse (in that 
they are less able to survive and reproduce), changed for the better (in 
that they are better able to survive and reproduce), or changed in a neutral 
way (in that they are genetically different but equally viable). The struggle 
for survival eliminates the first, favors the second, and tolerates the third. 
The genes of the next generation will be the genes of the survivors. 


For many organisms, the pattern of heredity may be complicated by sex- 
ual reproduction, in which two cells of the same species fuse, pooling 
their DNA. The genetic cards are then shuffled, re-dealt, and distributed 
in new combinations to the next generation, to be tested again for their 
ability to promote survival and reproduction. 


These simple principles of genetic change and selection, applied repeat- 
edly over billions of cell generations, are the basis of evolution—the 
process by which living species become gradually modified and adapted 
to their environment in more and more sophisticated ways. Evolution 
offers a startling but compelling explanation of why present-day cells 
are so similar in their fundamentals: they have all inherited their genetic 
instructions from the same common ancestral cell. It is estimated that 
this cell existed between 3.5 and 3.8 billion years ago, and we must sup- 
pose that it contained a prototype of the universal machinery of all life on 
Earth today. Through a very long process of mutation and natural selec- 
tion, the descendants of this ancestral cell have gradually diverged to fill 
every habitat on Earth with organisms that exploit the potential of the 
machinery in a seemingly endless variety of ways. 


Unity and Diversity of Cells 


Figure 1-4 Life is an autocatalytic 
process. DNA and RNA provide the 
sequence information (green arrows) that 
is used to produce proteins and to copy 
themselves. Proteins, in turn, provide the 
catalytic activity (red arrows) needed to 
synthesize DNA, RNA, and themselves. 
Together, these feedback loops create the 
self-replicating system that endows living 
cells with their ability to reproduce. 


QUESTION 1-2 


Mutations are mistakes in the DNA 
that change the genetic plan from 
that of the previous generation. 
Imagine a shoe factory. Would you 
expect mistakes (i.e., unintentional 
changes) in copying the shoe 
design to lead to improvements in 
the shoes produced? Explain your 
answer. 
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Cells: The Fundamental Units of Life 


Genes Provide Instructions for the Form, Function, and 
Behavior of Cells and Organisms 


A cell's genome—that is, the entire sequence of nucleotides in an organ- 
ism’s DNA—provides a genetic program that instructs a cell how to 
behave. For the cells of plant and animal embryos, the genome directs 
the growth and development of an adult organism with hundreds of dif- 
ferent cell types. Within an individual plant or animal, these cells can be 
extraordinarily varied, as we discuss in detail in Chapter 20. Fat cells, skin 
cells, bone cells, and nerve cells seem as dissimilar as any cells could 
be. Yet all these differentiated cell types are generated during embryonic 
development from a single fertilized egg cell, and they contain identi- 
cal copies of the DNA of the species. Their varied characters stem from 
the way that individual cells use their genetic instructions. Different cells 
express different genes: that is, they use their genes to produce some 
RNAs and proteins and not others, depending on their internal state and 
on cues that they and their ancestor cells have received from their sur- 
roundings—mainly signals from other cells in the organism. 


The DNA, therefore, is not just a shopping list specifying the molecules 
that every cell must make, and a cell is not just an assembly of all the 
items on the list. Each cell is capable of carrying out a variety of biologi- 
cal tasks, depending on its environment and its history, and it selectively 
uses the information encoded in its DNA to guide its activities. Later in 
this book, we will see in detail how DNA defines both the parts list of the 
cell and the rules that decide when and where these parts are to be made. 


CELLS UNDER THE MICROSCOPE 


Today, we have access to many powerful technologies for deciphering 
the principles that govern the structure and activity of the cell. But cell 
biology started without these modern tools. The earliest cell biologists 
began by simply looking at tissues and cells, and later breaking them 
open or slicing them up, attempting to view their contents. What they 
saw was to them profoundly baffling—a collection of tiny objects whose 
relationship to the properties of living matter seemed an impenetrable 
mystery. Nevertheless, this type of visual investigation was the first step 
toward understanding tissues and cells, and it remains essential today in 
the study of cell biology. 


Cells were not made visible until the seventeenth century, when the 
microscope was invented. For hundreds of years afterward, all that 
was known about cells was discovered using this instrument. Light 
microscopes use visible light to illuminate specimens, and they allowed 
biologists to see for the first time the intricate structure that underpins all 
living things. 


Although these instruments now incorporate many sophisticated 
improvements, the properties of light—specifically its wavelength—limit 
the fineness of detail these microscopes reveal. Electron microscopes, 
invented in the 1930s, go beyond this limit by using beams of electrons 
instead of beams of light as the source of illumination; because electrons 
have a much shorter wavelength, these instruments greatly extend our 
ability to see the fine details of cells and even render some of the larger 
molecules visible individually. 


In this section, we describe various forms of light and electron micro- 
scopy. These vital tools in the modern cell biology laboratory continue 
to improve, revealing new and sometimes surprising details about how 
cells are built and how they operate. 


Cells Under the Microscope 


The Invention of the Light Microscope Led to the 
Discovery of Cells 


By the seventeenth century, glass lenses were powerful enough to permit 
the detection of structures invisible to the naked eye. Using an instrument 
equipped with such a lens, Robert Hooke examined a piece of cork and 
in 1665 reported to the Royal Society of London that the cork was com- 
posed of a mass of minute chambers. He called these chambers “cells,” 
based on their resemblance to the simple rooms occupied by monks in a 
monastery. The name stuck, even though the structures Hooke described 
were actually the cell walls that remained after the plant cells living 
inside them had died. Later, Hooke and his Dutch contemporary Antoni 
van Leeuwenhoek were able to observe living cells, seeing for the first 
time a world teeming with motile microscopic organisms. 


For almost 200 years, such instruments—the first light microscopes— 
remained exotic devices, available only to a few wealthy individuals. It 
was not until the nineteenth century that microscopes began to be widely 
used to look at cells. The emergence of cell biology as a distinct science 
was a gradual process to which many individuals contributed, but its offi- 


cial birth is generally said to have been signaled by two publications: one Figure 1-5 New cells form by growth 
by the botanist Matthias Schleiden in 1838 and the other by the zoolo- and division of existing cells. (A) In 1880, 
gist Theodor Schwann in 1839. In these papers, Schleiden and Schwann Eduard Strasburger drew a living plant cell 


(a hair cell from a Tradescantia flower), which 


documented the results of a systematic investigation of plant and animal ke obean e dK ding inane overa pood 
tissues with the light microscope, showing that cells were the universal of 2.5 hours. Inside the cell, DNA (black) can 
building blocks of all living tissues. Their work, and that of other nine- be seen condensing into chromosomes, 
teenth-century microscopists, slowly led to the realization that all living which are then segregated into the two 
cells are formed by the growth and division of existing cells—a principle e a E E A 
sometimes referred to as the cell theory (Figure 1-5). The implication that w n A = Bie “a oa TER ol 
living organisms do not arise spontaneously but can be generated only 100:1363-1368, 1985. With permission from 
from existing organisms was hotly contested, but it was finally confirmed Rockefeller University Press.) 
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CHAPTER 1 


QUESTION 1-3 


You have embarked on an ambitious 
research project: to create life in a 
test tube. You boil up a rich mixture 
of yeast extract and amino acids ina 
flask, along with a sprinkling of the 
inorganic salts known to be essential 
for life. You seal the flask and allow 
it to cool. After several months, 

the liquid is as clear as ever, and 
there are no signs of life. A friend 
suggests that excluding the air was a 
mistake, since most life as we know 
it requires oxygen. You repeat the 
experiment, but this time you leave 
the flask open to the atmosphere. 
To your great delight, the liquid 
becomes cloudy after a few days, 
and, under the microscope, you 

see beautiful small cells that are 
clearly growing and dividing. Does 
this experiment prove that you 
managed to generate a novel life- 
form? How might you redesign your 
experiment to allow air into the 
flask, yet eliminate the possibility 
that contamination by airborne 
microorganisms is the explanation 
for the results? (For a ready- 

made answer, look up the classic 
experiments of Louis Pasteur.) 


Figure 1-6 Cells form tissues in plants 
and animals. (A) Cells in the root tip of 

a fern. The DNA-containing nuclei are 
stained red, and each cell is surrounded 
by a thin cell wall (light blue). The red 
nuclei of densely packed cells are seen 

at the bottom corners of the preparation. 
(B) Cells in the crypts of the small intestine. 
Each crypt appears in this cross section as 
a ring of closely packed cells (with nuclei 
stained blue). The ring is surrounded by 
extracellular matrix, which contains the 
scattered cells that produced most of the 
matrix components. (A, courtesy of James 
Mauseth; B, Jose Luis Calvo/Shutterstock.) 
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in the 1860s by an elegant set of experiments performed by Louis Pasteur 
(see Question 1-3). 


The principle that cells are generated only from preexisting cells and 
inherit their characteristics from them underlies all of biology and gives 
the subject a unique flavor: in biology, questions about the present are 
inescapably linked to conditions in the past. To understand why present- 
day cells and organisms behave as they do, we need to understand their 
history, all the way back to the misty origins of the first cells on Earth. 
Charles Darwin provided the key insight that makes this history com- 
prehensible. His theory of evolution, published in 1859, explains how 
random variation and natural selection gave rise to diversity among 
organisms that share a common ancestry. When combined with the cell 
theory, the theory of evolution leads us to view all life, from its beginnings 
to the present day, as one vast family tree of individual cells. Although 
this book is primarily about how cells work today, we will encounter the 
theme of evolution again and again. 


Light Microscopes Reveal Some of a Cell’s Components 


If a very thin slice is cut from a suitable plant or animal tissue and viewed 
using a light microscope, it is immediately apparent that the tissue is 
divided into thousands of small cells. In some cases, the cells are closely 
packed; in others, they are separated from one another by an extracellular 
matrix—a dense material often made of protein fibers embedded in a gel 
of long sugar chains. Each cell is typically about 5-20 um in diameter. If 
care has been taken to keep the specimen alive, particles will be seen 
moving around inside its individual cells. On occasion, a cell may even 
be seen slowly changing shape and dividing into two (see Figure 1-5 and 
Movie 1.1). 


Distinguishing the internal structure of a cell is difficult, not only because 
the parts are small, but also because they are transparent and mostly 
colorless. One way around the problem is to stain cells with dyes that 
color particular components differently (Figure 1-6). Alternatively, one 
can exploit the fact that cell components differ slightly from one another 
in refractive index, just as glass differs in refractive index from water, 
causing light rays to be deflected as they pass from the one medium into 
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the other. The small differences in refractive index can be made visible by 
specialized optical techniques, and the resulting images can be enhanced 
further by electronic processing (Figure 1—7A). 


As shown in Figures 1-6B and 1-7A, typical animal cells visualized in 
these ways have a distinct anatomy. They have a sharply defined bound- 
ary, indicating the presence of an enclosing membrane, the plasma 
membrane. A large, round structure, the nucleus, is prominent near the 
middle of the cell. Around the nucleus and filling the cell’s interior is the 
cytoplasm, a transparent substance crammed with what seems at first to 
be a jumble of miscellaneous objects. With a good light microscope, one 
can begin to distinguish and classify some of the specific components in 
the cytoplasm, but structures smaller than about 0.2 um—about half the 
wavelength of visible light—cannot normally be resolved; points closer 
than this are not distinguishable and appear as a single blur. 


In recent years, however, new types of light microscope called 
fluorescence microscopes have been developed that use sophisticated 
methods of illumination and electronic image processing to see fluores- 
cently labeled cell components in much finer detail (Figure 1-7B). The 
most recent super-resolution fluorescence microscopes, for example, can 
push the limits of resolution down even further, to about 20 nanometers 
(nm). That is the size of a single ribosome, a large macromolecular com- 
plex in which RNAs are translated into proteins. These super-resolution 
techniques are described further in Panel 1-1 (pp. 12-13). 


The Fine Structure of a Cell Is Revealed by Electron 
Microscopy 


For the highest magnification and best resolution, one must turn to an 
electron microscope, which can reveal details down to a few nano- 
meters. Preparing cell samples for the electron microscope is a painstak- 
ing process. Even for light microscopy, a tissue often has to be fixed (that 
is, preserved by pickling in a reactive chemical solution), supported by 
embedding in a solid wax or resin, cut, or sectioned, into thin slices, and 
stained before it is viewed. (The tissues in Figure 1-6 were prepared in 
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Figure 1-7 Some of the internal 
structures of a cell can be seen with a 
light microscope. (A) A cell taken from 
human skin and grown in culture was 
photographed through a light microscope 
using interference-contrast optics 
(described in Panel 1-1, pp. 12-13). The 
nucleus is especially prominent, as is the 
small, round nucleolus within it (discussed 
in Chapter 5 and see Panel 1-2, p. 25). 

(B) A pigment cell from a frog, stained with 
fluorescent dyes and viewed with a confocal 
fluorescence microscope (discussed in 
Panel 1-1). The nucleus is shown in purple, 
the pigment granules in red, and the 
microtubules—a class of protein filaments 
in the cytoplasm—in green. (A, courtesy of 
Casey Cunningham; B, courtesy of Stephen 
Rogers and the Imaging Technology Group 
of the Beckman Institute, University of 
llinois, Urbana.) 
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The fine structure of a cell 
can be seen in a transmission electron 
microscope. (A) Thin section of a liver cell 
showing the enormous amount of detail 
that is visible. Some of the components 
to be discussed later in the chapter are 
labeled; they are identifiable by their size, 
location, and shape. (B) A small region of 
the cytoplasm at higher magnification. The 
smallest structures that are clearly visible 
are the ribosomes, each of which is made 
of 80-90 or so individual protein and RNA 
molecules; some of the ribosomes are free 
in the cytoplasm, while others are bound 
to a membrane-enclosed organelle—the 
endoplasmic reticulum—discussed later 
(see Figure 1-22). (C) Portion of a long, 
threadlike DNA molecule isolated from a 
cell and viewed by electron microscopy. 

(A and B, by permission of E.L. Bearer and 
Daniel S. Friend; C, courtesy of Mei Lie Wong.) 
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this way.) For electron microscopy, similar procedures are required, but 
the sections have to be much thinner and there is no possibility of look- 
ing at living cells. 


When thin sections are cut, stained with electron-dense heavy metals, 
and placed in the electron microscope, much of the jumble of cell com- 
ponents becomes sharply resolved into distinct organelles—separate, 
recognizable substructures with specialized functions that are often only 
hazily defined with a conventional light microscope. A delicate mem- 
brane, only about 5 nm thick, is visible enclosing the cell, and similar 
membranes form the boundary of many of the organelles inside ( 

). The plasma membrane separates the interior of the cell 
from its external environment, while internal membranes surround most 
organelles. All of these membranes are only two molecules thick (as dis- 
cussed in Chapter 11). With an electron microscope, even individual large 
molecules can be seen ( ). 


The type of electron microscope used to look at thin sections of tissue is 
known as a transmission electron microscope. This instrument is, in prin- 
ciple, similar to a light microscope, except that it transmits a beam of 
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electrons rather than a beam of light through the sample. Another type 
of electron microscope—the scanning electron microscope—scatters elec- 
trons off the surface of the sample and so is used to look at the surface 
detail of cells and other structures. These techniques, along with the dif- 
ferent forms of light microscopy, are reviewed in Panel 1-1 (pp. 12-13). 


Even the most powerful electron microscopes, however, cannot visu- 
alize the individual atoms that make up biological molecules (Figure 
1-9). To study the cell's key components in atomic detail, biologists 
have developed even more sophisticated tools. Techniques such as x-ray 
crystallography or cryoelectron microscopy, for example, can be used to 
determine the precise positioning of atoms within the three-dimensional 
structure of protein molecules and complexes (discussed in Chapter 4). 


THE PROKARYOTIC CELL 


Of all the types of cells that have been examined microscopically, bacteria 
have the simplest structure and come closest to showing us life stripped 
down to its essentials. Indeed, a bacterium contains no organelles other 
than ribosomes—not even a nucleus to hold its DNA. This property—the 
presence or absence of a nucleus—is used as the basis for a simple but fun- 
damental classification of all living things. Organisms whose cells have a 
nucleus are called eukaryotes (from the Greek words eu, meaning “well” 
or “truly,” and karyon, a “kernel” or “nucleus”). Organisms whose cells do 
not have a nucleus are called prokaryotes (from pro, meaning “before’). 


The Prokaryotic Cell 


Figure 1-9 How big are cells and their 
components? (A) This chart lists sizes 

of cells and their component parts, the 

units in which they are measured, and the 
instruments needed to visualize them. 

(B) Drawings convey a sense of scale 
between living cells and atoms. Each panel 
shows an image that is magnified by a 
factor of 10 compared to its predecessor— 
producing an imaginary progression 

from a thumb, to skin, to skin cells, to 

a mitochondrion, to a ribosome, and 
ultimately to a cluster of atoms forming 

part of one of the many protein molecules 
in our bodies. Note that ribosomes are 
present inside mitochondria (as shown 
here), as well as in the cytoplasm. Details of 
molecular structure, as shown in the last two 
bottom panels, are beyond the power of the 
electron microscope. 
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Courtesy of Andrew Davis. 


A conventional light microscope 
allows us to magnify cells up to 
1000 times and to resolve details 
as small as 0.2 um (200 nm), a 
limitation imposed by the 
wavelike nature of light, not by 
the quality of the lenses. Three 
things are required for viewing 
cells in a light microscope. First, a 
bright light must be focused onto 
the specimen by lenses in the 
condenser. Second, the specimen 
must be carefully prepared to 
allow light to pass through it. 
Third, an appropriate set of 
lenses (objective, tube, and 
eyepiece) must be arranged to 
focus an image of the specimen 
in the eye. 


FIXED SAMPLES 


Most tissues are neither small 
enough nor transparent enough 
to examine directly in the 
microscope. Typically, therefore, 
they are chemically fixed and cut 
into thin slices, or sections, that 
can be mounted on a glass 
microscope slide and subsequently 
stained to reveal different 
components of the cells. A stained 
section of a plant root tip is shown 
| here (D). 
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LOOKING AT 
LIVING CELLS 


The same unstained, living 
animal cell (fibroblast) in 
culture viewed with 
(A) the simplest, bright- 
field optics; 
(B) phase-contrast optics; 
(C) interference-contrast 
optics. 
The two latter systems 
exploit differences in the 
way light travels through 
regions of the cell with 
differing refractive indices. 
All three images can be 
obtained on the same 
microscope simply by 
interchanging optical 
components. 
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Fluorescent dyes used for staining cells are detected with the 
aid of a fluorescence microscope. This is similar to an ordinary 
light microscope, except that the illuminating light is passed 
through two sets of filters (yellow). The first (@) filters the 
light before it reaches the specimen, passing only those 
wavelengths that excite the particular fluorescent dye. The 
second (‘®@) blocks out this light and passes only those 
wavelengths emitted when the dye fluoresces. Dyed objects 
show up in bright color on a dark background. 


FLUORESCENT 
PROBES 


Fluorescent molecules 
absorb light at one 
wavelength and emit 
it at another, longer 
wavelength. Some 
fluorescent dyes bind 
specifically to particular 
molecules in cells and 
can reveal their 
location when the cells 
are examined with a 
fluorescence microscope. 
In these dividing nuclei in a fly embryo, the stain for DNA 
fluoresces blue. Other dyes can be coupled to antibody 
molecules, which then serve as highly specific staining reagents 
that bind selectively to particular molecules, showing their 
distribution in the cell. Because fluorescent dyes emit light, they 
allow objects even smaller than 0.2 um to be seen. Here, a 
microtubule protein in the mitotic spindle (see Figure 1-28) is 
stained green with a fluorescent antibody. 


Courtesy of William Sullivan. 
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CONFOCAL FLUORESCENCE MICROSCOPY 


A confocal microscope is a specialized 
type of fluorescence microscope that 
builds up an image by scanning the 
specimen with a laser beam. The beam 
is focused onto a single point at a 
specific depth in the specimen, and a 
pinhole aperture in the detector allows 
only fluorescence emitted from this 
same point to be included in the image. L 
Scanning the beam across the specimen generates 2 um 
a sharp image of the plane of focus—an optical section. 

A series of optical sections at different depths allows a 
three-dimensional image to be constructed, such as this 
highly branched mitochondrion in a living yeast cell. 


Courtesy of Stefan Hell. 


SUPER-RESOLUTION FLUORESCENCE MICROSCOPY 


Several recent and ingenious techniques have allowed fluorescence microscopes to 
break the usual resolution limit of 200 nm. One such technique uses a sample that is 
labeled with molecules whose fluorescence can be reversibly switched on and off by 
different colored lasers. The specimen is scanned by a nested set of two laser beams, in 
which the central beam excites fluorescence in a very small spot of the sample, while a 
second beam—wrapped around the first—switches off fluorescence in the surrounding 
area. A related approach allows the positions of individual fluorescent molecules to be 
accurately mapped while others nearby are switched off. Both approaches slowly build 
up an image with a resolution as low as 20 nm. These new super-resolution methods 
are being extended into 3-D imaging and real-time live cell imaging. 
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Courtesy of Carl Zeiss Microscopy, LLC. 


1 um 


Microtubules viewed with conventional fluorescence 
microscope (/eft) and with super-resolution optics (right). In the 
super-resolution image, the microtubule can be clearly seen at 
the actual size, which is only 25 nm in diameter. 
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Courtesy of Andrew Davis. 


beam 
deflector 


objective 
lens 


electrons 
from 
screen specimen 


detector , 
— specimen 


lens 3 
é 
© 

specimen & 
$ 
> 

objective £ 

lens 2 
v 

The electron micrograph below f 
projector shows a small region of a cell in 

jens a thin section of testis. The tissue 

has been chemically fixed, 

viewing embedded in plastic, and cut 

screen or into very thin sections that have 

protograpnie then been stained with salts of 

iim 


uranium and lead. 


2 


age 
Ce oY f 


At 


S3 


KE. 


Courtesy of Daniel S. Friend. 


The transmission electron microscope (TEM) is in principle similar to 
a light microscope, but it uses a beam of electrons, whose 
wavelength is very short, instead of a beam of light, and magnetic 
coils to focus the beam instead of glass lenses. Because of the very 
small wavelength of electrons, the specimen must be very thin. 
Contrast is usually introduced by staining the specimen with 
electron-dense heavy metals. The specimen is then placed in a 
vacuum in the microscope. The TEM has a useful magnification of 
up to a million-fold and can resolve details as small as about 1 nm 
in biological specimens. 


In the scanning electron microscope (SEM), the specimen, which 
has been coated with a very thin film of a heavy metal, is scanned 
by a beam of electrons brought to a focus on the specimen by 
magnetic coils that act as lenses. The quantity of electrons 
scattered or emitted as the beam bombards each successive point 
on the surface of the specimen is measured by the detector, and is 
used to control the intensity of successive points in an image built 
up on a video screen. The microscope creates striking images of 
three-dimensional objects with great depth of focus and can 
resolve details down to somewhere between 3 nm and 20 nm, 
depending on the instrument. 


L~ |] 
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Scanning electron 
micrograph of stereocilia 
projecting from a hair 
cell in the inner ear (left). 
For comparison, the same 
structure is shown by light 
microscopy, at the limit of 
its resolution (above). 


Courtesy of Richard Jacobs and James Hudspeth. 
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CHAPTER 1 


Figure 1-10 Bacteria come in different 


shapes and sizes. Typical spherical, rodlike, 


and spiral-shaped bacteria are drawn 
to scale. The spiral cells shown are the 
organisms that cause syphilis. 


QUESTION 1-4 


A bacterium weighs about 10712 g 
and can divide every 20 minutes. 

If a single bacterial cell carried on 
dividing at this rate, how long would 
it take before the mass of bacteria 
would equal that of the Earth 

(6 x 1024 kg)? Contrast your result 
with the fact that bacteria originated 
at least 3.5 billion years ago and 
have been dividing ever since. 
Explain the apparent paradox. (The 
number of cells N in a culture at 
time t is described by the equation 
N = No x 2”G, where No is the 
number of cells at zero time, and 

G is the population doubling time.) 


Figure 1-11 The bacterium Escherichia 
coli (E. coli) has served as an important 
model organism. An electron micrograph 
of a longitudinal section is shown here; 

the cell’s DNA is concentrated in the lightly 
stained region. Note that E. coli has an 
outer membrane and an inner (plasma) 
membrane, with a thin cell wall in between. 
The many flagella distributed over its 
surface are not visible in this micrograph. 
(Courtesy of E. Kellenberger.) 
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Prokaryotes are typically spherical, rodlike, or corkscrew-shaped (Figure 
1-10). They are also small—generally just a few micrometers long, 
although some giant species are as much as 100 times longer than this. 
Prokaryotes often have a tough protective coat, or cell wall, surrounding 
the plasma membrane, which encloses a single compartment containing 
the cytoplasm and the DNA. In the electron microscope, the cell inte- 
rior typically appears as a matrix of varying texture, without any obvious 
organized internal structure (Figure 1-11). The cells reproduce quickly by 
dividing in two. Under optimum conditions, when food is plentiful, many 
prokaryotic cells can duplicate themselves in as little as 20 minutes. In 
only 11 hours, a single prokaryote can therefore give rise to more than 8 
billion progeny (which exceeds the total number of humans currently on 
Earth). Thanks to their large numbers, rapid proliferation, and ability to 
exchange bits of genetic material by a process akin to sex, populations 
of prokaryotic cells can evolve fast, rapidly acquiring the ability to use a 
new food source or to resist being killed by a new antibiotic. 


In this section, we offer an overview of the world of prokaryotes. Despite 
their simple appearance, these organisms lead sophisticated lives—occu- 
pying a stunning variety of ecological niches. We will also introduce the 
two distinct classes into which prokaryotes are divided: bacteria and 
archaea (singular, archaeon). Although they are structurally indistin- 
guishable, archaea and bacteria are only distantly related. 


Prokaryotes Are the Most Diverse and Numerous Cells 
on Earth 


Most prokaryotes live as single-celled organisms, although some join 
together to form chains, clusters, or other organized, multicellular struc- 
tures. In shape and structure, prokaryotes may seem simple and limited, 
but in terms of chemistry, they are the most diverse class of cells on the 
planet. Members of this class exploit an enormous range of habitats, from 
hot puddles of volcanic mud to the interiors of other living cells, and they 
vastly outnumber all eukaryotic organisms on Earth. Some are aerobic, 
using oxygen to oxidize food molecules; some are strictly anaerobic and 
are killed by the slightest exposure to oxygen. As we discuss later in this 
chapter, mitochondria—the organelles that generate energy in eukary- 
otic cells—are thought to have evolved from aerobic bacteria that took 
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to living inside the anaerobic ancestors of today’s eukaryotic cells. Thus 
our own oxygen-based metabolism can be regarded as a product of the 
activities of bacterial cells. 


Virtually any organic, carbon-containing material—from wood to petro- 
leum—can be used as food by one sort of bacterium or another. Even 
more remarkably, some prokaryotes can live entirely on inorganic sub- 
stances: they can get their carbon from COz in the atmosphere, their 
nitrogen from atmospheric N2, and their oxygen, hydrogen, sulfur, and 
phosphorus from air, water, and inorganic minerals. Some of these 
prokaryotic cells, like plant cells, perform photosynthesis, using energy 
from sunlight to produce organic molecules from CO; (Figure 1-12); oth- 
ers derive energy from the chemical reactivity of inorganic substances 
in the environment (Figure 1-13). In either case, such prokaryotes play 
a unique and fundamental part in the economy of life on Earth, as other 
living organisms depend on the organic compounds that these cells gen- 
erate from inorganic materials. 


Plants, too, can capture energy from sunlight and carbon from atmos- 
pheric CO2. But plants unaided by bacteria cannot capture N2 from the 
atmosphere. In a sense, plants even depend on bacteria for photosynthe- 
sis: as we discuss later, it is almost certain that the organelles in the plant 
cell that perform photosynthesis—the chloroplasts—have evolved from 
photosynthetic bacteria that long ago found a home inside the cytoplasm 
of a plant-cell ancestor. 


The World of Prokaryotes Is Divided into Two Domains: 
Bacteria and Archaea 


Traditionally, all prokaryotes have been classified together in one large 
group. But molecular studies have determined that there is a gulf within 
the class of prokaryotes, dividing it into two distinct domains—the bacte- 
ria and the archaea—which are thought to have diverged from a common 
prokaryotic ancestor approximately 3.5 billion years ago. Remarkably, 
DNA sequencing reveals that, at a molecular level, the members of these 
two domains differ as much from one another as either does from the 
eukaryotes. Most of the prokaryotes familiar from everyday life—the spe- 
cies that live in the soil or make us ill—are bacteria. Archaea are found 
not only in these habitats but also in environments that are too hostile 
for most other cells: concentrated brine, the hot acid of volcanic springs, 
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Figure 1-12 Some bacteria are 
photosynthetic. (A) Anabaena cylindrica 
forms long, multicellular chains. This light 
micrograph shows specialized cells that 
either fix nitrogen (that is, capture N2 
from the atmosphere and incorporate 

it into organic compounds; labeled H), 
fix CO2 through photosynthesis (labeled 
V), or become resistant spores (labeled 
S) that can survive under unfavorable 
conditions. (B) An electron micrograph of 
a related species, Phormidium laminosum, 
shows the intracellular membranes where 
photosynthesis occurs. As shown in these 
micrographs, some prokaryotes can have 
intracellular membranes and form simple 
multicellular organisms. (A, courtesy of 
David Adams; B, courtesy of D.P. Hill and 
C.J. Howe.) 
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Figure 1-13 A sulfur bacterium gets its 
energy from H2S. Beggiatoa, a prokaryote 


that lives in sulfurous environments, oxidizes 


H2S to produce sulfur and can fix carbon 
even in the dark. In this light micrograph, 
yellow deposits of sulfur can be seen inside 
two of these bacterial cells. (Courtesy of 
Ralph S. Wolfe.) 
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Yeasts are simple, free- 
living eukaryotes. The cells shown in this 
micrograph belong to the species of yeast, 
Saccharomyces cerevisiae, used to make 
dough rise and turn malted barley juice 
into beer. As can be seen in this image, the 
cells reproduce by growing a bud and then 
dividing asymmetrically into a large mother 
cell and a small daughter cell; for this 
reason, they are called budding yeast. 


The nucleus contains most 
of the DNA in a eukaryotic cell. (A) This 
drawing of a typical animal cell shows its 
extensive system of membrane-enclosed 
organelles. The nucleus is colored brown, 
the nuclear envelope is green, and the 
cytoplasm (the interior of the cell outside 
the nucleus) is white. (B) An electron 
micrograph of the nucleus in a mammalian 
cell. Individual chromosomes are not visible 
because at this stage of the cell-division 
cycle the DNA molecules are dispersed as 
fine threads throughout the nucleus. 

(B, by permission of E.L. Bearer and 
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the airless depths of marine sediments, the sludge of sewage treatment 
plants, pools beneath the frozen surface of Antarctica, as well as in the 
acidic, oxygen-free environment of a cow's stomach, where they break 
down ingested cellulose and generate methane gas. Many of these 
extreme environments resemble the harsh conditions that must have 
existed on the primitive Earth, where living things first evolved before the 
atmosphere became rich in oxygen. 


THE EUKARYOTIC CELL 


Eukaryotic cells, in general, are bigger and more elaborate than bacte- 
ria and archaea. Some live independent lives as single-celled organisms, 
such as amoebae and yeasts ( ); others live in multicellular 
assemblies. All of the more complex multicellular organisms—including 
plants, animals, and fungi—are formed from eukaryotic cells. 


By definition, all eukaryotic cells have a nucleus. But possession of a 
nucleus goes hand-in-hand with possession of a variety of other orga- 
nelles, most of which are membrane-enclosed and common to all 
eukaryotic organisms. In this section, we take a look at the main orga- 
nelles found in eukaryotic cells from the point of view of their functions, 
and we consider how they came to serve the roles they have in the life of 
the eukaryotic cell. 


The Nucleus Is the Information Store of the Cell 


The nucleus is usually the most prominent organelle in a eukaryotic cell 
( ). It is enclosed within two concentric membranes that form 


nucleus nuclear envelope condensed chromosomes 


the nuclear envelope, and it contains molecules of DNA—extremely long 
polymers that encode the genetic information of the organism. In the 
light microscope, these giant DNA molecules become visible as individual 
chromosomes when they become more compact before a cell divides 
into two daughter cells (Figure 1-16). DNA also carries the genetic infor- 
mation in prokaryotic cells; these cells lack a distinct nucleus not because 
they lack DNA, but because they do not keep their DNA inside a nuclear 
envelope, segregated from the rest of the cell contents. 


Mitochondria Generate Usable Energy from Food 
Molecules 


Mitochondria are present in essentially all eukaryotic cells, and they are 
among the most conspicuous organelles in the cytoplasm (see Figure 
1-8B). In a fluorescence microscope, they appear as worm-shaped struc- 
tures that often form branching networks (Figure 1-17). When seen with 
an electron microscope, individual mitochondria are found to be enclosed 
in two separate membranes, with the inner membrane formed into folds 
that project into the interior of the organelle (Figure 1-18). 


Microscopic examination by itself, however, gives little indication of 
what mitochondria do. Their function was discovered by breaking open 
cells and then spinning the soup of cell fragments in a centrifuge; this 
treatment separates the organelles according to their size and density. 
Purified mitochondria were then tested to see what chemical processes 
they could perform. This revealed that mitochondria are generators of 
chemical energy for the cell. They harness the energy from the oxidation 
of food molecules, such as sugars, to produce adenosine triphosphate, 
or ATP—the basic chemical fuel that powers most of the cell’s activities. 
Because the mitochondrion consumes oxygen and releases CO2 in the 
course of this activity, the entire process is called cell respiration—essen- 
tially, breathing at the level of a cell. Without mitochondria, animals, 
fungi, and plants would be unable to use oxygen to extract the energy 
they need from the food molecules that nourish them. The process of cell 
respiration is considered in detail in Chapter 14. 


Mitochondria contain their own DNA and reproduce by dividing. Because 
they resemble bacteria in so many ways, they are thought to derive from 
bacteria that were engulfed by some ancestor of present-day eukaryotic 


Figure 1-17 Mitochondria can vary in shape and size. This 
budding yeast cell, which contains a green fluorescent protein in its 
mitochondria, was viewed in a super-resolution confocal fluorescence 
microscope. In this three-dimensional image, the mitochondria are 
seen to form complex branched networks. (From A. Egner, S. Jakobs, 
and S.W. Hell, Proc. Natl. Acad. Sci. U.S.A 99:3370-3375, 2002. With 
permission from National Academy of Sciences.) 
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Figure 1-16 Chromosomes become 
visible when a cell is about to divide. 

As a eukaryotic cell prepares to divide, its 
DNA molecules become progressively more 
compacted (condensed), forming wormlike 
chromosomes that can be distinguished 

in the light microscope (see also Figure 
1-5). The photographs here show three 
successive steps in this chromosome 
condensation process in a cultured cell 
from a newt's lung; note that in the last 
micrograph on the right, the nuclear 
envelope has broken down. (Courtesy of 
Conly L. Rieder, Albany, New York.) 
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Figure 1-18 Mitochondria have a 
distinctive internal structure. (A) An 
electron micrograph of a cross section 

of a mitochondrion reveals the extensive 
infolding of the inner membrane. 

(B) This three-dimensional representation 
of the arrangement of the mitochondrial 
membranes shows the smooth outer 
membrane (gray) and the highly convoluted 
inner membrane (red). The inner membrane 
contains most of the proteins responsible 
for energy production in eukaryotic cells; it 
is highly folded to provide a large surface 
area for this activity. (C) In this schematic 
cell, the innermost compartment of the 
mitochondrion is colored orange. 

(A, courtesy of Daniel S. Friend, by 
permission of E.L. Bearer.) 


Figure 1-19 Mitochondria are thought to 
have evolved from engulfed bacteria. It is 
virtually certain that mitochondria evolved 
from aerobic bacteria that were engulfed 

by an archaea-derived, early anaerobic 
eukaryotic cell and survived inside it, living 
in symbiosis with their host. As shown in this 
model, the double membrane of present- 
day mitochondria is thought to have been 
derived from the plasma membrane and 
outer membrane of the engulfed bacterium; 
the membrane derived from the plasma 
membrane of the engulfing ancestral cell 
was ultimately lost. 


inner membrane 


cells (Figure 1-19). This evidently created a symbiotic relationship in 
which the host eukaryote and the engulfed bacterium helped each other 
to survive and reproduce. 


Chloroplasts Capture Energy from Sunlight 


Chloroplasts are large, green organelles that are found in the cells of 
plants and algae, but not in the cells of animals or fungi. These organelles 
have an even more complex structure than mitochondria: in addition to 
their two surrounding membranes, they possess internal stacks of mem- 
branes containing the green pigment chlorophyll (Figure 1-20). 
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Chloroplasts carry out photosynthesis—trapping the energy of sun- 
light in their chlorophyll molecules and using this energy to drive the 
manufacture of energy-rich sugar molecules. In the process, they release 
oxygen as a molecular by-product. Plant cells can then extract this stored 
chemical energy when they need it, in the same way that animal cells do: 
by oxidizing these sugars and their breakdown products, mainly in the 
mitochondria. Chloroplasts thus enable plants to get their energy directly 
from sunlight. They also allow plants to produce the food molecules— 
and the oxygen—that mitochondria use to generate chemical energy 
in the form of ATP. How these organelles work together is discussed in 
Chapter 14. 


Like mitochondria, chloroplasts contain their own DNA, reproduce by 
dividing in two, and are thought to have evolved from bacteria—in this 
case, from photosynthetic bacteria that were engulfed by an early aerobic 
eukaryotic cell (Figure 1-21). 


Internal Membranes Create Intracellular Compartments 
with Different Functions 


Nuclei, mitochondria, and chloroplasts are not the only membrane- 
enclosed organelles inside eukaryotic cells. The cytoplasm contains a 
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Figure 1-20 Chloroplasts in plant cells 
capture the energy of sunlight. (A) A 
single cell isolated from a leaf of a flowering 
plant, seen in the light microscope, showing 
many green chloroplasts. (B) A drawing 

of one of the chloroplasts, showing the 
inner and outer membranes, as well as the 
highly folded system of internal membranes 
containing the green chlorophyll molecules 
that absorb light energy. (A, courtesy of 
Preeti Dahiya.) 


Figure 1-21 Chloroplasts almost certainly 
evolved from engulfed photosynthetic 
bacteria. The bacteria are thought to have 
been taken up by early eukaryotic cells that 
already contained mitochondria. 
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Figure 1-22 The endoplasmic reticulum 
produces many of the components of a 
eukaryotic cell. (A) Schematic diagram of an 
animal cell shows the endoplasmic reticulum 
(ER) in green. (B) Electron micrograph of a 
thin section of a mammalian pancreatic cell 
shows a small part of the ER, of which there 
are vast amounts in this cell type, which is 
specialized for protein secretion. Note that 
the ER is continuous with the membranes 

of the nuclear envelope. The black particles 
studding the region of the ER (and nuclear 
envelope) shown here are ribosomes, 
structures that translate RNAs into proteins. 
Because of its appearance, ribosome-coated 
ER is often called “rough ER” to distinguish 
it from the “smooth ER,” which does not 
have ribosomes bound to it. (B, courtesy of 
Lelio Orci.) 


Figure 1-23 The Golgi apparatus is 
composed of a stack of flattened, 
membrane-enclosed discs. (A) Schematic 
diagram of an animal cell with the Golgi 
apparatus colored red. (B) More realistic 
drawing of the Golgi apparatus. Some of 
the vesicles seen nearby have pinched off 
from the Golgi stack; others are destined to 
fuse with it. Only one stack is shown here, 
but several can be present in a cell. 

(C) Electron micrograph that shows the 
Golgi apparatus from a typical animal cell. 
(C, courtesy of Brij L. Gupta.) 
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profusion of other organelles that are surrounded by single membranes 
(see Figure 1-8A). Most of these structures are involved with the cell’s 
ability to import raw materials and to export both useful substances and 
waste products that are produced by the cell (a topic we discuss in detail 
in Chapter 12). 


The endoplasmic reticulum (ER) is an irregular maze of interconnected 
spaces enclosed by a membrane (Figure 1-22). It is the site where most 
cell-membrane components, as well as materials destined for export 
from the cell, are made. This organelle is enormously enlarged in cells 
that are specialized for the secretion of proteins. Stacks of flattened, 
membrane-enclosed sacs constitute the Golgi apparatus (Figure 1-23), 
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which modifies and packages molecules made in the ER that are destined 
to be either secreted from the cell or transported to another cell com- 
partment. Lysosomes are small, irregularly shaped organelles in which 
intracellular digestion occurs, releasing nutrients from ingested food par- 
ticles into the cytosol and breaking down unwanted molecules for either 
recycling within the cell or excretion from the cell. Indeed, many of the 
large and small molecules within the cell are constantly being broken 
down and remade. Peroxisomes are small, membrane-enclosed vesicles 
that provide a sequestered environment for a variety of reactions in which 
hydrogen peroxide is used to inactivate toxic molecules. Membranes also 
form many types of small transport vesicles that ferry materials between 
one membrane-enclosed organelle and another. All of these membrane- 
enclosed organelles are highlighted in Figure 1-24A. 


A continual exchange of materials takes place between the endoplasmic 
reticulum, the Golgi apparatus, the lysosomes, the plasma membrane, 
and the outside of the cell. The exchange is mediated by transport vesi- 
cles that pinch off from the membrane of one organelle and fuse with 
another, like tiny soap bubbles that bud from and combine with other 
bubbles. At the surface of the cell, for example, portions of the plasma 
membrane tuck inward and pinch off to form vesicles that carry material 
captured from the external medium into the cell—a process called endo- 
cytosis (Figure 1-25). Animal cells can engulf very large particles, or even 
entire foreign cells, by endocytosis. In the reverse process, called exocy- 
tosis, vesicles from inside the cell fuse with the plasma membrane and 
release their contents into the external medium (see Figure 1-25); most 
of the hormones and signal molecules that allow cells to communicate 
with one another are secreted from cells by exocytosis. How membrane- 
enclosed organelles move proteins and other molecules from place to 
place inside the eukaryotic cell is discussed in detail in Chapter 15. 


The Cytosol Is a Concentrated Aqueous Gel of Large 
and Small Molecules 


If we were to strip the plasma membrane from a eukaryotic cell and 
remove all of its membrane-enclosed organelles—including the nucleus, 
endoplasmic reticulum, Golgi apparatus, mitochondria, chloroplasts, and 
so on—we would be left with the cytosol (Figure 1—24B). In other words, 
the cytosol is the part of the cytoplasm that is not contained within 
intracellular membranes. In most cells, the cytosol is the largest single 
compartment. It contains a host of large and small molecules, crowded 
together so closely that it behaves more like a water-based gel than a 
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Figure 1-24 Membrane-enclosed 
organelles are distributed throughout the 
eukaryotic cell cytoplasm. (A) The various 
types of membrane-enclosed organelles, 
shown in different colors, are each 
specialized to perform a different function. 
(B) The cytoplasm that fills the space outside 
of these organelles is called the cytosol 
(colored blue). 
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Figure 1-25 Eukaryotic cells engage in 
continual endocytosis and exocytosis 
across their plasma membrane. They 
import extracellular materials by endocytosis 
and secrete intracellular materials by 
exocytosis. Endocytosed material is first 
delivered to membrane-enclosed organelles 
called endosomes (discussed in Chapter 15). 
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Figure 1-26 The cytosol is extremely 
crowded. This atomically detailed model 
of the cytosol of E. coli is based on the 
sizes and concentrations of 50 of the most 
abundant large molecules present in the 
bacterium. RNAs, proteins, and ribosomes 
are shown in different colors (Movie 1.2). 
(From S.R. McGuffee and A.H. Elcock, PLoS 
Comput. Biol. 6:e1000694, 2010.) 


QUESTION 1-5 


Suggest a reason why it would be 
advantageous for eukaryotic cells to 
evolve elaborate internal membrane 
systems that allow them to import 
substances from the outside, as 
shown in Figure 1-25. 


Figure 1-27 The cytoskeleton is a 
network of protein filaments that can 
be seen criss-crossing the cytoplasm of 
eukaryotic cells. The three major types of 
filaments can be detected using different 
fluorescent stains. Shown here are (A) 
actin filaments, (B) microtubules, and 

(C) intermediate filaments. Intermediate 
filaments are not found in the cytoplasm of 
cells with cell walls, such as plant cells. 

(A, Molecular Expressions at Florida State 
University; B, courtesy of Nancy Kedersha; 
C, courtesy of Clive Lloyd.) 
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liquid solution (Figure 1-26). The cytosol is the site of many chemical 
reactions that are fundamental to the cell’s existence. The early steps in 
the breakdown of nutrient molecules take place in the cytosol, for exam- 
ple, and it is here that most proteins are made by ribosomes. 


The Cytoskeleton Is Responsible for Directed Cell 
Movements 


The cytosol is not just a structureless soup of chemicals and organelles. 
Using an electron microscope, one can see that in eukaryotic cells the 
cytosol is criss-crossed by long, fine filaments. Frequently, the filaments 
are seen to be anchored at one end to the plasma membrane or to radi- 
ate out from a central site adjacent to the nucleus. This system of protein 
filaments, called the cytoskeleton, is composed of three major filament 
types (Figure 1-27). The thinnest of these filaments are the actin filaments; 
they are abundant in all eukaryotic cells but occur in especially large 
numbers inside muscle cells, where they serve as a central part of the 
machinery responsible for muscle contraction. The thickest filaments in 
the cytosol are called microtubules (see Figure 1-7B), because they have 
the form of minute hollow tubes; in dividing cells, they become reorgan- 
ized into a spectacular array that helps pull the duplicated chromosomes 
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duplicated Figure 1-28 Microtubules help segregate 

chromosomes the chromosomes in a dividing animal 
cell. A transmission electron micrograph 
and schematic drawing show duplicated 
chromosomes attached to the microtubules 
of a mitotic spindle (discussed in Chapter 
18). When a cell divides, its nuclear 
envelope breaks down and its DNA 
condenses into visible chromosomes, each 
of which has duplicated to form a pair of 
conjoined chromosomes that will ultimately 
be pulled apart into separate daughter cells 
by the spindle microtubules. See also 
Panel 1-1, pp. 12-13. (Photomicrograph 
courtesy of Conly L. Rieder, Albany, 
New York.) 


microtubules 


apart and distribute them equally to the two daughter cells (Figure 1-28). 
Intermediate in thickness between actin filaments and microtubules are 
the intermediate filaments, which serve to strengthen most animal cells. 
These three types of filaments, together with other proteins that attach to 
them, form a system of girders, ropes, and motors that gives the cell its 
mechanical strength, controls its shape, and drives and guides its move- 
ments (Movie 1.3 and Movie 1.4). 


Because the cytoskeleton governs the internal organization of the cell as 
well as its external features, it is as necessary to a plant cell—boxed in 
by a tough cell wall—as it is to an animal cell that freely bends, stretches, 
swims, or crawls. In a plant cell, for example, organelles such as mito- 
chondria are driven in a constant stream around the cell interior along 
cytoskeletal tracks (Movie 1.5). And animal cells and plant cells alike 
depend on the cytoskeleton to separate their internal components into 
two daughter cells during cell division (see Figure 1-28). 


The cytoskeleton’s role in cell division may be its most ancient func- 
tion. Even bacteria contain proteins that are distantly related to those 
that form the cytoskeletal elements involved in eukaryotic cell division; 
in bacteria, these proteins also form filaments that play a part in cell divi- 
sion. We examine the cytoskeleton in detail in Chapter 17, discuss its role 
in cell division in Chapter 18, and review how it responds to signals from 
outside the cell in Chapter 16. 


The Cytosol Is Far from Static 


The cell interior is in constant motion. The cytoskeleton is a dynamic jun- 
gle of protein ropes that are continually being strung together and taken 
apart; its filaments can assemble and then disappear in a matter of min- 
utes. Motor proteins use the energy stored in molecules of ATP to trundle 
along these tracks and cables, carrying organelles and proteins through- 
out the cytoplasm, and racing across the width of the cell in seconds. In 
addition, the large and small molecules that fill every free space in the cell 
are knocked to and fro by random thermal motion, constantly colliding 
with one another and with other structures in the cell's crowded cytosol. 


Of course, neither the bustling nature of the cell’s interior nor the details 
of cell structure were appreciated when scientists first peered at cells 
in a microscope; our knowledge of cell structure accumulated slowly. 
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TABLE 1-1 HISTORICAL LANDMARKS IN DETERMINING CELL STRUCTURE 


1665 
1674 
1833 
1839 


1857 
1879 
1881 


1898 
1902 
1952 


1957 


1960 


1965 


1968 
1970 


1974 
1994 


1990s- 
2000s 


QUESTION 1-6 


Discuss the relative advantages and 
disadvantages of light and electron 
microscopy. How could you best 
visualize a living skin cell, a yeast 
mitochondrion, a bacterium, and 


Hooke uses a primitive microscope to describe small chambers in sections of cork that he calls “cells” 
Leeuwenhoek reports his discovery of protozoa. Nine years later, he sees bacteria for the first time 
Brown publishes his microscopic observations of orchids, clearly describing the cell nucleus 


Schleiden and Schwann propose the cell theory, stating that the nucleated cell is the universal building block of plant and 
animal tissues 


Kölliker describes mitochondria in muscle cells 
Flemming describes with great clarity chromosome behavior during mitosis in animal cells 


Cajal and other histologists develop staining methods that reveal the structure of nerve cells and the organization of 
neural tissue 


Golgi first sees and describes the Golgi apparatus by staining cells with silver nitrate 
Boveri links chromosomes and heredity by observing chromosome behavior during sexual reproduction 


Palade, Porter, and Sjöstrand develop methods of electron microscopy that enable many intracellular structures to be 
seen for the first time. In one of the first applications of these techniques, Huxley shows that muscle contains arrays of 
protein filaments—the first evidence of a cytoskeleton 


Robertson describes the bilayer structure of the cell membrane, seen for the first time in the electron microscope 


Kendrew describes the first detailed protein structure (sperm whale myoglobin) to a resolution of 0.2 nm using x-ray 
crystallography. Perutz proposes a lower-resolution structure for hemoglobin 


de Duve and his colleagues use a cell-fractionation technique to separate peroxisomes, mitochondria, and lysosomes 
from a preparation of rat liver 


Petran and collaborators make the first confocal microscope 


Frye and Edidin use fluorescent antibodies to show that plasma membrane molecules can diffuse in the plane of the 
membrane, indicating that cell membranes are fluid 


Lazarides and Weber use fluorescent antibodies to stain the cytoskeleton 


Chalfie and collaborators introduce green fluorescent protein (GFP) as a marker to follow the behavior of proteins in living 
cells 


Betzig, Hell, and Moerner develop techniques for super-resolution fluorescence microscopy that allow observation of 
biological molecules too small to be resolved by conventional light or fluorescence microscopy 


A few of the key discoveries are listed in Table 1-1. In addition, Panel 
1-2 (p. 25) summarizes the main differences between animal, plant, and 
bacterial cells. 


Eukaryotic Cells May Have Originated as Predators 


Eukaryotic cells are typically 10 times the length and 1000 times the vol- 
ume of prokaryotic cells, although there is huge size variation within each 
category. They also possess a whole collection of features—a nucleus, 
a versatile cytoskeleton, mitochondria, and other organelles—that set 
them apart from bacteria and archaea. 


When and how eukaryotes evolved these systems remains something of a 
mystery. Although eukaryotes, bacteria, and archaea must have diverged 
from one another very early in the history of life on Earth (discussed in 
Chapter 14), the eukaryotes did not acquire all of their distinctive features 
at the same time (Figure 1-29). According to one theory, the ancestral 
eukaryotic cell was a predator that fed by capturing other cells. Such a 
way of life requires a large size, a flexible membrane, and a cytoskel- 


a mirena? eton to help the cell move and eat. The nuclear compartment may have 


evolved to keep the DNA segregated from this physical and chemical 
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The animal cell drawing is , 

based on a fibroblast, a cell co 
flagellum that inhabits connective tissue 
and deposits extracellular nuclear 
matrix. A micrograph of a ar 
living fibroblast is shown in 
Figure 1-7A. The plant cell 
drawing is typical of a young 
leaf cell. The bacterium shown microtubule 
is rod-shaped and has a single 
flagellum for motility. A 
comparison of the scale bars vacuole 
reveals the bacterium’s (fluid-filled) 
relatively small size. 
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Figure 1-29 Where did eukaryotes 
come from? The eukaryotic, bacterial, 
and archaean lineages diverged from one 
another more than 3 billion years ago— 
very early in the evolution of life on Earth. 
Some time later, eukaryotes are thought 
to have acquired mitochondria; later still, a 
subset of eukaryotes acquired chloroplasts. 
Mitochondria are essentially the same in 
plants, animals, and fungi, and therefore 
were presumably acquired before these 
lines diverged about 1.5 billion years ago. 


Figure 1-30 One protozoan eats another. 
(A) The scanning electron micrograph shows 
Didinium on its own, with its circumferential 
rings of beating cilia and its “snout” at the 
top. (B) Didinium is seen ingesting another 
ciliated protozoan, a Paramecium, artificially 
colored yellow. (Courtesy of D. Barlow.) 
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hurly-burly, so as to allow more delicate and complex control of the way 
the cell reads out its genetic information. 


Such a primitive eukaryotic cell, with a nucleus and cytoskeleton, was 
most likely the sort of cell that engulfed the free-living, oxygen-consum- 
ing bacteria that were the likely ancestors of the mitochondria (see Figure 
1-19). This partnership is thought to have been established 1.5 billion 
years ago, when the Earth’s atmosphere first became rich in oxygen. A 
subset of these cells later acquired chloroplasts by engulfing photosyn- 
thetic bacteria (see Figure 1-21). The likely history of these endosymbiotic 
events is illustrated in Figure 1-29. 


That single-celled eukaryotes can prey upon and swallow other cells 
is borne out by the behavior of many present-day protozoans: a class 
of free-living, motile, unicellular organisms. Didinium, for example, is a 
large, carnivorous protozoan with a diameter of about 150 pum—roughly 
10 times that of the average human cell. It has a globular body encircled 
by two fringes of cilia, and its front end is flattened except for a single 
protrusion rather like a snout (Figure 1-30A). Didinium swims at high 
speed by means of its beating cilia. When it encounters a suitable prey, 
usually another type of protozoan, it releases numerous small, para- 
lyzing darts from its snout region. Didinium then attaches to and devours 


(A) l 
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Model Organisms 


the other cell, inverting like a hollow ball to engulf its victim, which can 
be almost as large as itself (Figure 1-30B). 


Not all protozoans are predators. They can be photosynthetic or carnivo- 
rous, motile or sedentary. Their anatomy is often elaborate and includes 
such structures as sensory bristles, photoreceptors, beating cilia, stalklike 
appendages, mouthparts, stinging darts, and musclelike contractile bun- 
dles. Although they are single cells, protozoans can be as intricate and 
versatile as many multicellular organisms (Figure 1-31). Much remains 
to be learned about fundamental cell biology from studies of these fasci- 
nating life-forms. 


MODEL ORGANISMS 


All cells are thought to be descended from a common ancestor, whose 
fundamental properties have been conserved through evolution. Thus, 
knowledge gained from the study of one organism contributes to our 
understanding of others, including ourselves. But certain organisms are 
easier than others to study in the laboratory. Some reproduce rapidly and 
are convenient for genetic manipulations; others are multicellular but 
transparent, so the development of all their internal tissues and organs 
can be viewed directly in the live animal. For reasons such as these, biol- 
ogists have become dedicated to studying a few chosen species, pooling 
their knowledge to gain a deeper understanding than could be achieved if 
their efforts were spread over many different species. Although the roster 
of these representative organisms is continually expanding, a few stand 
out in terms of the breadth and depth of information that has been accu- 
mulated about them over the years—knowledge that contributes to our 
understanding of how all cells work. In this section, we examine some 
of these model organisms and review the benefits that each offers to 
the study of cell biology and, in many cases, to the promotion of human 
health. 


Molecular Biologists Have Focused on £. coli 


In molecular terms, we understand the workings of the bacterium 
Escherichia coli—E. coli for short—more thoroughly than those of any 
other living organism (see Figure 1-11). This small, rod-shaped cell nor- 
mally lives in the gut of humans and other vertebrates, but it also grows 
happily and reproduces rapidly in a simple nutrient broth in a culture 
bottle. 


Figure 1-31 An assortment of protozoans 
illustrates the enormous variety within 
this class of single-celled eukaryotes. 
These drawings are done to different scales, 
but in each case the scale bar represents 

10 um. The organisms in (A), (C), and (G) are 
ciliates; (B) is a heliozoan; (D) is an amoeba; 
(E) is a dinoflagellate; and (F) is a euglenoid. 
To see the latter in action, watch Movie 1.6. 
Because these organisms can only be seen 
with the aid of a microscope, they are also 
referred to as microorganisms. (From M.A. 
Sleigh, The Biology of Protozoa. London: 
Edward Arnold, 1973. With permission from 
Edward Arnold.) 
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Figure 1-32 The yeast Saccharomyces 
cerevisiae is a model eukaryote. In this 
scanning electron micrograph, a number 
of the cells are captured in the process 
of dividing, which they do by budding. 
Another micrograph of the same species 
is shown in Figure 1-14. (Courtesy of Ira 
Herskowitz and Eric Schabtach.) 


Most of our knowledge of the fundamental mechanisms of life—including 
how cells replicate their DNA and how they decode these genetic instruc- 
tions to make proteins—has come from studies of E. coli. Subsequent 
research has confirmed that these basic processes occur in essentially the 
same way in our own cells as they do in E. coli. 


Brewer's Yeast Is a Simple Eukaryote 


We tend to be preoccupied with eukaryotes because we are eukaryotes 
ourselves. But humans are complicated and reproduce slowly. So to get 
a handle on the fundamental biology of eukaryotes, we study a simpler 
representative—one that is easier and cheaper to keep and reproduces 
more rapidly. A popular choice has been the budding yeast Saccharomyces 
cerevisiae (Figure 1-32)—the same microorganism that is used for brew- 
ing beer and baking bread. 


S. cerevisiae is a small, single-celled fungus that is at least as closely 
related to animals as it is to plants. Like other fungi, it has a rigid cell wall, 
is relatively immobile, and possesses mitochondria but not chloroplasts. 
When nutrients are plentiful, S. cerevisiae reproduces almost as rapidly as 
a bacterium. Yet it carries out all the basic tasks that every eukaryotic cell 
must perform. Genetic and biochemical studies in yeast have been crucial 
to understanding many basic mechanisms in eukaryotic cells, including 
the cell-division cycle—the chain of events by which the nucleus and all 
the other components of a cell are duplicated and parceled out to create 
two daughter cells. The machinery that governs cell division has been so 
well conserved over the course of evolution that many of its components 
can function interchangeably in yeast and human cells (How We Know, 
pp. 30-31). Darwin himself would no doubt have been stunned by this 
dramatic example of evolutionary conservation. 


Arabidopsis Has Been Chosen as a Model Plant 


The large, multicellular organisms that we see around us—both plants 
and animals—seem fantastically varied, but they are much closer to 
one another, in their evolutionary origins and their basic cell biology, 
than they are to the great host of microscopic single-celled organisms. 
Whereas bacteria, archaea, and eukaryotes separated from each other 
more than 3 billion years ago, plants, animals, and fungi diverged only 
about 1.5 billion years ago, and the different species of flowering plants 
less than 200 million years ago (see Figure 1-29). 


The close evolutionary relationship among all flowering plants means 
that we can gain insight into their cell and molecular biology by focusing 
on just a few convenient species for detailed analysis. Out of the several 
hundred thousand species of flowering plants on Earth today, molecular 
biologists have focused their efforts on a small weed, the common wall 
cress Arabidopsis thaliana (Figure 1-33), which can be grown indoors 
in large numbers: one plant can produce thousands of offspring within 
8-10 weeks. Because genes found in Arabidopsis have counterparts in 
agricultural species, studying this simple weed provides insights into 
the development and physiology of the crop plants upon which our lives 
depend, as well as into the evolution of all the other plant species that 
dominate nearly every ecosystem on the planet. 


Figure 1-33 Arabidopsis thaliana, the common wall cress, is a 
model plant. This small weed has become the favorite organism 
of plant molecular and developmental biologists. (Courtesy of Toni 
Hayden and the John Innes Centre.) 
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Model Animals Include Flies, Worms, Fish, and Mice 


Multicellular animals account for the majority of all named species of 
living organisms, and the majority of animal species are insects. It is fit- 
ting, therefore, that an insect, the small fruit fly Drosophila melanogaster 
(Figure 1-34), should occupy a central place in biological research. The 
foundations of classical genetics (which we discuss in Chapter 19) were 
built to a large extent on studies of this insect. More than 80 years ago, 
genetic analysis of the fruit fly provided definitive proof that genes—the 
units of heredity—are carried on chromosomes. In more recent times, 
Drosophila, more than any other organism, has shown us how the genetic 
instructions encoded in DNA molecules direct the development of a ferti- 
lized egg cell (or zygote) into an adult multicellular organism containing 
vast numbers of different cell types organized in a precise and predict- 
able way. Drosophila mutants with body parts strangely misplaced or 
oddly patterned have provided the key to identifying and characterizing 
the genes that are needed to make a properly structured adult body, with 
gut, wings, legs, eyes, and all the other bits and pieces—all in their cor- 
rect places. These genes—which are copied and passed on to every cell 
in the body—define how each cell will behave in its social interactions 
with its sisters and cousins, thus controlling the structures that the cells 
can create, a regulatory feat we return to in Chapter 8. More importantly, 
the genes responsible for the development of Drosophila have turned out 
to be amazingly similar to those of humans—far more similar than one 
would suspect from the outward appearances of the two species. Thus 
the fly serves as a valuable model for studying human development as 
well as the genetic basis of many human diseases. 


Another widely studied animal is the nematode worm Caenorhabditis 
elegans (Figure 1-35), a harmless relative of the eelworms that attack the 
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Figure 1-34 Drosophila melanogaster is a 
favorite among developmental biologists 
and geneticists. Molecular genetic studies 
on this small fly have provided a key to the 
understanding of how all animals develop. 
(Edward B. Lewis. Courtesy of the Archives, 
California Institute of Technology.) 


QUESTION 1-7 


Your next-door neighbor has 
donated $100 in support of cancer 
research and is horrified to learn 
that her money is being spent on 
studying brewer's yeast. How could 
you put her mind at ease? 


Figure 1-35 Caenorhabditis elegans is 

a small nematode worm that normally 
lives in the soil. Most individuals are 
hermaphrodites, producing both sperm and 
eggs (the latter of which can be seen just 
beneath the skin along the underside of the 
animal). C. elegans was the first multicellular 
organism to have its complete genome 
sequenced. (Courtesy of Maria Gallegos.) 
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HOW WE KNOW 
LIFES COMMON MECHANISMS 


All living things are made of cells, and all cells—as we 
have discussed in this chapter—are fundamentally simi- 
lar inside: they store their genetic instructions in DNA 
molecules, which direct the production of RNA mol- 
ecules that direct the production of proteins. It is largely 
the proteins that carry out the cell’s chemical reactions, 
give the cell its shape, and control its behavior. But how 
deep do these similarities between cells—and the organ- 
isms they comprise—really run? Are proteins from one 
organism interchangeable with proteins from another? 
Would an enzyme that breaks down glucose in a bacte- 
rium, for example, be able to digest the same sugar if it 
were placed inside a yeast cell or a cell from a lobster or 
a human? What about the molecular machines that copy 
and interpret genetic information? Are they functionally 
equivalent from one organism to another? Insights have 
come from many sources, but the most stunning and 
dramatic answer came from experiments performed on 
humble yeast cells. These studies, which shocked the 
biological community, focused on one of the most fun- 
damental processes of life—cell division. 


Division and discovery 


All cells come from other cells, and the only way to 
make a new cell is through division of a preexisting 
one. To reproduce, a parent cell must execute an orderly 
sequence of reactions, through which it duplicates its 
contents and divides in two. This critical process of 
duplication and division—known as the cell-division 
cycle, or cell cycle for short—is complex and carefully 
controlled. Defects in any of the proteins involved can 
be devastating to the cell. 


Fortunately for biologists, this acute reliance on cru- 
cial proteins makes them easy to identify and study. Ifa 
protein is essential for a given process, a mutation that 
results in an abnormal protein—or in no protein at all— 
can prevent the cell from carrying out the process. By 
isolating organisms that are defective in their cell-divi- 
sion cycle, scientists have worked backward to discover 
the proteins that control progress through the cycle. 


The study of cell-cycle mutants has been particularly 
successful in yeasts. Yeasts are unicellular fungi and are 
popular organisms for such genetic studies. They are 
eukaryotes, like us, but they are small, simple, rapidly 
reproducing, and easy to manipulate genetically. Yeast 
mutants that are defective in their ability to complete 
cell division have led to the discovery of many genes 
that control the cell-division cycle—the so-called Cdc 
genes—and have provided a detailed understanding of 
how these genes, and the proteins they encode, actually 
work. 


Paul Nurse and his colleagues used this approach to 
identify Cdc genes in the yeast Schizosaccharomyces 
pombe, which is named after the African beer from 
which it was first isolated. S. pombe is a rod-shaped cell, 
which grows by elongation at its ends and divides by 
fission into two, through the formation of a partition in 
the center of the rod (see Figure 1-1E). The research- 
ers found that one of the Cdc genes they had identified, 
called Cdc2, was required to trigger several key events in 
the cell-division cycle. When that gene was inactivated 
by a mutation, the yeast cells would not divide. And 
when the cells were provided with a normal copy of the 
gene, their ability to reproduce was restored. 


It’s obvious that replacing a faulty Cdc2 gene in S. pombe 
with a functioning Cdc2 gene from the same yeast 
should repair the damage and enable the cell to divide 
normally. But what about using a similar cell-division 
gene from a different organism? That’s the question the 
Nurse team tackled next. 


Next of kin 


Saccharomyces cerevisiae is another kind of yeast and 
is one of a handful of model organisms biologists have 
chosen to study to expand their understanding of how 
eukaryotic cells work. Also used to brew beer, S. cerevi- 
siae divides by forming a small bud that grows steadily 
until it separates from the mother cell (see Figures 1-14 
and 1-32). Although S. cerevisiae and S. pombe differ in 
their style of division, both rely on a complex network 
of interacting proteins to get the job done. But could the 
proteins from one type of yeast substitute for those of 
the other? 


To find out, Nurse and his colleagues prepared DNA from 
healthy S. cerevisiae, and they introduced this DNA into 
S. pombe cells that contained a temperature-sensitive 
mutation in the Cdc2 gene that kept the cells from divid- 
ing when the heat was turned up. And they found that 
some of the mutant S. pombe cells regained the ability to 
proliferate at the elevated temperature. If spread onto a 
culture plate containing a growth medium, the rescued 
cells could divide again and again to form visible colo- 
nies, each containing millions of individual yeast cells 
(Figure 1-36). Upon closer examination, the research- 
ers discovered that these “rescued” yeast cells had 
received a fragment of DNA that contained the S. cerevi- 
Siae version of Cdc2—a gene that had been discovered in 
pioneering studies of the cell cycle by Lee Hartwell and 
colleagues. 


The result was exciting, but perhaps not all that sur- 
prising. After all, how different can one yeast be from 
another? A more demanding test would be to use DNA 
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Figure 1-36 S. pombe mutants defective in a cell-cycle gene 
can be rescued by the equivalent gene from S. cerevisiae. 
DNA is collected from S. cerevisiae and broken into large 
fragments, which are introduced into a culture of mutant 

S. pombe cells dividing at room temperature. We discuss how 
DNA can be manipulated and transferred into different cell 
types in Chapter 10. These yeast cells are then spread onto a 
plate containing a suitable growth medium and are incubated 
at a warm temperature, at which the mutant Cdc2 protein is 
inactive. The rare cells that survive and proliferate on these plates 
have been rescued by incorporation of foreign DNA fragments 
containing the Cdc2 gene, allowing them to divide normally at 
the higher temperature. 


from a more distant relative. So Nurse’s team repeated 
the experiment, this time using human DNA. And the 
results were the same. The human equivalent of the 
S. pombe Cdc2 gene could rescue the mutant yeast cells, 
allowing them to divide normally. 


Gene reading 


This result was much more surprising—even to Nurse. 
The ancestors of yeast and humans diverged some 


human 
S. pombe 
S. cerevisiae -- 
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1.5 billion years ago. So it was hard to believe that these 
two organisms would orchestrate cell division in such 
a similar way. But the results clearly showed that the 
human and yeast proteins are functionally equivalent. 
Indeed, Nurse and colleagues demonstrated that the 
proteins are almost exactly the same size and consist of 
amino acids strung together in a very similar order; the 
human Cdc2 protein is identical to the S. pombe Cdc2 
protein in 63% of its amino acids and is identical to the 
equivalent protein from S. cerevisiae in 58% of its amino 
acids (Figure 1-37). Together with Tim Hunt, who dis- 
covered a different cell-cycle protein called cyclin, Nurse 
and Hartwell shared a 2001 Nobel Prize for their studies 
of key regulators of the cell cycle. 


The Nurse experiments showed that proteins from very 
different eukaryotes can be functionally interchange- 
able and suggested that the cell cycle is controlled in 
a similar fashion in every eukaryotic organism alive 
today. Apparently, the proteins that orchestrate the cycle 
in eukaryotes are so fundamentally important that they 
have been conserved almost unchanged over more than 
a billion years of eukaryotic evolution. 


The same experiment also highlights another, even more 
basic point. The mutant yeast cells were rescued, not by 
direct injection of the human protein, but by introduc- 
tion of a piece of human DNA. Thus the yeast cells could 
read and use this information correctly, indicating that, 
in eukaryotes, the molecular machinery for reading the 
information encoded in DNA is also similar from cell to 
cell and from organism to organism. A yeast cell has 
all the equipment it needs to interpret the instructions 
encoded in a human gene and to use that information to 
direct the production of a fully functional human protein. 


The story of Cdc2 is just one of thousands of exam- 
ples of how research in yeast cells has provided critical 
insights into human biology. Although it may sound 
paradoxical, the shortest, most efficient path to improv- 
ing human health will often begin with detailed studies 
of the biology of simple organisms such as brewer's or 
baker’s yeast. 


Figure 1-37 The cell-division-cycle proteins from yeasts and human are very similar in their amino acid sequences. Identities 
between the amino acid sequences of a region of the human Cdc2 protein and a similar region of the equivalent proteins in S. pombe 
and S. cerevisiae are indicated by green shading. Each amino acid is represented by a single letter. 
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Figure 1-38 Zebrafish are popular models 
for studies of vertebrate development. 
(A) These small, hardy, tropical fish—a staple 
in many home aquaria—are easy and cheap 
to breed and maintain. (B) They are also 
ideal for developmental studies, as their 
transparent embryos develop outside the 
mother, making it easy to observe cells 
moving and changing their characters in 

the living organism as it develops. In this 
image of a two-day-old embryo, taken with 
a confocal microscope, a green fluorescent 
protein marks the developing lymphatic 
vessels and a red fluorescent protein marks 
developing blood vessels; regions where 
the two fluorescent markers coincide appear 
yellow. (A, courtesy of Steve Baskauf; 

B, from H.M. Jung et al., Development 
144:2070-2081, 2017.) 


roots of crops. Smaller and simpler than Drosophila, this creature devel- 
ops with clockwork precision from a fertilized egg cell into an adult that 
has exactly 959 body cells (plus a variable number of egg and sperm 
cells)—an unusual degree of regularity for an animal. We now have a 
minutely detailed description of the sequence of events by which this 
occurs—as the cells divide, move, and become specialized according to 
strict and predictable rules. And a wealth of mutants are available for 
testing how the worm’s genes direct this developmental ballet. Some 70% 
of human genes have some counterpart in the worm, and C. elegans, like 
Drosophila, has proved to be a valuable model for many of the devel- 
opmental processes that occur in our own bodies. Studies of nematode 
development, for example, have led to a detailed molecular understand- 
ing of apoptosis, a form of programmed cell death by which animals 
dispose of surplus cells, a topic discussed in Chapter 18. This process is 
also of great importance in the development of cancer, as we discuss in 
Chapter 20. 


Another animal that is providing molecular insights into developmen- 
tal processes, particularly in vertebrates, is the zebrafish (Figure 1-38A). 
Because this creature is transparent for the first two weeks of its life, it 
provides an ideal system in which to observe how cells behave during 
development in a living animal (Figure 1-38B). 


Mammals are among the most complex of animals, and the mouse has 
long been used as the model organism in which to study mammalian 
genetics, development, immunology, and cell biology. Thanks to mod- 
ern molecular biological techniques, it is possible to breed mice with 
deliberately engineered mutations in any specific gene, or with artificially 
constructed genes introduced into them (as we discuss in Chapter 10). 
In this way, one can test what a given gene is required for and how it 
functions. Almost every human gene has a counterpart in the mouse, 
with a similar DNA sequence and function. Thus, this animal has proven 
an excellent model for studying genes that are important in both human 
health and disease. 


Biologists Also Directly Study Humans and Their Cells 


Humans are not mice—or fish or flies or worms or yeast—and so many 
scientists also study human beings themselves. Like bacteria or yeast, 
our individual cells can be harvested and grown in culture, where inves- 
tigators can study their biology and more closely examine the genes that 
govern their functions. Given the appropriate surroundings, many human 
cell types—indeed, many cell types of animals or plants—will survive, 
proliferate, and even express specialized properties in a culture dish. 
Experiments using such cultured cells are sometimes said to be carried 
out in vitro (literally, “in glass”) to contrast them with experiments on 
intact organisms, which are said to be carried out in vivo (literally, “in the 
living”). 

Although not true for all cell types, many cells—including those harvested 
from humans—continue to display the differentiated properties appropri- 
ate to their origin when they are grown in culture: fibroblasts, a major cell 
type in connective tissue, continue to secrete proteins that form the extra- 
cellular matrix; embryonic heart muscle cells contract spontaneously in 
the culture dish; nerve cells extend axons and make functional connec- 
tions with other nerve cells; and epithelial cells join together to form 
continuous sheets, as they do inside the body (Figure 1-39 and Movie 
1.7). Because cultured cells are maintained in a controlled environment, 
they are accessible to study in ways that are often not possible in vivo. For 
example, cultured cells can be exposed to hormones or growth factors, 
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Figure 1-39 Cells in culture often display properties that reflect their origin. These phase-contrast micrographs 
show a variety of cell types in culture. (A) Fibroblasts from human skin. (B) Human neurons make connections with 
one another in culture. (C) Epithelial cells from human cervix form a cell sheet in culture. (Micrographs courtesy of 
ScienCell Research Laboratories, Inc.) 


and the effects that these signal molecules have on the shape or behavior 
of the cells can be easily explored. Remarkably, certain human embryo 
cells can be coaxed into differentiating into multiple cell types, which 
can self-assemble into organlike structures that closely resemble a nor- 
mal organ such as an eye or brain. Such organoids can be used to study 
developmental processes—and how they are derailed in certain human 
genetic diseases (discussed in Chapter 20). 


In addition to studying our cells in culture, humans are also examined 
directly in clinics. Much of the research on human biology has been driven 
by medical interests, and the medical database on the human species is 
enormous. Although naturally occurring, disease-causing mutations in 
any given human gene are rare, the consequences are well documented. 
This is because humans are unique among animals in that they report 
and record their own genetic defects: in no other species are billions of 
individuals so intensively examined, described, and investigated. 


Nevertheless, the extent of our ignorance is still daunting. The mamma- 
lian body is enormously complex, being formed from thousands of billions 
of cells, and one might despair of ever understanding how the DNA ina 
fertilized mouse egg cell directs the generation of a mouse rather than 
a fish, or how the DNA in a human egg cell directs the development of 
a human rather than a mouse. Yet the revelations of molecular biology 
have made the task seem eminently approachable. As much as anything, 
this new optimism has come from the realization that the genes of one 
type of animal have close counterparts in most other types of animals, 
apparently serving similar functions (Figure 1-40). We all have a com- 
mon evolutionary origin, and under the surface it seems that we share 
the same molecular mechanisms. Flies, worms, fish, mice, and humans 
thus provide a key to understanding how animals in general are made 
and how their cells work. 


Comparing Genome Sequences Reveals Life’s Common 
Heritage 


At a molecular level, evolutionary change has been remarkably slow. 
We can see in present-day organisms many features that have been 
preserved through more than 3 billion years of life on Earth—about one- 
fifth of the age of the universe. This evolutionary conservatism provides 


33 


34 


CHAPTER] Cells: The Fundamental Units of Life 


Figure 1-40 Different species share 
similar genes. The human baby and the 
mouse shown here have remarkably similar 
white patches on their foreheads because 
they both have defects in the same gene 
(called Kit), which is required for the normal 
development, migration, and maintenance 
of some skin pigment cells. (Courtesy of 
R.A. Fleischman, Proc. Natl. Acad. Sci. 
U.S.A. 88:10885-10889, 1991.) 


Figure 1-41 Organisms vary enormously 
in the size of their genomes. Genome size 
is measured in nucleotide pairs of DNA per 
haploid genome; that is, per single copy 

of the genome. (The body cells of sexually 
reproducing organisms such as ourselves 
are generally diploid: they contain two 
copies of the genome, one inherited from 
the mother, the other from the father.) 
Closely related organisms can vary widely 

in the quantity of DNA in their genomes (as 
indicated by the length of the green bars), 
even though they contain similar numbers 
of functionally distinct genes; this is because 
most of the DNA in large genomes does not 
code for protein, as discussed shortly. (Data 
from T.R. Gregory, 2008, Animal Genome 
Size Database: www.genomesize.com.) 


the foundation on which the study of molecular biology is built. To set 
the scene for the chapters that follow, therefore, we end this chapter by 
considering a little more closely the family relationships and basic simi- 
larities among all living things. This topic has been dramatically clarified 
by technological advances that have allowed us to determine the com- 
plete genome sequences of thousands of organisms, including our own 
species (as discussed in more detail in Chapter 9). 


The first thing we note when we look at an organism's genome is its over- 
all size and how many genes it packs into that length of DNA. Prokaryotes 
carry very little superfluous genetic baggage and, nucleotide-for-nucleo- 
tide, they squeeze a lot of information into their relatively small genomes. 
E. coli, for example, carries its genetic instructions in a single, circular, 
double-stranded molecule of DNA that contains 4.6 million nucleotide 
pairs and 4300 protein-coding genes. (We focus on the genes that code 
for proteins because they are the best characterized, and their numbers 
are the most certain. We review how genes are counted in Chapter 9.) 
The simplest known bacterium contains only about 500 protein-coding 
genes, but most prokaryotes have genomes that contain at least 1 million 
nucleotide pairs and 1000-8000 protein-coding genes. With these few 
thousand genes, prokaryotes are able to thrive in even the most hostile 
environments on Earth. 


The compact genomes of typical bacteria are dwarfed by the genomes of 
typical eukaryotes. The human genome, for example, contains about 700 
times more DNA than the E. coli genome, and the genome of an amoeba 
contains about 100 times more than ours (Figure 1-41). The rest of the 
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TABLE 1-2 SOME MODEL ORGANISMS AND THEIR GENOMES 


Genome Size* | Approximate Number 


(Nucleotide of Protein-coding 
Pairs) Genes 
Homo sapiens (human) 3200 x 10° 19,000 
Mus musculus (mouse) 2800 x 10° 22,000 
Drosophila melanogaster (fruit fly) 180 x 10° 14,000 
Arabidopsis thaliana (plant) 103 x 10° 28,000 
Caenorhabditis elegans (roundworm) 100 x 10° 22,000 
Saccharomyces cerevisiae (yeast) 125 x 10° 6600 
Escherichia coli (bacterium) 4.6 x 10° 4300 


*Genome size includes an estimate for the amount of highly repeated, noncoding 
DNA sequence, which does not appear in genome databases. 


model organisms we have described have genomes that fall somewhere 
between E. coli and human in terms of size. S. cerevisiae contains about 
2.5 times as much DNA as E. coli; D. melanogaster has about 10 times 
more DNA than S. cerevisiae; and M. musculus has about 20 times more 
DNA than D. melanogaster (Table 1-2). 


In terms of gene numbers, however, the differences are not so great. We 
have only about five times as many protein-coding genes as E. coli, for 
example. Moreover, many of our genes—and the proteins they encode— 
fall into closely related family groups, such as the family of hemoglobins, 
which has nine closely related members in humans. Thus the number of 
fundamentally different proteins in a human is not very many times more 
than in the bacterium, and the number of human genes that have iden- 
tifiable counterparts in the bacterium is a significant fraction of the total. 


This high degree of “family resemblance” is striking when we compare 
the genome sequences of different organisms. When genes from different 
organisms have very similar nucleotide sequences, it is highly probable 
that they descended from a common ancestral gene. Such genes (and 
their protein products) are said to be homologous. Now that we have the 
complete genome sequences of many different organisms from all three 
domains of life—archaea, bacteria, and eukaryotes—we can search sys- 
tematically for homologies that span this enormous evolutionary divide. 
By taking stock of the common inheritance of all living things, scientists 
are attempting to trace life’s origins back to the earliest ancestral cells. 
We return to this topic in Chapter 9. 


Genomes Contain More Than Just Genes 


Although our view of genome sequences tends to be “gene-centric,” our 
genomes contain much more than just genes. The vast bulk of our DNA 
does not code for proteins or for functional RNA molecules. Instead, it 
includes a mixture of sequences that help regulate gene activity, plus 
sequences that seem to be dispensable. The large quantity of regulatory 
DNA contained in the genomes of eukaryotic multicellular organisms 
allows for enormous complexity and sophistication in the way different 
genes are brought into action at different times and places. Yet, in the 
end, the basic list of parts—the set of proteins that the cells can make, as 
specified by the DNA—is not much longer than the parts list of an auto- 
mobile, and many of those parts are common not only to all animals, but 
also to the entire living world. 
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That DNA can program the growth, development, and reproduction of 
living cells and complex organisms is truly amazing. In the rest of this 
book, we will try to explain what is known about how cells work—by 
examining their component parts, how these parts work together, and 
how the genome of each cell directs the manufacture of the parts the cell 
needs to function and to reproduce. 


ESSENTIAL CONCEPTS 


Cells are the fundamental units of life. All present-day cells are 
believed to have evolved from an ancestral cell that existed more 
than 3 billion years ago. 


All cells are enclosed by a plasma membrane, which separates the 
inside of the cell from its environment. 


All cells contain DNA as a store of genetic information and use it to 
guide the synthesis of RNA molecules and proteins. This molecular 
relationship underlies cells’ ability to self-replicate. 


Cells in a multicellular organism, though they all contain the same 
DNA, can be very different because they turn on different sets of 
genes according to their developmental history and to signals they 
receive from their environment. 


Animal and plant cells are typically 5-20 um in diameter and can be 
seen with a light microscope, which also reveals some of their inter- 
nal components, including the larger organelles. 


The electron microscope reveals even the smallest organelles, but 
specimens require elaborate preparation and cannot be viewed while 
alive. 


Specific large molecules can be located in fixed or living cells by fluo- 
rescence microscopy. 


The simplest of present-day living cells are prokaryotes—bacteria 
and archaea: although they contain DNA, they lack a nucleus and 
most other organelles and probably resemble most closely the origi- 
nal ancestral cell. 


Different species of prokaryotes are diverse in their chemical capa- 
bilities and inhabit an amazingly wide range of habitats. 


Eukaryotic cells possess a nucleus and other organelles not found in 
prokaryotes. They probably evolved in a series of stages, including 
the acquisition of mitochondria by engulfment of aerobic bacteria 
and (for cells that carry out photosynthesis) the acquisition of chlo- 
roplasts by engulfment of photosynthetic bacteria. 


The nucleus contains the main genetic information of the eukaryotic 
organism, stored in very long DNA molecules. 


The cytoplasm of eukaryotic cells includes all of the cell’s contents 
outside the nucleus and contains a variety of membrane-enclosed 
organelles with specialized functions: mitochondria carry out the final 
oxidation of food molecules and produce ATP; the endoplasmic reticu- 
lum and the Golgi apparatus synthesize complex molecules for export 
from the cell and for insertion in cell membranes; lysosomes digest 
large molecules; in plant cells and other photosynthetic eukaryotes, 
chloroplasts perform photosynthesis. 


Outside the membrane-enclosed organelles in the cytoplasm is the 
cytosol, a highly concentrated mixture of large and small molecules 
that carry out many essential biochemical processes. 


The cytoskeleton is composed of protein filaments that extend 
throughout the cytoplasm and are responsible for cell shape and 
movement and for the transport of organelles and large molecular 
complexes from one intracellular location to another. 


Questions 


e Free-living, single-celled eukaryotic microorganisms are complex 
cells that, in some cases, can swim, mate, hunt, and devour other 
microorganisms. 


e Animals, plants, and some fungi are multicellular organisms that con- 
sist of diverse eukaryotic cell types, all derived from a single fertilized 
egg cell; the number of such cells cooperating to form a large, multi- 
cellular organism such as a human runs into thousands of billions. 


e Biologists have chosen a small number of model organisms to study 
intensely, including the bacterium E. coli, brewer’s yeast, a nematode 
worm, a fly, a small plant, a fish, mice, and humans themselves. 


e The human genome has about 19,000 protein-coding genes, which is 
about five times as many as E. coli and about 5000 more than the fly. 
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QUESTIONS 


QUESTION 1-8 


By now you should be familiar with the following cell 
components. Briefly define what they are and what function 
they provide for cells. 


cytosol 
cytoplasm 
mitochondria 
nucleus 
chloroplasts 
lysosomes 


chromosomes 
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QUESTION 1-9 


Which of the following statements are correct? Explain your 
answers. 


A. The hereditary information of a cell is passed on by its 
proteins. 


B. Bacterial DNA is found in the cytoplasm. 
C. Plants are composed of prokaryotic cells. 


D. With the exception of egg and sperm cells, all of the 
nucleated cells within a single multicellular organism have 
the same number of chromosomes. 


E. The cytosol includes membrane-enclosed organelles 
such as lysosomes. 


F. The nucleus and a mitochondrion are each surrounded 
by a double membrane. 


G. Protozoans are complex organisms with a set of 
specialized cells that form tissues such as flagella, 
mouthparts, stinging darts, and leglike appendages. 


H. Lysosomes and peroxisomes are the sites of degradation 
of unwanted materials. 
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QUESTION 1-10 


Identify the different organelles indicated with letters in the 
electron micrograph of a plant cell shown below. Estimate 
the length of the scale bar in the figure. 


QUESTION 1-11 


There are three major classes of protein filaments that 
make up the cytoskeleton of a typical animal cell. What are 
they, and what are the differences in their functions? Which 
cytoskeletal filaments would be most plentiful in a muscle 
cell or in an epidermal cell making up the outer layer of the 
skin? Explain your answers. 


QUESTION 1-12 


Natural selection is such a powerful force in evolution 
because organisms or cells with even a small reproductive 
advantage will eventually outnumber their competitors. 

To illustrate how quickly this process can occur, consider 

a cell culture that contains 1 million bacterial cells that 
double every 20 minutes. A single cell in this culture 
acquires a mutation that allows it to divide faster, with a 
generation time of only 15 minutes. Assuming that there is 
an unlimited food supply and no cell death, how long would 
it take before the progeny of the mutated cell became 
predominant in the culture? (Before you go through the 
calculation, make a guess: do you think it would take about 
a day, a week, a month, or a year?) How many cells of either 
type are present in the culture at this time? (The number of 
cells N in the culture at time t is described by the equation 
N = Nox 2*6, where No is the number of cells at zero time 
and G is the generation time.) 


QUESTION 1-13 


When bacteria are cultured under adverse conditions—for 
example, in the presence of a poison such as an antibiotic— 
most cells grow and divide slowly. But it is not uncommon to 
find that the rate of proliferation is restored to normal after 
a few days. Suggest why this may be the case. 


QUESTION 1-14 


Apply the principle of exponential growth of a population of 
cells in a culture (as described in Question 1-12) to the cells 
in a multicellular organism, such as yourself. There are about 
1013 cells in your body. Assume that one cell has acquired 
mutations that allow it to divide in an uncontrolled manner 
to become a cancer cell. Some cancer cells can proliferate 
with a generation time of about 24 hours. If none of the 
cancer cells died, how long would it take before 1013 cells 

in your body would be cancer cells? (Use the equation 

N = No x 2¥G, with t the time and G the generation time. 
Hint: 1013 = 243.) 


QUESTION 1-15 


“The structure and function of a living cell are dictated 
by the laws of chemistry, physics, and thermodynamics.” 
Provide examples that support (or refute) this claim. 


QUESTION 1-16 


What, if any, are the advantages in being multicellular? 


QUESTION 1-17 


Draw to scale the outline of two spherical cells, one a 
bacterium with a diameter of 1 um, the other an animal cell 
with a diameter of 15 um. Calculate the volume, surface 
area, and surface-to-volume ratio for each cell. How 

would the latter ratio change if you included the internal 
membranes of the animal cell in the calculation of surface 
area (assume internal membranes have 15 times the area of 
the plasma membrane)? (The volume of a sphere is given by 
4nr3/3 and its surface by 4nr2, where r is its radius.) Discuss 
the following hypothesis: “Internal membranes allowed 
bigger cells to evolve.” 


QUESTION 1-18 


What are the arguments that all living cells evolved from 

a common ancestor cell? Imagine the very “early days” 

of evolution of life on Earth. Would you assume that the 
primordial ancestor cell was the first and only cell to form? 


QUESTION 1-19 


Looking at some pond water with a light microscope, you 
notice an unfamiliar rod-shaped cell about 200 um long. 
Knowing that some exceptional bacteria can be as big 

as this or even bigger, you wonder whether your cell is a 
bacterium or a eukaryote. How will you decide? If it is not a 
eukaryote, how will you discover whether it is a bacterium 
or an archaeon? 


Chemical Components of Cells 


At first sight, it is difficult to comprehend that living creatures are merely 
chemical systems. Their incredible diversity of form, their seemingly pur- 
poseful behavior, and their ability to grow and reproduce all seem to set 
them apart from the world of solids, liquids, and gases that chemistry nor- 
mally describes. Indeed, until the late nineteenth century, it was widely 
believed that all living things contained a vital force—an “animus’”—that 
was responsible for their distinctive properties. 


We now know that there is nothing in a living organism that disobeys 
chemical or physical laws. However, the chemistry of life is indeed a 
special kind. First, it is based overwhelmingly on carbon compounds, the 
study of which is known as organic chemistry. Second, it depends almost 
exclusively on chemical reactions that take place in a watery, or aqueous, 
environment and in the relatively narrow range of temperatures experi- 
enced on Earth. Third, it is enormously complex: even the simplest cell is 
vastly more complicated in its chemistry than any other chemical system 
known. Fourth, it is dominated and coordinated by collections of large 
polymers—molecules made of many chemical subunits linked end-to- 
end—whose unique properties enable cells and organisms to grow and 
reproduce and to do all the other things that are characteristic of life. 
Finally, the chemistry of life is tightly regulated: cells deploy a wide vari- 
ety of mechanisms to make sure that each of their chemical reactions 
occurs at the proper rate, time, and place. 


Because chemistry lies at the heart of all biology, in this chapter, we briefly 
survey the chemistry of the living cell. We will meet the molecules from 
which cells are made and examine their structures, shapes, and chemical 
properties. These molecules determine the size, structure, and functions 
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Figure 2-1 An atom consists of a nucleus 
surrounded by an electron cloud. The 
dense, positively charged nucleus contains 
nearly all of the atom’s mass. The much 
lighter and negatively charged electrons 
occupy space around the nucleus, 
as governed by the laws of quantum 
mechanics. The electrons are depicted as a 
continuous cloud, because there is no way 
of predicting exactly where an electron is at 
any given instant. The density of shading of 
the cloud is an indication of the probability 
that electrons will be found there. 

The diameter of the electron cloud 
ranges from about 0.1 nm (for hydrogen) 
to about 0.4 nm (for atoms of high atomic 
number). The nucleus is very much smaller: 
about 5 x 10% nm for carbon, for example. 
If this diagram were drawn to scale, the 
nucleus would not be visible. 


Figure 2-2 The number of protons in 
an atom determines its atomic number. 
Schematic representations of an atom of 


carbon and an atom of hydrogen are shown. 


The nucleus of every atom except hydrogen 
consists of both positively charged protons 
and electrically neutral neutrons; the atomic 
weight equals the number of protons plus 
neutrons. The number of electrons in an 
atom is equal to the number of protons, so 
that the atom has no net charge. 

In contrast to Figure 2-1, the electrons 
are shown here as individual particles. 
The concentric black circles represent in a 
highly schematic form the “orbits” (that is, 
the different distributions) of the electrons. 
The neutrons, protons, and electrons are in 
reality minuscule in relation to the atom as 
a whole; their size is greatly exaggerated 
here. 
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of living cells. By understanding how they interact, we can begin to see 
how cells exploit the laws of chemistry and physics to survive, thrive, and 
reproduce. 


CHEMICAL BONDS 


Matter is made of combinations of elements—substances such as hydro- 
gen or carbon that cannot be broken down or interconverted by chemical 
means. The smallest particle of an element that still retains its distinctive 
chemical properties is an atom. The characteristics of substances other 
than pure elements—including the materials from which living cells are 
made—depend on which atoms they contain and the way that these 
atoms are linked together in groups to form molecules. To understand 
living organisms, therefore, it is crucial to know how the chemical bonds 
that hold atoms together in molecules are formed. 


Cells Are Made of Relatively Few Types of Atoms 


Each atom has at its center a dense, positively charged nucleus, which 
is surrounded at some distance by a cloud of negatively charged 
electrons, held in orbit by electrostatic attraction to the nucleus (Figure 
2-1). The nucleus consists of two kinds of subatomic particles: protons, 
which are positively charged, and neutrons, which are electrically neutral. 
The atomic number of an element is determined by the number of protons 
present in its atom’s nucleus. An atom of hydrogen has a nucleus com- 
posed of a single proton; so hydrogen, with an atomic number of 1, is the 
lightest element. An atom of carbon has six protons in its nucleus and an 
atomic number of 6 (Figure 2-2). 


The electric charge carried by each proton is exactly equal and opposite 
to the charge carried by a single electron. Because the whole atom is elec- 
trically neutral, the number of negatively charged electrons surrounding 
the nucleus is therefore equal to the number of positively charged pro- 
tons that the nucleus contains; thus the number of electrons in an atom 
also equals the atomic number. All atoms of a given element have the 
same atomic number, and we will see shortly that it is this number that 
dictates each element's chemical behavior. 


Neutrons have essentially the same mass as protons. They contribute to 
the structural stability of the nucleus: if there are too many or too few, 
the nucleus may disintegrate by radioactive decay. However, neutrons 
do not alter the chemical properties of the atom. Thus an element can 
exist in several physically distinguishable but chemically identical forms, 
called isotopes, each having a different number of neutrons but the same 
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carbon atom hydrogen atom 
atomic number = 1 


atomic weight = 1 


atomic number = 6 
atomic weight = 12 


number of protons. Multiple isotopes of almost all the elements occur 
naturally, including some that are unstable—and thus radioactive. For 
example, while most carbon on Earth exists as carbon 12, a stable iso- 
tope with six protons and six neutrons, also present are small amounts of 
an unstable isotope, carbon 14, which has six protons and eight neutrons. 
Carbon 14 undergoes radioactive decay at a slow but steady rate, a prop- 
erty that allows archaeologists to estimate the age of organic material. 


The atomic weight of an atom, or the molecular weight of a molecule, 
is its mass relative to the mass of a hydrogen atom. This value is equal 
to the number of protons plus the number of neutrons that the atom or 
molecule contains; because electrons are so light, they contribute almost 
nothing to the total mass. Thus the major isotope of carbon has an atomic 
weight of 12 and is written as !2C. The unstable carbon isotope just men- 
tioned has an atomic weight of 14 and is written as '4C. The mass of an 
atom or a molecule is generally specified in daltons, one dalton being an 
atomic mass unit essentially equal to the mass of a hydrogen atom. 


Atoms are so small that it is hard to imagine their size. An individual 
carbon atom is roughly 0.2 nm in diameter, so it would take about 5 
million of them, laid out in a straight line, to span a millimeter. One pro- 
ton or neutron weighs approximately 1/(6 x 107%) gram. As hydrogen 
has only one proton—thus an atomic weight of 1—1 gram of hydrogen 
contains 6 x 10% atoms. For carbon—which has six protons and six neu- 
trons, and an atomic weight of 12—12 grams contain 6 x 107° atoms. This 
huge number, called Avogadro’s number, allows us to relate everyday 
quantities of chemicals to numbers of individual atoms or molecules. If 
a substance has a molecular weight of X, X grams of the substance will 
contain 6 x 1073 molecules. This quantity is called one mole of the sub- 
stance (Figure 2-3). The concept of mole is used widely in chemistry as 
a way to represent the number of molecules that are available to partici- 
pate in chemical reactions. 


There are about 90 naturally occurring elements, each differing from the 
others in the number of protons and electrons in its atoms. Living things, 
however, are made of only a small selection of these elements, four of 
which—carbon (C), hydrogen (H), nitrogen (N), and oxygen (O)—consti- 
tute 96% of any organism's weight. This composition differs markedly 
from that of the nonliving, inorganic environment on Earth (Figure 2-4) 
and is evidence that a distinctive type of chemistry operates in biological 
systems. 


The Outermost Electrons Determine How Atoms Interact 


To understand how atoms come together to form the molecules that 
make up living organisms, we have to pay special attention to each 
atom’s electrons. Protons and neutrons are welded tightly to one another 
in an atom’s nucleus, and they change partners only under extreme con- 
ditions—during radioactive decay, for example, or in the interior of the 
sun or a nuclear reactor. In living tissues, only the electrons of an atom 
undergo rearrangements. They form the accessible part of the atom and 
specify the chemical rules by which atoms combine to form molecules. 


Electrons are in continuous motion around the nucleus, but motions on 
this submicroscopic scale obey different laws from those we are familiar 
with in everyday life. These laws dictate that electrons in an atom can 
exist only in certain discrete regions of movement—very roughly speak- 
ing, in distinct orbits. Moreover, there is a strict limit to the number of 
electrons that can be accommodated in an orbit of a given type, a so- 
called electron shell. The electrons closest on average to the positively 
charged nucleus are attracted most strongly to it and occupy the inner, 
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A mole is X grams of a substance, 
where X is the molecular weight of the 
substance. A mole will contain 

6 x 10” molecules of the substance. 


1 mole of carbon weighs 12 g 
1 mole of glucose weighs 180 g 
1 mole of sodium chloride weighs 58 g 


A one molar solution has a 
concentration of 1 mole of the substance 
in 1 liter of solution. A 1 M solution of 
glucose, for example, contains 180 g/L, 
and a one millimolar (1 mM) solution 
contains 180 mg/L. 


The standard abbreviation for gram is g; 
the abbreviation for liter is L. 


Figure 2-3 What's a mole? Some simple 
examples of moles and molar solutions. 
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Figure 2-4 The distribution of elements 
in the Earth's crust differs radically from 
that in the human body. The abundance 
of each element is expressed here as a 
percentage of the total number of atoms 
present in a biological or geological sample 
(water included). Thus, for example, more 
than 60% of the atoms in the human body 
are hydrogen atoms, and nearly 30% of the 
atoms in the Earth’s crust are silicon atoms 
(Si). The relative abundance of elements is 


similar in all living things. 
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Figure 2-5 An element's chemical 
reactivity depends on the degree to 
which its outermost electron shell is filled. 
All of the elements commonly found in 
living organisms have outermost shells that 
are not completely filled. The electrons in 
these incomplete shells (here shown in red) 
can participate in chemical reactions with 
other atoms. Inert gases (yellow), in contrast, 
have completely filled outermost shells 
(gray) and are thus chemically unreactive. 


QUESTION 2-1 


A cup containing exactly 18 g, or 

1 mole, of water was emptied into 
the Aegean Sea 3000 years ago. 
What are the chances that the same 
quantity of water, scooped today 
from the Pacific Ocean, would 
include at least one of these ancient 
water molecules? Assume perfect 
mixing and an approximate volume 
for the world’s oceans of 1.5 billion 
cubic kilometers (1.5 x 10? km3). 
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most tightly bound shell. This innermost shell can hold a maximum of 
two electrons. The second shell is farther away from the nucleus, and 
can hold up to eight electrons. The third shell can also hold up to eight 
electrons, which are even less tightly bound. The fourth and fifth shells 
can hold 18 electrons each. Atoms with more than four shells are very 
rare in biological molecules. 


The arrangement of electrons in an atom is most stable when all the 
electrons are in the most tightly bound states that are possible for them— 
that is, when they occupy the innermost shells, closest to the nucleus. 
Therefore, with certain exceptions in the larger atoms, the electrons of an 
atom fill the shells in order—the first before the second, the second before 
the third, and so on. An atom whose outermost shell is entirely filled 
with electrons is especially stable and therefore chemically unreactive. 
Examples are helium with 2 electrons (atomic number 2), neon with 2 + 8 
electrons (atomic number 10), and argon with 2 + 8 + 8 electrons (atomic 
number 18); these are all inert gases. Hydrogen, by contrast, has only 
one electron, which leaves its outermost shell half-filled, so it is highly 
reactive. The atoms found in living organisms all have outermost shells 
that are incompletely filled, and they are therefore able to react with one 
another to form molecules (Figure 2-5). 


Because an incompletely filled electron shell is less stable than one that 
is completely filled, atoms with incomplete outer shells have a strong 
tendency to interact with other atoms so as to either gain or lose enough 
electrons to fill the outermost shell. This electron exchange can be 
achieved either by transferring electrons from one atom to another or 
by sharing electrons between two atoms. These two strategies gener- 
ate the two types of chemical bonds that can bind atoms strongly to 
one another: an ionic bond is formed when electrons are donated by one 
atom to another, whereas a covalent bond is formed when two atoms 
share a pair of electrons (Figure 2-6). 


An H atom, which needs only one more electron to fill its only shell, gen- 
erally acquires this electron by sharing—forming one covalent bond with 
another atom. The other most common elements in living cells—C, N, 
and O, which have an incomplete second shell, and P and S, which have 
an incomplete third shell (see Figure 2-5)—also tend to share electrons; 
these elements thus fill their outer shells by forming several covalent 
bonds. The number of electrons an atom must acquire or lose (either by 
sharing or by transfer) to attain a filled outer shell determines the number 
of bonds that the atom can make. 
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Because the state of the outer electron shell determines the chemical 
properties of an element, when the elements are listed in order of their 
atomic number we see a periodic recurrence of elements that have simi- 
lar properties. For example, an element with an incomplete second shell 
containing one electron will behave in a similar way as an element that 
has filled its second shell and has an incomplete third shell containing 
one electron. The metals, for example, have incomplete outer shells with 
just one or a few electrons, whereas, as we have just seen, the inert gases 
have full outer shells. This arrangement gives rise to the periodic table of 
the elements, outlined in Figure 2-7, in which the elements found in liv- 
ing organisms are highlighted in color. 


Covalent Bonds Form by the Sharing of Electrons 


All of the characteristics of a cell depend on the molecules it contains. 
A molecule is a cluster of atoms held together by covalent bonds, in 
which electrons are shared rather than transferred between atoms. The 
shared electrons complete the outer shells of the interacting atoms. In the 
simplest possible molecule—a molecule of hydrogen (Hz)—two H atoms, 
each with a single electron, share their electrons, thus filling their outer- 
most shells. The shared electrons form a cloud of negative charge that 
is densest between the two positively charged nuclei. This electron den- 
sity helps to hold the nuclei together by opposing the mutual repulsion 
between the positive charges of the nuclei, which would otherwise force 
them apart. The attractive and repulsive forces are precisely in balance 
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Figure 2-6 Atoms can attain a more 
stable arrangement of electrons in their 
outermost shell by interacting with one 
another. A covalent bond is formed when 
electrons are shared between atoms. An 
ionic bond is formed when electrons are 
transferred from one atom to the other. The 
two cases shown represent extremes; often, 
covalent bonds form with a partial transfer 
(unequal sharing of electrons), resulting in a 
polar covalent bond, as we discuss shortly. 


QUESTION 2-2 


A carbon atom contains six protons 
and six neutrons. 

A. What are its atomic number and 
atomic weight? 

B. How many electrons does it 
have? 

C. How many additional electrons 
must it add to fill its outermost 
shell? How does this affect carbon’s 
chemical behavior? 

D. Carbon with an atomic weight of 
14 is radioactive. How does it differ 
in structure from nonradioactive 
carbon? How does this difference 
affect its chemical behavior? 


Figure 2-7 When ordered by their atomic 
number into the periodic table, the 
elements fall into vertical columns in which 
the atoms have similar properties. This 

is because the atoms in the same vertical 
column must gain or lose the same number 
of electrons to attain a filled outer shell, and 
they therefore behave similarly when forming 
bonds with other atoms. Thus, for example, 
both magnesium (Mg) and calcium (Ca) tend 
to give away the two electrons in their outer 
shells to form ionic bonds with atoms such as 
chlorine (Cl), which need extra electrons to 
complete their outer shells. 

The chemistry of life is dominated by 
lighter elements. The four elements 
highlighted in red constitute 99% of the 
total number of atoms present in the human 
body and about 96% of our total weight. 

An additional seven elements, highlighted 
in blue, together represent about 0.9% of 
our total number of atoms. Other elements, 
shown in green, are required in trace 
amounts by humans. It remains unclear 
whether those elements shown in yellow are 
essential in humans or not. 

The atomic weights shown here are 
those of the most common isotope of each 
element. The vertical red line represents a 
break in the periodic table where a group of 
large atoms with similar chemical properties 
has been removed. 
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Figure 2-9 Covalent bonds are 
characterized by particular geometries. 
(A) The spatial arrangement of the covalent 
bonds that can be formed by oxygen, 
nitrogen, and carbon. (B) Molecules formed 
from these atoms therefore have precise 
three-dimensional structures defined by 
the bond angles and bond lengths for each 
covalent linkage. A water molecule, for 
example, forms a “V” shape with an angle 
close to 109°. 

In these ball-and-stick models, the 
different colored balls represent different 
atoms, and the sticks represent the covalent 
bonds. The colors traditionally used to 
represent the different atoms—black (or 
dark gray) for carbon, white for hydrogen, 
blue for nitrogen, and red for oxygen—were 
established by the chemist August Wilhelm 
Hofmann in 1865, when he used a set of 
colored croquet balls to build molecular 
models for a public lecture on “the 
combining power of atoms.” 
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Figure 2-8 The hydrogen molecule is held together by a covalent 
bond. Each hydrogen atom in isolation has a single electron, which 
means that its first (and only) electron shell is incompletely filled. By 
coming together to form a hydrogen molecule (H2, or hydrogen gas), 
the two atoms are able to share their electrons, so that each obtains 
a completely filled first shell, with the shared electrons adopting 
modified orbits around the two nuclei. The covalent bond between 
the two atoms has a defined length—0.074 nm, which is the distance 
between the two nuclei. If the atoms were closer together, the 
positively charged nuclei would repel each other; if they were farther 
apart, they would not be able to share electrons as effectively. 


when these nuclei are separated by a characteristic distance, called the 
bond length (Figure 2-8). 


Whereas an H atom can form only a single covalent bond, the other com- 
mon atoms that form covalent bonds in cells—O, N, S, and P, as well as 
the all-important C—can form more than one. The outermost shells of 
these atoms, as we have seen, can accommodate up to eight electrons, 
and they form covalent bonds with as many other atoms as necessary to 
reach this number. Oxygen, with six electrons in its outer shell, is most 
stable when it acquires two extra electrons by sharing with other atoms, 
and it therefore forms up to two covalent bonds. Nitrogen, with five outer 
electrons, forms a maximum of three covalent bonds, while carbon, with 
four outer electrons, forms up to four covalent bonds—thus sharing four 
pairs of electrons (see Figure 2-5). 


When one atom forms covalent bonds with several others, these multiple 
bonds have definite orientations in space relative to one another, reflect- 
ing the orientations of the orbits of the shared electrons. Covalent bonds 
between multiple atoms are therefore characterized by specific bond 
angles, as well as by specific bond lengths and bond energies (Figure 
2-9). The four covalent bonds that can form around a carbon atom, for 
example, are arranged as if pointing to the four corners of a regular tet- 
rahedron. The precise orientation of the covalent bonds around carbon 
dictates the three-dimensional geometry of all organic molecules. 


Some Covalent Bonds Involve More Than One Electron 
Pair 
Most covalent bonds involve the sharing of two electrons, one donated 


by each participating atom; these are called single bonds. Some covalent 
bonds, however, involve the sharing of more than one pair of electrons. 
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Four electrons can be shared, for example, two coming from each par- 
ticipating atom; such a bond is called a double bond. Double bonds are 
shorter and stronger than single bonds and have a characteristic effect 
on the geometry of molecules containing them. A single covalent bond 
between two atoms generally allows the rotation of one part of a mol- 
ecule relative to the other around the bond axis. A double bond prevents 
such rotation, producing a more rigid and less flexible arrangement of 
atoms (Figure 2-10). This restriction has a major influence on the three- 
dimensional shape of many macromolecules. 


Some molecules contain atoms that share electrons in a way that pro- 
duces bonds that are intermediate in character between single and 
double bonds. The highly stable benzene molecule, for example, is 
made up of a ring of six carbon atoms in which the bonding electrons 
are evenly distributed, although the arrangement is sometimes depicted 
as an alternating sequence of single and double bonds. Panel 2-1 
(pp. 66-67) reviews the covalent bonds commonly encountered in bio- 
logical molecules. 


Electrons in Covalent Bonds Are Often Shared 
Unequally 


When the atoms joined by a single covalent bond belong to different ele- 
ments, the two atoms usually attract the shared electrons to different 
degrees. Covalent bonds in which the electrons are shared unequally in 
this way are known as polar covalent bonds. A polar structure (in the elec- 
trical sense) is one in which the positive charge is concentrated toward 
one atom in the molecule (the positive pole) and the negative charge is 
concentrated toward another atom (the negative pole). The tendency of 
an atom to attract electrons is called its electronegativity, a property 
that was first described by the chemist Linus Pauling. 


Knowing the electronegativity of atoms allows one to predict the nature 
of the bonds that will form between them. For example, when atoms 
with different electronegativities are covalently linked, their bonds will 
be polarized. Among the atoms typically found in biological molecules, 
oxygen and nitrogen (with electronegativities of 3.4 and 3.0, respec- 
tively) attract electrons relatively strongly, whereas an H atom (with an 
electronegativity of 2.1) attracts electrons relatively weakly. Thus the 
covalent bonds between O and H (O-H) and between N and H (N-H) are 
polar (Figure 2-11). An atom of C and an atom of H, by contrast, have 
similar electronegativities (carbon is 2.6, hydrogen 2.1) and attract elec- 
trons more equally. Thus the bond between carbon and hydrogen, C-H, 
is relatively nonpolar. 


Covalent Bonds Are Strong Enough to Survive the 
Conditions Inside Cells 


We have already seen that the covalent bond between two atoms has 
a characteristic length that depends on the atoms involved (see Figure 
2-10). A further crucial property of any chemical bond is its strength. 
Bond strength is measured by the amount of energy that must be sup- 
plied to break the bond, usually expressed in units of either kilocalories 
per mole (kcal/mole) or kilojoules per mole (kJ/mole). A kilocalorie 
is the amount of energy needed to raise the temperature of 1 liter of 
water by 1°C. Thus, if 1 kilocalorie of energy must be supplied to break 
6 x 1023 bonds of a specific type (that is, 1 mole of these bonds), then the 
strength of that bond is 1 kcal/mole. One kilocalorie is equal to about 
4.2 kJ, which is the unit of energy universally employed by physical scien- 
tists and, increasingly, by cell biologists as well. 
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(A) ethane 


(B) ethene 


Figure 2-10 Carbon-carbon double 
bonds are shorter and more rigid than 
carbon-carbon single bonds. (A) The 
ethane molecule, with a single covalent 
bond between the two carbon atoms, shows 
the tetrahedral arrangement of the three 
single covalent bonds between each carbon 
atom and its three attached H atoms. The 
CH3 groups, joined by a covalent C-C 
bond, can rotate relative to one another 
around the bond axis. (B) The double 

bond between the two carbon atoms in a 
molecule of ethene (ethylene) alters the 
bond geometry of the carbon atoms and 
brings all the atoms into the same plane; 
the double bond prevents the rotation of 
one CH2 group relative to the other. 


Figure 2-11 In polar covalent bonds, 

the electrons are shared unequally. 
Comparison of electron distributions in the 
polar covalent bonds in a molecule of water 
(H20) and the nonpolar covalent bonds in a 
molecule of oxygen (O2). In H2O, electrons 
are more strongly attracted to the oxygen 
nucleus than to the H nucleus, as indicated 
by the distributions of the partial negative 
(5) and partial positive (6*) charges. 
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QUESTION 2-3 


CHAPTER 2. Chemical Components of Cells 


Discuss whether the following 
statement is correct: “An ionic 
bond can, in principle, be thought 
of as a very polar covalent bond. 
Polar covalent bonds, then, fall 
somewhere between ionic bonds 
at one end of the spectrum and 
nonpolar covalent bonds at the 
other end.” 


Figure 2-12 Sodium chloride is held 
together by ionic bonds. (A) An atom 
of sodium (Na) reacts with an atom of 
chlorine (Cl). Electrons of each atom are 
shown in their different shells; electrons 


in the 
filled) 


chemically reactive (incompletely 
outermost shells are shown in red. 


The reaction takes place with transfer of a 


single 


electron from sodium to chlorine, 


forming two electrically charged atoms, or 
ions, each with complete sets of electrons 


in thei 


r outermost shells. The two ions have 


opposite charge and are held together by 
electrostatic attraction. (B) The product of 
the reaction between sodium and chlorine, 


crysta 


line sodium chloride, contains sodium 


and chloride ions packed closely together 


inare 


gular array in which the charges are 


exactly balanced. (C) Color photograph of 


crysta 


s of sodium chloride. 


To get an idea of what bond strengths mean, it is helpful to compare 
them with the average energies of the impacts that molecules continually 
undergo owing to collisions with other molecules in their environment— 
their thermal, or heat, energy. Typical covalent bonds are stronger than 
these thermal energies by a factor of 100, so they are resistant to being 
pulled apart by thermal motions. In living organisms, covalent bonds are 
normally broken only during specific chemical reactions that are carefully 
controlled by highly specialized protein catalysts called enzymes. 


lonic Bonds Form by the Gain and Loss of Electrons 


In some substances, the participating atoms are so different in electro- 
negativity that their electrons are not shared at all—they are transferred 
completely to the more electronegative partner. The resulting bonds, 
called ionic bonds, are usually formed between atoms that can attain 
a completely filled outer shell most easily by donating electrons to—or 
accepting electrons from—another atom, rather than by sharing them. 
For example, returning to Figure 2-5, we see that a sodium (Na) atom 
can achieve a filled outer shell by giving up the single electron in its third 
shell. By contrast, a chlorine (Cl) atom can complete its outer shell by 
gaining just one electron. Consequently, if a Na atom encounters a Cl 
atom, an electron can jump from the Na to the Cl, leaving both atoms 
with filled outer shells. The offspring of this marriage between sodium, a 
soft and intensely reactive metal, and chlorine, a toxic green gas, is table 
salt (NaCl). 


When an electron jumps from Na to Cl, both atoms become electri- 
cally charged ions. The Na atom that lost an electron now has one less 
electron than it has protons in its nucleus; it therefore has a net single 
positive charge (Na*). The Cl atom that gained an electron now has one 
more electron than it has protons and has a net single negative charge 
(CI). Because of their opposite charges, the Na+ and CI ions are attracted 
to each other and are thereby held together by an ionic bond (Figure 
2-12A). Ions held together solely by ionic bonds are generally called salts 
rather than molecules. A NaCl crystal contains astronomical numbers of 
Na* and CI ions packed together in a precise, three-dimensional array 
with their opposite charges exactly balanced: a crystal only 1 mm across 
contains about 2 x 10! ions of each type (Figure 2-12B and C). 


sodium atom (Na) chlorine atom (Cl) _ positive negative 
i sodium ion (Na*) chloride ion (CI7) j 
(A) sodium chloride (NaCl) 


Because of the favorable interaction between ions and water molecules 
(which are polar), many salts (including NaCl) are highly soluble in water. 
They dissociate into individual ions (such as Nat and Cl), each sur- 
rounded by a group of water molecules. Positive ions are called cations 
and negative ions are called anions. Small inorganic ions such as Na+, CF, 
K+, and Ca% play important parts in many biological processes, including 
the electrical activity of nerve cells, as we discuss in Chapter 12. 


In aqueous solution, ionic bonds are 10-100 times weaker than the cova- 
lent bonds that hold atoms together in molecules. But, as we will see, 
such weak interactions nevertheless play an important role in the chem- 
istry of living things. 


Hydrogen Bonds Are Important Noncovalent Bonds 
for Many Biological Molecules 


Water accounts for about 70% of a cell’s weight, and most intracellular 
reactions occur in an aqueous environment. Thus the properties of water 
have put a permanent stamp on the chemistry of living things. In each 
molecule of water (H20), the two covalent H-O bonds are highly polar 
because the O is strongly attractive for electrons whereas the H is only 
weakly attractive. Consequently, in each water molecule, there is a pre- 
ponderance of positive charge on the two H atoms and negative charge 
on the O. When a positively charged region of one water molecule (that 
is, one of its H atoms) comes close to a negatively charged region (that 
is, the O) of a second water molecule, the electrical attraction between 
them can establish a weak bond called a hydrogen bond (Figure 2-13A). 


These bonds are much weaker than covalent bonds and are easily broken 
by random thermal motions. Thus each bond lasts only an exceedingly 
short time. But the combined effect of many weak bonds is far from 
trivial. Each water molecule can form hydrogen bonds through its two 
H atoms to two other water molecules, producing a network in which 
hydrogen bonds are being continually broken and formed (see Panel 2-3, 
pp. 70-71). It is because of these interlocking hydrogen bonds that water 
at room temperature is a liquid—with a high boiling point and high sur- 
face tension—and not a gas. Without hydrogen bonds, life as we know it 
could not exist. 


Hydrogen bonds are not limited to water. In general, a hydrogen bond 
can form whenever a positively charged H atom held in one molecule 
by a polar covalent linkage comes close to a negatively charged atom— 
typically an oxygen or a nitrogen—belonging to another molecule (Figure 
2-13B). Hydrogen bonds can also occur between different parts of a 
single large molecule, where they often help the molecule fold into a 
particular shape. 


Like molecules (or salts) that carry positive or negative charges, sub- 
stances that contain polar bonds and can form hydrogen bonds also mix 
well with water. Such substances are termed hydrophilic, meaning that 
they are “water-loving.” A large proportion of the molecules in the aque- 
ous environment of a cell fall into this category, including sugars, DNA, 
RNA, and a majority of proteins. Hydrophobic (“water-fearing”) mol- 
ecules, by contrast, are uncharged and form few or no hydrogen bonds, 
and they do not dissolve in water. These and other properties of water 
are reviewed in Panel 2-2 (pp. 68-69). 


Four Types of Weak Interactions Help Bring Molecules 
Together in Cells 


Much of biology depends on specific but transient interactions between 
one molecule and another. These associations are mediated by 
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Figure 2-13 Noncovalent hydrogen bonds 
form between water molecules and 
between many other polar molecules. 

(A) A hydrogen bond forms between two 
water molecules. The slight positive charge 
associated with the hydrogen atom is 
electrically attracted to the slight negative 
charge of the oxygen atom. (B) In cells, 
hydrogen bonds commonly form between 
molecules that contain an oxygen or 
nitrogen. The atom bearing the hydrogen 
is considered the H-bond donor and the 
atom that interacts with the hydrogen is the 
H-bond acceptor. 


QUESTION 2-4 


True or false? “When NaCl is 
dissolved in water, the water 
molecules closest to the ions 

will tend to preferentially orient 
themselves so that their oxygen 
atoms face the sodium ions and 
face away from the chloride ions.” 
Explain your answer. 
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Figure 2-14 A large molecule, such as 

a protein, can bind to another protein 
through noncovalent interactions on the 
surface of each molecule. In the aqueous 
environment of a cell, many individual weak 
interactions could cause the two proteins 
to recognize each other specifically and 
form a tight complex. Shown here is a 

set of electrostatic attractions between 
complementary positive and negative 
charges. 


noncovalent bonds, such as the hydrogen bonds just discussed. Although 
these noncovalent bonds are individually quite weak, their energies can 
sum to create an effective force between two molecules. 


The ionic bonds that hold together the Na* and CF ions in a salt crystal 
(see Figure 2-12) represent a second form of noncovalent bond called an 
electrostatic attraction. Electrostatic attractions are strongest when the 
atoms involved are fully charged, as are Na+ and Cl ions. But a weaker 
electrostatic attraction can occur between molecules that contain polar 
covalent bonds (see Figure 2-11). Like hydrogen bonds, electrostatic 
attractions are extremely important in biology. For example, any large 
molecule with many polar groups will have a pattern of partial positive 
and negative charges on its surface. When such a molecule encounters 
a second molecule with a complementary set of charges, the two will 
be drawn to each other by electrostatic attraction. Even though water 
greatly reduces the strength of these attractions in most biological set- 
tings, the large number of weak noncovalent bonds that form on the 
surfaces of large molecules can nevertheless promote strong and specific 
binding (Figure 2-14). 


A third type of noncovalent bond, called a van der Waals attraction, 
comes into play when any two atoms approach each other closely. These 
nonspecific interactions spring from fluctuations in the distribution of 
electrons in every atom, which can generate a transient attraction when 
the atoms are in very close proximity. These weak attractions occur in all 
types of molecules, even those that are nonpolar and cannot form ionic 
or hydrogen bonds. The relative lengths and strengths of these three 
types of noncovalent bonds are compared to the length and strength of 
covalent bonds in Table 2-1. 


The fourth effect that often brings molecules together is not, strictly speak- 
ing, a bond at all. In an aqueous environment, a hydrophobic force is 
generated by a pushing of nonpolar surfaces out of the hydrogen-bonded 
water network, where they would otherwise physically interfere with the 
highly favorable interactions between water molecules. Hydrophobic 
forces play an important part in promoting molecular interactions—in 
particular, in building cell membranes, which are constructed largely 
from lipid molecules with long hydrocarbon tails. In these molecules, the 
H atoms are covalently linked to C atoms by nonpolar bonds (see Panel 
2-1, pp. 66-67). Because the H atoms have almost no net positive charge, 
they cannot form effective hydrogen bonds to other molecules, including 
water. As a result, lipids can form the thin membrane barriers that keep 
the aqueous interior of the cell separate from the surrounding aqueous 
environment. 


All four types of weak chemical interactions important in biology are 
reviewed in Panel 2-3 (pp. 70-71). 


TABLE 2-1 LENGTH AND STRENGTH OF SOME CHEMICAL BONDS 


Bond Type Length* (nm) Strength (kJ/mole) 


Covalent 0.10 377 [90]** 377 [90] 
Noncovalent: ionic bond 0.25 335 [80] 12.6 [3] 
Noncovalent: hydrogen bond 0.17 16.7 [4] 4.2 [1] 
Noncovalent: van der Waals 0.85) 0.4 [0.1] 0.4 [0.1] 


attraction (per atom) 


*The bond lengths and strengths listed are approximate, because the exact 
values will depend on the atoms involved. 
**Values in brackets are kcal/mole. 1 kJ = 0.239 kcal and 1 kcal = 4.184 kJ. 


Some Polar Molecules Form Acids and Bases in Water 


One of the simplest kinds of chemical reaction, and one that has pro- 
found significance for cells, takes place when a molecule with a highly 
polar covalent bond between a hydrogen and another atom dissolves in 
water. The hydrogen atom in such a bond has given up its electron almost 
entirely to the companion atom, so it exists as an almost naked positively 
charged hydrogen nucleus—in other words, a proton (H*). When the polar 
molecule becomes surrounded by water molecules, the proton will be 
attracted to the partial negative charge on the oxygen atom of an adja- 
cent water molecule (see Figure 2-11); this proton can thus dissociate 
from its original partner and associate instead with the oxygen atom of 
the water molecule, generating a hydronium ion (H30*) (Figure 2-15A). 
The reverse reaction—in which a hydronium ion releases a proton—also 
takes place very readily, so in an aqueous solution, billions of protons are 
constantly flitting to and fro between one molecule and another. 


Substances that release protons when they dissolve in water, thus form- 
ing H30*, are termed acids. The higher the concentration of H30*, the 
more acidic the solution. Even in pure water, H30* is present at a concen- 
tration of 10-7 M, as a result of the movement of protons from one water 
molecule to another (Figure 2-15B). By tradition, the H30* concentration 
is usually referred to as the H* concentration, even though most protons 
in an aqueous solution are present as H30+. To avoid the use of unwieldy 
numbers, the concentration of H* is expressed using a logarithmic scale 
called the pH scale. Pure water has a pH of 7.0 and is thus neutral—that 
is, neither acidic (pH <7) nor basic (pH >7). 


Acids are characterized as being strong or weak, depending on how 
readily they give up their protons to water. Strong acids, such as hydro- 
chloric acid (HCl), lose their protons easily. Acetic acid, on the other 
hand, is a weak acid because it holds on to its proton fairly tightly when 
dissolved in water. Many of the acids important in the cell—such as mol- 
ecules containing a carboxyl (COOH) group—are weak acids (see Panel 
2-2, pp. 68-69). Their tendency to give up a proton with some reluctance 
is exploited in a variety of cellular reactions. 


Because protons can be passed readily to many types of molecules in 
cells, thus altering the molecules’ characters, the H* concentration inside 
a cell—its pH—must be closely controlled. Acids will give up their protons 
more readily if the H* concentration is low (and the pH is high) and will 
hold onto their protons (or accept them back) when the H* concentration 
is high (and the pH is low). 
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Figure 2-15 Protons move continuously 
from one molecule to another in aqueous 
solutions. (A) The reaction that takes place 
when a molecule of acetic acid dissolves in 
water. At pH 7, nearly all of the acetic acid 
molecules are present as acetate ions. 

(B) Water molecules are continually 
exchanging protons with each other to form 
hydronium and hydroxyl ions. These ions 

in turn rapidly recombine to form water 
molecules. 


49 


50 CHAPTER 2. Chemical Components of Cells 


Figure 2-16 In aqueous solutions, the 
concentration of hydroxyl (OH) ions 
increases as the concentration of H3Ot 
(or H*) ions decreases. The product of 
the two values, [OH] x [H*], is always 10714 
(moles/liter)2. At neutral pH, [OH] = [H*], 
and both ions are present at 107 M. Also 
shown are examples of common solutions 
along with their approximate pH values. 


QUESTION 2-5 


A. Are there H30* ions present in 
pure water at neutral pH (i.e., at pH 
= 7.0)? If so, how are they formed? 
B. If they exist, what is the ratio 
of H3O* ions to H2O molecules at 
neutral pH? (Hint: the molecular 
weight of water is 18, and 1 liter of 
water weighs 1 kg.) 


[H*] [OH] some solutions and their 
moles/liter moles/liter | pH values 
1 10-4 battery acid (0.5) 
107! 10°13 stomach acid (1.5) 
107 10712 lemon juice (2.3), cola (2.5) 
ACIDIC 10° 107! | orange juice (3.5) 
10+ 10° | beer (4.5) 
10° 5 10° black coffee (5.0), acid rain (5.6) 
10° 6 10% urine (6.0), milk (6.5) 
NEUTRAL 107 7 107 pure water (7.0) 
10°! 8 106 sea water (8.0) 
10° 9 10° hand soap (9.5) 
107? 10% milk of magnesia (10.5) 
BASIC 1071! 10° household ammonia (11.9) 
10°12 107? non-phosphate detergent (12.0) 
107” 1071 bleach (12.5) 
10°14 1 caustic soda (13.5) 


Molecules that accept protons when dissolved in water are called bases. 
Just as the defining property of an acid is that it raises the concentration 
of H30* ions by donating a proton to a water molecule, so the defining 
property of a base is that it raises the concentration of hydroxyl (OH) 
ions by removing a proton from a water molecule. Sodium hydroxide 
(NaOH) is basic (the term alkaline is also used). NaOH is considered a 
strong base because it readily dissociates in aqueous solution to form 
Nat ions and OH” ions. Weak bases—which have a weak tendency to 
accept a proton from water—however, are more important in cells. Many 
biologically important weak bases contain an amino (NH2) group, which 
can generate OH- by taking a proton from water: -NH2 + H2O — -NH3* + 
OH- (see Panel 2-2, pp. 68-69). 


Because an OH- ion combines with a proton to form a water molecule, 
an increase in the OH- concentration forces a decrease in the H+ concen- 
tration, and vice versa (Figure 2-16). A pure solution of water contains 
an equal concentration (10-7 M) of both ions, rendering it neutral (pH 7). 
The interior of a cell is kept close to neutral by the presence of buffers: 
mixtures of weak acids and bases that will adjust proton concentrations 
around pH 7 by releasing protons (acids) or taking them up (bases) when- 
ever the pH changes. This give-and-take keeps the pH of the cell relatively 
constant under a variety of conditions. 


SMALL MOLECULES IN CELLS 


Having looked at the ways atoms combine to form small molecules and 
how these molecules behave in an aqueous environment, we now exam- 
ine the main classes of small molecules found in cells and their biological 
roles. Amazingly, we will see that a few basic categories of molecules, 
formed from just a handful of different elements, give rise to all the 
extraordinary richness of form and behavior displayed by living things. 


A Cell Is Formed from Carbon Compounds 


If we disregard water, nearly all the molecules in a cell are based on car- 
bon. Carbon is outstanding among all the elements in its ability to form 
large molecules. Because a carbon atom is small and has four electrons 
and four vacancies in its outer shell, it readily forms four covalent bonds 


with other atoms (see Figure 2-9). Most importantly, one carbon atom 
can link to other carbon atoms through highly stable covalent C-C bonds, 
producing rings and chains that can form the backbone of complex mol- 
ecules with no obvious upper limit to their size. These carbon-containing 
compounds are called organic molecules. By contrast, all other mol- 
ecules, including water, are said to be inorganic. 


In addition to containing carbon, the organic molecules produced by cells 
frequently contain specific combinations of atoms, such as the methyl 
(-CH3), hydroxyl (-OH), carboxyl (-COOH), carbonyl (-C=O), phosphoryl 
(-PO32-), and amino (-NH2) groups. Each of these chemical groups has 
distinct chemical and physical properties that influence the behavior of 
the molecule in which the group occurs, including whether the molecule 
tends to gain or lose protons when dissolved in water and with which 
other molecules it will interact. Knowing these groups and their chemical 
properties greatly simplifies understanding the chemistry of life. The most 
common chemical groups and some of their properties are summarized 
in Panel 2-1 (pp. 66-67). 


Cells Contain Four Major Families of Small Organic 
Molecules 


The small organic molecules of the cell are carbon compounds with 
molecular weights in the range 100-1000 that contain up to 30 or so 
carbon atoms. They are usually found free in solution in the cytosol and 
have many different roles. Some are used as monomer subunits to con- 
struct the cell’s polymeric macromolecules—its proteins, nucleic acids, 
and large polysaccharides. Others serve as energy sources, being bro- 
ken down and transformed into other small molecules in a maze of 
intracellular metabolic pathways. Many have more than one role in the 
cell—acting, for example, as both a potential subunit for a macromol- 
ecule and as an energy source. The small organic molecules are much 
less abundant than the organic macromolecules, accounting for only 
about one-tenth of the total mass of organic matter in a cell. But small 
organic molecules adopt a huge variety of chemical forms. Nearly 4000 
different kinds of small organic molecules have been detected in the 
well-studied bacterium Escherichia coli. 


All organic molecules are synthesized from—and are broken down 
into—the same set of simple compounds. Both their synthesis and their 
breakdown occur through sequences of simple chemical changes that 
are limited in variety and follow step-by-step rules. As a consequence, 
the compounds in a cell are chemically related, and most can be clas- 
sified into a small number of distinct families. Broadly speaking, cells 
contain four major families of small organic molecules: the sugars, the 
fatty acids, the amino acids, and the nucleotides (Figure 2-17). Although 
many compounds present in cells do not fit into these categories, these 
four families of small organic molecules—together with the macromol- 
ecules made by linking them into long chains—account for a large frac- 
tion of a cell’s mass (Table 2-2). 


small organic building blocks 
of the cell 


[ FATTY ACIDS 


Small Molecules in Cells 


Figure 2-17 Sugars, fatty acids, amino 
acids, and nucleotides are the four main 
families of small organic molecules 

in cells. They form the monomeric 

building blocks, or subunits, for larger 
organic molecules, including most of the 
macromolecules and other molecular 
assemblies of the cell. Some, like the sugars 
and the fatty acids, are also energy sources. 
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Figure 2-18 The structure of glucose, 

a monosaccharide, can be represented 

in several ways. (A) A structural formula 

in which the atoms are shown as chemical 
symbols, linked together by solid lines 
representing the covalent bonds. The 
thickened lines are used to indicate the plane 
of the sugar ring and to show that the -H 

and -OH groups are not in the same plane as 
the ring. (B) Another kind of structural formula 
that shows the three-dimensional structure of 
glucose in a so-called “chair configuration.” 
(C) A ball-and-stick model in which the 
three-dimensional arrangement of the atoms 
in space is indicated. (D) A space-filling 
model, which, as well as depicting the three- 
dimensional arrangement of the atoms, also 
shows the relative sizes and surface contours 
of the molecule (Movie 2.1). The atoms in (C) 
and (D) are colored as in Figure 2-9: C, black; 
H, white; O, red. This is the conventional 
color-coding for these atoms and will be used 
throughout this book. 


Chemical Components of Cells 


Cell Weight of Types in Each Class 
1 
20 


Water 70 

Inorganic ions 1 

Sugars and precursors 1 250 
Amino acids and precursors 0.4 100 
Nucleotides and precursors 0.4 100 
Fatty acids and precursors 1 50 
Other small molecules 0.2 3000 
Phospholipids 2 4* 
Macromolecules (nucleic acids, 24 3000 


proteins, and polysaccharides) 


*There are four classes of phospholipids, each of which exists in many varieties 
(discussed in Chapter 4). 


Sugars Are both Energy Sources and Subunits of 
Polysaccharides 


The simplest sugars—the monosaccharides—are compounds with the 
general formula (CH2O)n, where n is usually 3, 4, 5, or 6. Glucose, for 
example, has the formula C¢Hi206 (Figure 2-18). Because of this simple 
formula, sugars, and the larger molecules made from them, are called 
carbohydrates. The formula, however, does not adequately define the 
molecule: the same set of carbons, hydrogens, and oxygens can be joined 
together by covalent bonds in a variety of ways, creating structures with 
different shapes. Thus glucose can be converted into a different sugar— 
mannose or galactose—simply by switching the orientations of specific 
-OH groups relative to the rest of the molecule (Panel 2-4, pp. 72-73). 
In addition, each of these sugars can exist in either of two forms, called 
the p-form and the t-form, which are mirror images of each other. Sets 
of molecules with the same chemical formula but different structures 
are called isomers, and mirror-image pairs of such molecules are called 


CH,OH 


CHOH o 


(B) 


optical isomers. Isomers are widespread among organic molecules in 
general, and they play a major part in generating the enormous variety 
of sugars. A more complete outline of sugar structures and chemistry is 
presented in Panel 2-4. 


Monosaccharides can be linked by covalent bonds—called glycosidic 
bonds—to form larger carbohydrates. Two monosaccharides linked 
together make a disaccharide, such as sucrose, which is composed of 
a glucose and a fructose unit. Larger sugar polymers range from the 
oligosaccharides (trisaccharides, tetrasaccharides, and so on) up to 
giant polysaccharides, which can contain thousands of monosaccharide 
subunits (monomers). In most cases, the prefix oligo- is used to refer to 
molecules made of a small number of monomers, typically 2 to 10 in the 
case of oligosaccharides. Polymers, in contrast, can contain hundreds or 
thousands of subunits. 


The way sugars are linked together illustrates some common features of 
biochemical bond formation. A bond is formed between an -OH group 
on one sugar and an -OH group on another by a condensation reaction, 
in which a molecule of water is expelled as the bond is formed (Figure 
2-19). The subunits in other biological polymers, including nucleic acids 
and proteins, are also linked by condensation reactions in which water 
is expelled. The bonds created by all of these condensation reactions can 
be broken by the reverse process of hydrolysis, in which a molecule of 
water is consumed. Generally speaking, condensation reactions, which 
synthesize larger molecules from smaller subunits, are energetically 
unfavorable; hydrolysis reactions, which break down larger molecules 
into smaller subunits, are energetically favorable (Figure 2-20). 


Because each monosaccharide has several free hydroxyl groups that can 
form a link to another monosaccharide (or to some other compound), 
sugar polymers can be branched, and the number of possible polysac- 
charide structures is extremely large. For this reason, it is much more 
difficult to determine the arrangement of sugars in a complex polysaccha- 
ride than it is to determine the nucleotide sequence of a DNA molecule or 
the amino acid sequence of a protein, in which each unit is joined to the 
next in exactly the same way. 


The monosaccharide glucose has a central role as an energy source for 
cells, as we explain in Chapter 13. It is broken down to smaller molecules 
in a series of reactions, releasing energy that the cell can harness to do 
useful work. Cells use simple polysaccharides composed only of glucose 
units—principally glycogen in animals and starch in plants—as long-term 
stores of glucose, held in reserve for energy production. 


Sugars do not function exclusively in the production and storage of 
energy. They are also used, for example, to make mechanical supports. 
The most abundant organic molecule on Earth—the cellulose that forms 
plant cell walls—is a polysaccharide of glucose. Another extraordinarily 
abundant organic substance, the chitin of insect exoskeletons and fungal 
cell walls, is also a polysaccharide—in this case, a linear polymer of a 
sugar derivative called N-acetylglucosamine (see Panel 2-4, pp. 72-73). 
Other polysaccharides, which tend to be slippery when wet, are the main 
components of slime, mucus, and gristle. 
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Figure 2-19 Two monosaccharides can 

be linked by a covalent glycosidic bond 

to form a disaccharide. This reaction 
belongs to a general category of reactions 
termed condensation reactions, in which two 
molecules join together as a result of the loss 
of a water molecule. The reverse reaction (in 
which water is added) is termed hydrolysis. 


Figure 2-20 Condensation and hydrolysis 
are reverse reactions. The large polymeric 
macromolecules of the cell are formed from 
subunits (or monomers) by condensation 
reactions, and they are broken down 

by hydrolysis. Condensation reactions 

are energetically unfavorable; thus 
macromolecule formation requires an input 
of energy, as we discuss in Chapter 3. 
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Figure 2-21 Fatty acids have both 
hydrophobic and hydrophilic components. 
The hydrophobic hydrocarbon chain is 
attached to a hydrophilic carboxylic acid 
group. Different fatty acids have different 
hydrocarbon tails. Palmitic acid is shown 
here. (A) Structural formula, showing the 
carboxylic acid head group in its ionized 
form, as it exists in water at pH 7. (B) Ball- 
and-stick model. (C) Space-filling model 
(Movie 2.2). 


Smaller oligosaccharides can be covalently linked to proteins to form gly- 
coproteins, or to lipids to form glycolipids (Panel 2-5, pp. 74-75), which 
are both found in cell membranes. The sugar side chains attached to 
glycoproteins and glycolipids in the plasma membrane are thought to 
help protect the cell surface and often help cells adhere to one another. 
Differences in the types of cell-surface sugars form the molecular basis 
for the human blood groups, information that dictates which blood types 
can be used during transfusions. 


Fatty Acid Chains Are Components of Cell Membranes 


A fatty acid molecule, such as palmitic acid, has two chemically distinct 
regions. One is a long hydrocarbon chain, which is hydrophobic and not 
very reactive chemically. The other is a carboxyl (-COOH) group, which 
behaves as an acid (carboxylic acid): in an aqueous solution, it is ion- 
ized (-COO’), extremely hydrophilic, and chemically reactive (Figure 
2-21). Molecules—such as fatty acids—that possess both hydrophobic 
and hydrophilic regions are termed amphipathic. Almost all the fatty acid 
molecules in a cell are covalently linked to other molecules by their car- 
boxylic acid group (see Panel 2-5, pp. 74-75). 


The hydrocarbon tail of palmitic acid is saturated: it has no double bonds 
between its carbon atoms and contains the maximum possible number 
of hydrogens. Some other fatty acids, such as oleic acid, have unsatu- 
rated tails, with one or more double bonds along their length. The double 
bonds create kinks in the hydrocarbon tails, interfering with their ability 
to pack together. Fatty acid tails are found in cell membranes, where 
the tightness of their packing affects the fluidity of the membrane. The 
many different fatty acids found in cells differ only in the length of their 
hydrocarbon chains and in the number and position of the carbon- 
carbon double bonds (see Panel 2-5). 


Fatty acids serve as a concentrated food reserve in cells: they can be bro- 
ken down to produce about six times as much usable energy, gram for 
gram, as glucose. Fatty acids are stored in the cytoplasm of many cells 
in the form of fat droplets composed of triacylglycerol molecules—com- 
pounds made of three fatty acid chains covalently joined to a glycerol 
molecule (Figure 2-22 and see Panel 2-5). Triacylglycerols are the ani- 
mal fats found in meat, butter, and cream, and the plant oils such as 
corn oil and olive oil. When a cell needs energy, the fatty acid chains 


hydrophilic [> © oO 
carboxylic YZ 
acid head C 


hydrophobic 
hydrocarbon tail CH3 


can be released from triacylglycerols and broken down into two-carbon 
units. These two-carbon units are identical to those derived from the 
breakdown of glucose, and they enter the same energy-yielding reaction 
pathways, as described in Chapter 13. 


Fatty acids and their derivatives, including triacylglycerols, are examples 
of lipids. Lipids are loosely defined as molecules that are insoluble in 
water but soluble in fat and organic solvents such as benzene. They typi- 
cally contain long hydrocarbon chains, as in the fatty acids, or multiple 
linked aromatic rings, as in the steroids (see Panel 2-5). 


The most unique function of fatty acids is in the establishment of the lipid 
bilayer, the structure that forms the basis for all cell membranes. These 
thin sheets, which enclose all cells and surround their internal organelles, 
are composed largely of phospholipids (Figure 2-23). 


Like triacylglycerols, most phospholipids are constructed mainly from fatty 
acids and glycerol. In these phospholipids, however, the glycerol is joined 
to two fatty acid chains, rather than to three as in triacylglycerols. The 
remaining -OH group on the glycerol is linked to a hydrophilic phosphate 
group, which in turn is attached to a small hydrophilic compound such as 
choline (see Panel 2-5, pp. 74-75). With their two hydrophobic fatty acid 
tails and a hydrophilic, phosphate-containing head, phospholipids are 
strongly amphipathic. This characteristic amphipathic composition and 
shape gives them very different physical and chemical properties from 
triacylglycerols, which are predominantly hydrophobic. In addition to 
phospholipids, cell membranes contain differing amounts of other lipids, 
including glycolipids, which are structurally similar to phospholipids but 
contain one or more sugars instead of a phosphate group. 


Thanks to their amphipathic nature, pure phospholipids readily form 
membranes in water. These lipids can spread over the surface of water 
to form a monolayer, with their hydrophobic tails facing the air and their 
hydrophilic heads in contact with the water. Alternatively, two of these 
phospholipid layers can readily combine tail-to-tail in water to form the 
phospholipid sandwich that is the lipid bilayer (see Chapter 11). 
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Figure 2-22 The properties of fats 
depend on the length and saturation 

of the fatty acid chains they carry. Fatty 
acids are stored in the cytosol of many cells 
in the form of droplets of triacylglycerol 
molecules made of three fatty acid chains 
joined to a glycerol molecule. (A) Saturated 
fats are found in meat and dairy products. 
(B) Plant oils, such as corn oil, contain 
unsaturated fatty acids, which may be 
monounsaturated (containing one double 
bond) or polyunsaturated (containing 
multiple double bonds). The presence of 
these double bonds causes plant oils to 

be liquid at room temperature. Although 
fats are essential in the diet, saturated fats 
raise the concentration of cholesterol in 
the blood, which tends to clog the arteries, 
increasing the risk of heart attacks and 
strokes. 


phospholipid 
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Figure 2-23 Phospholipids can aggregate to form cell membranes. Phospholipids contain two hydrophobic fatty acid tails and 

a hydrophilic head. (A) Phosphatidylcholine is the most common phospholipid in cell membranes. (B) Diagram showing how, in an 
aqueous environment, the hydrophobic tails of phospholipids pack together to form a lipid bilayer. In the lipid bilayer, the hydrophilic 
heads of the phospholipid molecules are on the outside, facing the aqueous environment, and the hydrophobic tails are on the inside, 


where water is excluded. 
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Figure 2-24 All amino acids have an amino carboxyl 
amino group, a carboxyl group, and a group group 
side chain (R) attached to their a-carbon \ 

: H H 

atom. In the cell, where the pH is close to | a | B 
7, free amino acids exist in their ionized HN —C — H,N —C — COO 
form; but, when they are incorporated into | pH7 | 
a polypeptide chain, the charges on their CH3 CH3 
amino and carboxyl groups are lost. (A) The %-carbon \ 


amino acid shown is alanine, one of the 
simplest amino acids, which has a methyl 
group (CH3) as its side chain. Its amino 
group is highlighted in blue and its 
carboxyl group in red. (B) A ball-and-stick 
model and (C) a space-filling model of 
alanine. In (B) and (C), the N atom is blue 
and the O atom is red. 


QUESTION 2-6 


Why do you suppose only L-amino 
acids and not a random mixture of 
the L- and p-forms of each amino 
acid are used to make proteins? 
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Figure 2-25 Amino acids in a protein 

are held together by peptide bonds. 

The four amino acids shown are linked 
together by three peptide bonds, one of 
which is highlighted in yellow. One of the 
amino acids, glutamic acid, is shaded in 
gray. The amino acid side chains are shown 
in red. The N-terminus of the polypeptide 
chain is capped by an amino group, and 
the C-terminus ends in a carboxyl group. 
The sequence of amino acids in a protein is 
abbreviated using either a three-letter or a 
one-letter code, and the sequence is always 
read starting from the N-terminus (see Panel 
2-6, pp. 76-77). In the example given, the 
sequence is Phe-Ser-Glu-Lys (or FSEK). 
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Amino Acids Are the Subunits of Proteins 


Amino acids are small organic molecules with one defining property: they 
all possess a carboxylic acid group and an amino group, both attached 
to a central a-carbon atom (Figure 2-24). This a-carbon also carries a 
specific side chain, the identity of which distinguishes one amino acid 
from another. 


Cells use amino acids to build proteins—polymers made of amino acids, 
which are joined head-to-tail in a long chain that folds up into a three- 
dimensional structure that is unique to each type of protein. The covalent 
bond between two adjacent amino acids in a protein chain is called a pep- 
tide bond, and the resulting chain of amino acids is therefore also known 
as a polypeptide. Peptide bonds are formed by condensation reactions 
that link one amino acid to the next. Regardless of the specific amino 
acids from which it is made, the polypeptide always has an amino (NH2) 
group at one end—its N-terminus—and a carboxyl (COOH) group at its 
other end—its C-terminus (Figure 2-25). This difference in the two ends 
gives a polypeptide a definite directionality—a structural (as opposed to 
electrical) polarity. 


Twenty types of amino acids are commonly found in proteins, each with a 
different side chain attached to its a-carbon atom (Panel 2-6, pp. 76-77). How 
this precise set of 20 amino acids came to be chosen is one of the mysteries 
surrounding the evolution of life; there is no obvious chemical reason why 
other amino acids could not have served just as well. But once the selection 
had been locked into place, it could not be changed, as too much chemistry 
had evolved to exploit it. Switching the types of amino acids used by cells— 
whether bacterial, plant, or animal—would require the organism to retool its 
entire metabolism to cope with the new building blocks. 


Like sugars, all amino acids (except glycine) exist as optical isomers 
termed D- and -forms (see Panel 2-6). But only -forms are ever found 
in proteins (although p-amino acids occur as part of bacterial cell walls 
and in some antibiotics, and p-serine is used as a signal molecule in the 
brain). The origin of this exclusive use of -amino acids to make proteins 
is another evolutionary mystery. 


The chemical versatility that the 20 standard amino acids provide is vitally 
important to the function of proteins. Five of the 20 amino acids—includ- 
ing lysine and glutamic acid, shown in Figure 2-25—have side chains 
that form ions in solution and can therefore carry a charge. The others 
are uncharged. Some amino acids are polar and hydrophilic, and some 
are nonpolar and hydrophobic (see Panel 2-6). As we discuss in Chapter 
4, the collective properties of the amino acid side chains underlie all the 
diverse and sophisticated functions of proteins. And proteins, which con- 
stitute half the dry mass of a cell, lie at the center of life’s chemistry. 


Nucleotides Are the Subunits of DNA and RNA 


DNA and RNA are built from subunits called nucleotides. Nucleotides 
consist of a nitrogen-containing ring compound linked to a five-carbon 
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sugar that has one or more phosphate groups attached to it (Panel 2-7, 
pp. 78-79). The sugar can be either ribose or deoxyribose. Nucleotides 
containing ribose are known as ribonucleotides, and those containing 
deoxyribose are known as deoxyribonucleotides. 


The nitrogen-containing rings of all these molecules are generally referred 
to as bases for historical reasons: under acidic conditions, they can each 
bind an H+ (proton) and thereby increase the concentration of OH- ions 
in aqueous solution. There is a strong family resemblance between the 
different nucleotide bases. Cytosine (C), thymine (T), and uracil (U) are 
called pyrimidines, because they all derive from a six-membered pyrimi- 
dine ring; guanine (G) and adenine (A) are purines, which bear a second, 
five-membered ring fused to the six-membered ring. Each nucleotide is 
named after the base it contains (see Panel 2-7, pp. 78-79). A base plus 
its sugar (without any phosphate group attached) is called a nucleoside. 


Nucleoside di- and triphosphates can act as short-term carriers of chemi- 
cal energy. Above all others, the ribonucleoside triphosphate known 
as adenosine triphosphate, or ATP (Figure 2-26), participates in the 
transfer of energy in hundreds of metabolic reactions. ATP is formed 
through reactions that are driven by the energy released from the break- 
down of foodstuffs. Its three phosphates are linked in series by two 
phosphoanhydride bonds (see Panel 2-7). Rupture of these phosphate 
bonds by hydrolysis releases large amounts of useful energy, also known 
as free energy (see Panel 3-1, pp. 94-95). Most often, it is the terminal 
phosphate group that is split off—or transferred to another molecule— 
to release energy that can be used to drive biosynthetic reactions 
(Figure 2-27). Other nucleotide derivatives serve as carriers for other 
chemical groups. All of this is described in Chapter 3. 
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Figure 2-26 Adenosine triphosphate 
(ATP) is a crucially important energy 
carrier in cells. (A) Structural formula, 
in which the three phosphate groups 
are shaded in yellow. The presence of 
the OH group on the second carbon of 
the sugar ring (red) distinguishes this 
sugar as ribose. (B) Ball-and-stick model 
(Movie 2.3). In (B), the P atoms are 
yellow. 


Figure 2-27 ATP is synthesized from ADP 
and inorganic phosphate, and it releases 
energy when it is hydrolyzed back to 
ADP and inorganic phosphate. The energy 
required for ATP synthesis is derived from 
either the energy-yielding oxidation of 
foodstuffs (in animal cells, fungi, and some 
bacteria) or the capture of light (in plant 
cells and some bacteria). The hydrolysis of 
ATP releases energy that is used to drive 
many processes inside cells. Together, the 
two reactions shown form the ATP cycle. 
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5’ end 


Figure 2-28 A short length of one 

chain of a deoxyribonucleic acid 

(DNA) molecule shows the covalent 
phosphodiester bonds linking four 
consecutive nucleotides. Because the 
bonds link specific carbon atoms in the 
sugar ring—known as the 5’ and 3’ carbon 
atoms—one end of a polynucleotide chain, 
the 5’ end, has a free phosphate group and 
the other, the 3’ end, has a free hydroxy! 
group. One of the nucleotides, T, is shaded 
in gray, and one phosphodiester bond is 
highlighted in yellow. The linear sequence 
of nucleotides in a polynucleotide chain is 
commonly abbreviated using a one-letter 
code, and the sequence is always read from 
the 5’ end. In the example illustrated, the 
sequence is GATC. 


Figure 2-29 Macromolecules are 
abundant in cells. The approximate 
composition (by mass) of a bacterial cell 
is shown. The composition of an animal 
cell is similar. 


Chemical Components of Cells 


Nucleotides also have a fundamental role in the storage and retrieval of 
biological information. They serve as building blocks for the construc- 
tion of nucleic acids—long polymers in which nucleotide subunits are 
linked by the formation of covalent phosphodiester bonds between the 
phosphate group attached to the sugar of one nucleotide and a hydroxyl 
group on the sugar of the next nucleotide (Figure 2-28). Nucleic acid 
chains are synthesized from energy-rich nucleoside triphosphates by 
a condensation reaction that releases inorganic pyrophosphate during 
phosphodiester bond formation (see Panel 2-7, pp. 78-79). 


There are two main types of nucleic acids, which differ in the type of sugar 
contained in their sugar-phosphate backbone. Those based on the sugar 
ribose are known as ribonucleic acids, or RNA, and contain the bases 
A, G, C, and U. Those based on deoxyribose (in which the hydroxyl group 
at the 2' position of the ribose carbon ring is replaced by a hydrogen) are 
known as deoxyribonucleic acids, or DNA, and contain the bases A, G, 
C, and T (T is chemically similar to the U in RNA; see Panel 2-7). RNA usu- 
ally occurs in cells in the form of a single-stranded polynucleotide chain, 
but DNA is virtually always in the form of a double-stranded molecule: 
the DNA double helix is composed of two polynucleotide chains that run 
in opposite directions and are held together by hydrogen bonds between 
the bases of the two chains (see Panel 2-3, pp. 70-71). 


The linear sequence of nucleotides in a DNA or an RNA molecule encodes 
genetic information. The two nucleic acids, however, have different roles 
in the cell. DNA, with its more stable, hydrogen-bonded helix, acts as 
a long-term repository for hereditary information, while single-stranded 
RNA is usually a more transient carrier of molecular instructions. The 
ability of the bases in different nucleic acid molecules to recognize and 
pair with each other by hydrogen-bonding (called base-pairing)—G with 
C, and A with either T or U—underlies all of heredity and evolution, as 
explained in Chapter 5. 


MACROMOLECULES IN CELLS 


On the basis of mass, macromolecules are by far the most abundant of 
the organic molecules in a living cell (Figure 2-29). They are the principal 
building blocks from which a cell is constructed and also the components 
that confer the most distinctive properties on living things. Intermediate 
in size and complexity between small organic molecules and organelles, 
macromolecules are constructed simply by covalently linking small 
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Figure 2-30 Polysaccharides, proteins, and nucleic acids are made 
from monomeric subunits. Each macromolecule is a polymer formed 
from small molecules (called monomers or subunits) that are linked 
together by covalent bonds. 


organic monomers, or subunits, into long chains, or polymers (Figure 
2-30 and How We Know, pp. 60-61). Yet they have many unexpected 
properties that could not have been predicted from their simple constitu- 
ents. For example, it took a long time to determine that the nucleic acids, 
DNA and RNA, store and transmit hereditary information (see How We 
Know, Chapter 5, pp. 193-195). 


Proteins are especially versatile and perform thousands of distinct func- 
tions. Many proteins act as highly specific enzymes that catalyze the 
chemical reactions that take place in cells. For example, one enzyme in 
plants, called ribulose bisphosphate carboxylase, converts CO3 to sugars, 
thereby creating most of the organic matter used by the rest of the living 
world. Other proteins are used to build structural components: tubulin, for 
example, self-assembles to make the cell’s long, stiff microtubules (see 
Figure 1-27B), and histone proteins assemble into disc-like structures 
that help wrap up the cell’s DNA in chromosomes. Yet other proteins, such 
as myosin, act as molecular motors to produce force and movement. We 
examine the molecular basis for many of these wide-ranging functions in 
later chapters. Here, we consider some of the general principles of macro- 
molecular chemistry that make all of these activities possible. 


Each Macromolecule Contains a Specific Sequence of 
Subunits 


Although the chemical reactions for adding subunits to each polymer are 
different in detail for proteins, nucleic acids, and polysaccharides, they 
share important features. Each polymer grows by the addition of a mono- 
mer onto one end of the polymer chain via a condensation reaction, in 
which a molecule of water is lost for each subunit that is added (Figure 
2-31). In all cases, the reactions are catalyzed by specific enzymes, which 
ensure that only the appropriate monomer is incorporated. 


The stepwise polymerization of monomers into a long chain is a simple 
way to manufacture a large, complex molecule, because the subunits are 
added by the same reaction performed over and over again by the same 
set of enzymes. In a sense, the process resembles the repetitive opera- 
tion of a machine in a factory—with some important differences. First, 
apart from some of the polysaccharides, most macromolecules are made 
from a set of monomers that are slightly different from one another; for 
example, proteins are constructed from 20 different amino acids (see 
Panel 2-6, pp. 76-77). Second, and most important, the polymer chain is 
not assembled at random from these subunits; instead, the subunits are 
added in a particular order, or sequence. 


The biological functions of proteins, nucleic acids, and many polysac- 
charides are absolutely dependent on the particular sequence of subunits 
in the linear chains. By varying the sequence of subunits, the cell could 
in principle make an enormous diversity of the polymeric molecules. 
Thus, for a protein chain 200 amino acids long, there are 202 possi- 
ble combinations (20 x 20 x 20 x 20... multiplied 200 times), while for a 
DNA molecule 10,000 nucleotides long (small by DNA standards), with 
its four different nucleotides, there are 4!°° different possibilities—an 
unimaginably large number. Thus the machinery of polymerization must 
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QUESTION 2-7 


What is meant by “polarity” of a 
polypeptide chain and by “polarity” 
of a chemical bond? How do the 
meanings differ? 


subunit growing polymer 


‘ae 


Figure 2-31 Macromolecules are formed 
by adding subunits to one end of a chain. 
In a condensation reaction, a molecule 

of water is lost with the addition of each 
monomer to one end of the growing chain. 
The reverse reaction—the breakdown of the 
polymer—occurs by the addition of water 
(hydrolysis). See also Figure 2-19. 
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HOW WE KNOW 


THE DISCOVERY OF MACROMOLECULES 


The idea that proteins, polysaccharides, and nucleic 
acids are large molecules that are constructed from 
smaller subunits, linked one after another into long 
molecular chains, may seem fairly obvious today. But 
this was not always the case. In the early part of the 
twentieth century, few scientists believed in the exist- 
ence of such biological polymers built from repeating 
units held together by covalent bonds. The notion that 
such “frighteningly large” macromolecules could be 
assembled from simple building blocks was considered 
“downright shocking” by chemists of the day. Instead, 
they thought that proteins and other seemingly large 
organic molecules were simply heterogeneous aggre- 
gates of small organic molecules held together by weak 
“association forces” (Figure 2-32). 


The first hint that proteins and other organic polymers 
are large molecules came from observing their behav- 
ior in solution. At the time, scientists were working 
with various proteins and carbohydrates derived from 
foodstuffs and other organic materials—albumin from 
egg whites, casein from milk, collagen from gelatin, 
and cellulose from wood. Their chemical compositions 
seemed simple enough: like other organic molecules, 
they contained carbon, hydrogen, oxygen, and, in the 
case of proteins, nitrogen. But they behaved oddly in 
solution, showing, for example, an inability to pass 
through a fine filter. 


Why these molecules misbehaved in solution was a 
puzzle. Were they really giant molecules, composed 
of an unusually large number of covalently linked 
atoms? Or were they more like a colloidal suspension 
of particles—a big, sticky hodgepodge of small organic 
molecules that associate only loosely? 


Figure 2-32 What might an organic macromolecule look 
like? Chemists in the early part of the twentieth century debated 
whether proteins, polysaccharides, and other apparently large 
organic molecules were (A) discrete particles made of an 
unusually large number of covalently linked atoms or (B) a loose 
aggregation of heterogeneous small organic molecules held 
together by weak forces. 


One way to distinguish between the two possibili- 
ties was to determine the actual size of one of these 
molecules. If a protein such as albumin were made of 
molecules all identical in size, that would support the 
existence of true macromolecules. Conversely, if albu- 
min were instead a miscellaneous conglomeration of 
small organic molecules, these should show a whole 
range of molecular sizes in solution. 


Unfortunately, the techniques available to scientists in 
the early 1900s were not ideal for measuring the sizes of 
such large molecules. Some chemists estimated a pro- 
tein’s size by determining how much it would lower a 
solution's freezing point; others measured the osmotic 
pressure of protein solutions. These methods were sus- 
ceptible to experimental error and gave variable results. 
Different techniques, for example, suggested that cel- 
lulose was anywhere from 6000 to 103,000 daltons in 
mass (where 1 dalton is approximately equal to the 
mass of a hydrogen atom). Such results helped to fuel 
the hypothesis that carbohydrates and proteins were 
loose aggregates of small molecules rather than true 
macromolecules. 


Many scientists simply had trouble believing that 
molecules heavier than about 4000 daltons—the larg- 
est compound that had been synthesized by organic 
chemists—could exist at all. Take hemoglobin, the oxy- 
gen-carrying protein in red blood cells. Researchers tried 
to estimate its size by breaking it down into its chemical 
components. In addition to carbon, hydrogen, nitrogen, 
and oxygen, hemoglobin contains a small amount of 
iron. Working out the percentages, it appeared that 
hemoglobin had one atom of iron for every 712 atoms 
of carbon—and a minimum weight of 16,700 daltons. 
Could a molecule with hundreds of carbon atoms in one 
long chain remain intact in a cell and perform specific 
functions? Emil Fischer, the organic chemist who deter- 
mined that the amino acids in proteins are linked by 
peptide bonds, thought that a polypeptide chain could 
grow no longer than about 30 or 40 amino acids. As 
for hemoglobin, with its purported 700 carbon atoms, 
the existence of molecular chains of such “truly fantas- 
tic lengths” was deemed “very improbable” by leading 
chemists. 


Definitive resolution of the debate had to await the 
development of new techniques. Convincing evidence 
that proteins are macromolecules came from studies 
using the ultracentrifuge—a device that uses centrifu- 
gal force to separate molecules according to their size 
(see Panel 4-3, pp. 164-165). Theodor Svedberg, who 
designed the machine in 1925, performed the first stud- 
ies. If a protein were really an aggregate of smaller 
molecules, he reasoned, it would appear as a smear 
of molecules of different sizes when sedimented in an 


ultracentrifuge. Using hemoglobin as his test protein, 
Svedberg found that the centrifuged sample revealed a 
single, sharp band with a molecular weight of 68,000 
daltons. The finding strongly supported the theory that 
proteins are true macromolecules (Figure 2-33). 


Additional evidence continued to accumulate through- 
out the 1930s, when other researchers were able 
to obtain crystals of pure protein that could be studied 
by x-ray diffraction. Only molecules with a uniform size 
and shape can form highly ordered crystals and diffract 
x-rays in such a way that their three-dimensional struc- 
ture can be determined, as we discuss in Chapter 4. 
A heterogeneous suspension could not be studied in 
this way. 
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Macromolecules in Cells 


We now take it for granted that large macromolecules 
carry out many of the most important activities in living 
cells. But chemists once viewed the existence of such 
polymers with the same sort of skepticism that a zoolo- 
gist might show on being told that “In Africa, there are 
elephants that are 100 meters long and 20 meters tall.” 
It took decades for researchers to master the techniques 
required to convince everyone that molecules ten times 
larger than anything they had ever encountered were 
a cornerstone of biology. As we shall see throughout 
this book, such a labored pathway to discovery is not 
unusual, and progress in science—as in the discovery 
of macromolecules—is often driven by advances in 
technology. 
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Figure 2-33 The ultracentrifuge helped to settle the debate about the nature of macromolecules. In the ultracentrifuge, 
centrifugal forces exceeding 500,000 times the force of gravity can be used to separate proteins or other large molecules. (A) In a 


modern ultracentrifuge, samples are loaded in a thin layer on top of a gradient of sucrose solution formed in a tube. The tube is placed 
in a metal rotor that is rotated at high speed in a vacuum. Molecules of different sizes sediment at different rates, and these molecules 
will therefore move as distinct bands in the sample tube. If hemoglobin were a loose aggregate of heterogeneous peptides, it would 
show a broad smear of sizes after centrifugation (top tube). Instead, it appears as a sharp band with a molecular weight of 68,000 
daltons (bottom tube). Although the ultracentrifuge is now a standard, almost mundane, fixture in most biochemistry laboratories, its 
construction was a huge technological challenge. The centrifuge rotor must be capable of spinning centrifuge tubes at high speeds for 
many hours at constant temperature and with high stability to avoid disrupting the gradient and ruining the samples. In 1926, Svedberg 
won the Nobel Prize in Chemistry for his ultracentrifuge design and its application to chemistry. (B) In his actual experiment, Svedberg 
filled a special tube in the centrifuge with a homogeneous solution of hemoglobin; by shining light through the tube, he then carefully 
monitored the moving boundary between the sedimenting protein molecules and the clear aqueous solution left behind (so-called 
boundary sedimentation). The more recently developed method shown in (A) is a form of band sedimentation. 
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Figure 2-34 Most proteins and 

many RNA molecules fold into a 
particularly stable three-dimensional 
shape, or conformation. This shape is 
directed mostly by a multitude of weak, 
noncovalent, intramolecular bonds. If the 
folded macromolecules are subjected to 
conditions that disrupt noncovalent bonds, 
the molecule becomes a flexible chain 
that loses both its conformation and its 
biological activity. 


QUESTION 2-8 


In principle, there are many 
different, chemically diverse 

ways in which small molecules 

can be joined together to form 
polymers. For example, the small 
molecule ethene (CH2=CH2) is 
used commercially to make the 
plastic polyethylene (...-CH2—-CH2— 
CH2—-CH2-CH>-...). The individual 
subunits of the three major classes 
of biological macromolecules, 
however, are all linked by similar 
reaction mechanisms—that is, 

by condensation reactions that 
eliminate water. Can you think of 
any benefits that this chemistry 
offers and why it might have been 
selected in evolution over a linking 
chemistry such as that used to 
produce polyethylene? 


Chemical Components of Cells 


2 


— 
a vi CONDITIONS 


THAT DISRUPT 


aq © NONCOVALENT 
> BONDS 
a stable folded unstructured 


conformation polymer chains 


be subject to a sensitive control that allows it to specify exactly which 
subunit should be added next to the growing polymer end. We discuss 
the mechanisms that specify the sequence of subunits in DNA, RNA, and 
protein molecules in Chapters 6 and 7. 


Noncovalent Bonds Specify the Precise Shape of a 
Macromolecule 


Most of the single covalent bonds that link together the subunits in a 
macromolecule allow rotation of the atoms that they join; thus the pol- 
ymer chain has great flexibility. In principle, this allows a single-chain 
macromolecule to adopt an almost unlimited number of shapes, or con- 
formations, as the polymer chain writhes and rotates under the influence 
of random thermal energy. However, the shapes of most biological mac- 
romolecules are highly constrained because of weaker, noncovalent 
bonds that form between different parts of the molecule. These weaker 
interactions are the electrostatic attractions, hydrogen bonds, van der 
Waals attractions, and hydrophobic force we described earlier (see Panel 
2-3). In many cases, noncovalent interactions ensure that the polymer 
chain preferentially adopts one particular conformation, determined 
by the linear sequence of monomers in the chain. Most protein mole- 
cules and many of the RNA molecules found in cells fold tightly into a 
highly preferred conformation in this way (Figure 2-34). These unique 
conformations—shaped by billions of years of evolution—determine the 
chemistry and activity of these macromolecules and dictate their interac- 
tions with other biological molecules. 


Noncovalent Bonds Allow a Macromolecule to Bind 
Other Selected Molecules 


As we discussed earlier, although noncovalent bonds are individually 
weak, they can add up to create a strong attraction between two mol- 
ecules when these molecules fit together very closely, like a hand in a 
glove, so that many noncovalent bonds can occur between them (see 
Panel 2-3). This form of molecular interaction provides for great specific- 
ity in the binding of a macromolecule to other small and large molecules, 
because the multipoint contacts required for strong binding make it pos- 
sible for a macromolecule to select just one of the many thousands of 
different molecules present inside a cell. Moreover, because the strength 
of the binding depends on the number of noncovalent bonds that are 
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Figure 2-35 Noncovalent bonds mediate interactions between macromolecules. They can also mediate interactions between a 


macromolecule and small molecules (see Movie 2.4). 


formed, associations of almost any strength are possible. As one example, 
binding of this type makes it possible for proteins to function as enzymes. 
Enzymes recognize their substrates via noncovalent interactions, and an 
enzyme that acts on a positively charged substrate will often use a nega- 
tively charged amino acid side chain to guide the substrate to its proper 
position. We discuss such interactions in greater detail in Chapter 4. 


Noncovalent bonds can also stabilize associations between any two 
macromolecules, as long as their surfaces match closely (Figure 2-35). 
Such associations allow macromolecules to be used as building blocks 
for the formation of much larger structures. For example, proteins often 
bind together into multiprotein complexes that function as intricate 
machines with multiple moving parts, carrying out such complex tasks 
as DNA replication and protein synthesis (Figure 2-36). In fact, noncova- 
lent bonds account for a great deal of the complex chemistry that makes 
life possible. 
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Figure 2-36 Both covalent bonds and noncovalent bonds are needed to form a 
macromolecular assembly such as a ribosome. Covalent bonds allow small organic 
molecules to join together to form macromolecules, which can assemble into large 
macromolecular complexes via noncovalent bonds. Ribosomes are large macromolecular 
machines that synthesize proteins inside cells. Each ribosome is composed of about 

90 macromolecules (proteins and RNA molecules), and it is large enough to see in the 
electron microscope (see Figure 7-34). The subunits, macromolecules, and ribosome 
shown here are drawn roughly to scale. 


QUESTION 2-9 


Why could covalent bonds not be 
used in place of noncovalent bonds 
to mediate most of the interactions 
of macromolecules? 
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CHAPTER 2 


Chemical Components of Cells 


ESSENTIAL CONCEPTS 


Living cells obey the same chemical and physical laws as nonliving 
things. Like all other forms of matter, they are made of atoms, which 
are the smallest unit of a chemical element that retains the distinc- 
tive chemical properties of that element. 


Cells are made up of a limited number of elements, four of which—C, 
H, N, O—make up about 96% of a cell’s mass. 


Each atom has a positively charged nucleus, which is surrounded by 
a cloud of negatively charged electrons. The chemical properties of 
an atom are determined by the number and arrangement of its elec- 
trons: it is most stable when its outer electron shell is completely 
filled. 


A covalent bond forms when a pair of outer-shell electrons is shared 
between two adjacent atoms; if two pairs of electrons are shared, a 
double bond is formed. A cluster of two or more atoms held together 
by covalent bonds is known as a molecule. 


When an electron jumps from one atom to another, two ions of oppo- 
site charge are generated; these ions are held together by mutual 
attraction, forming a noncovalent ionic bond. 


Cells are 70% water by weight; the chemistry of life therefore takes 
place in an aqueous environment. 


Living organisms contain a distinctive and restricted set of small, 
carbon-based (organic) molecules, which are essentially the same 
for every living species. The main categories are sugars, fatty acids, 
amino acids, and nucleotides. 


Sugars are a primary source of chemical energy for cells and 
can also be joined together to form polysaccharides or shorter 
oligosaccharides. 


Fatty acids are an even richer energy source than sugars, but their 
most essential function is to form lipid molecules that assemble into 
sheet-like cell membranes. 


The vast majority of the dry mass of a cell consists of macromol- 
ecules—mainly polysaccharides, proteins, and nucleic acids (DNA 
and RNA); these macromolecules are formed as polymers of sugars, 
amino acids, or nucleotides, respectively. 


The most diverse and versatile class of macromolecules are proteins, 
which are formed from 20 types of amino acids that are covalently 
linked by peptide bonds into long polypeptide chains. Proteins con- 
stitute half of the dry mass of a cell. 


Nucleotides play a central part in energy-transfer reactions within 
cells; they are also joined together to form information-containing 
RNA and DNA molecules, each of which is composed of only four 
types of nucleotides. 


Protein, RNA, and DNA molecules are synthesized from subunits by 
repetitive condensation reactions, and it is the specific sequence of 
subunits that determines their unique functions. 


Four types of weak noncovalent bonds—hydrogen bonds, electro- 
static attractions, van der Waals attractions, and the hydrophobic 
force—enable macromolecules to bind specifically to other macro- 
molecules or to selected small molecules. 


Noncovalent bonds between different regions of a polypeptide or RNA 
chain allow these chains to fold into unique shapes (conformations). 


Questions 


KEY TERMS 


acid electrostatic attraction molecule 

amino acid fatty acid monomer 

atom hydrogen bond noncovalent bond 
atomic weight hydrolysis nucleotide 

ATP hydronium ion organic molecule 
Avogadro's number hydrophilic PH scale 

base hydrophobic polar 

buffer hydrophobic force polymer 

chemical bond inorganic protein 

chemical group ion proton 
condensation reaction ionic bond RNA 
conformation lipid sequence 
covalent bond lipid bilayer subunit 

DNA macromolecule sugar 

electron molecular weight van der Waals attraction 


electronegativity 


QUESTIONS 


QUESTION 2-10 


Which of the following statements are correct? Explain your 
answers. 


A. An atomic nucleus contains protons and neutrons. 
B. An atom has more electrons than protons. 
C. The nucleus is surrounded by a double membrane. 


D. All atoms of the same element have the same number of 
neutrons. 


E. The number of neutrons determines whether the nucleus 
of an atom is stable or radioactive. 


F. Both fatty acids and polysaccharides can be important 
energy stores in the cell. 


G. Hydrogen bonds are weak and can be broken by thermal 
energy, yet they contribute significantly to the specificity of 
interactions between macromolecules. 


QUESTION 2-11 


To gain a better feeling for atomic dimensions, assume that 
the page on which this question is printed is made entirely of 
the polysaccharide cellulose, whose molecules are described 
by the formula (C,H2,0,), where n can be a quite large 
number and is variable from one molecule to another. The 
atomic weights of carbon, hydrogen, and oxygen are 12, 1, 
and 16, respectively, and this page weighs 5 g. 


A. How many carbon atoms are there in this page? 


B. In cellulose, how many carbon atoms would be stacked 
on top of each other to span the thickness of this page (the 
size of the page is 21.2 cm x 27.6 cm, and it is 0.07 mm 
thick)? 


C. Now consider the problem from a different angle. 
Assume that the page is composed only of carbon atoms. 

A carbon atom has a diameter of 2 x 10-19 m (0.2 nm); how 
many carbon atoms of 0.2 nm diameter would it take to span 
the thickness of the page? 


D. Compare your answers from parts B and C and explain 
any differences. 


QUESTION 2-12 


A. How many electrons can be accommodated in the first, 
second, and third electron shells of an atom? 


B. How many electrons would atoms of the elements listed 
below have to gain or lose to obtain a completely filled outer 
shell? 


helium gain lose __ 
oxygen gain lose __ 
carbon gain __ lose __ 
sodium gain__ lose __ 
chlorine gain lose __ 


C. What do the answers tell you about the reactivity of 
helium and the bonds that can form between sodium and 
chlorine? 


QUESTION 2-13 


The elements oxygen and sulfur have similar chemical 
properties because they both have six electrons in their 
outermost electron shells. Indeed, both elements form 
molecules with two hydrogen atoms, water (H20) and 
hydrogen sulfide (H2S). Surprisingly, at room temperature, 
water is a liquid, yet H2S is a gas, despite sulfur being much 
larger and heavier than oxygen. Explain why this might be 
the case. 


QUESTION 2-14 


Write the chemical formula for a condensation reaction of 
two amino acids to form a peptide bond. Write the formula 
for its hydrolysis. 
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PANEL 2-1 CHEMICAL BONDS AND GROUPS 


CARBON SKELETONS 


Carbon has a unique role in the cell because of its 
ability to form strong covalent bonds with other 
carbon atoms. Thus carbon atoms can join to form: 


chains 
Vf NS VV WV 
a a a ae 


7% £% FX F% 


also written as PITS 


COVALENT BONDS 


branched trees 


also written as — 


A covalent bond forms when two atoms come very close 
together and share one or more of their outer-shell electrons. 
Each atom forms a fixed number of covalent bonds in a 


defined spatial arrangement. 
SINGLE BONDS: two electrons shared per bond 


RR p a 


DOUBLE BONDS: four electrons shared per bond 


Y R R 


The precise spatial arrangement of covalent bonds influences 


Atoms joined by two 

or more covalent bonds 
cannot rotate freely 
around the bond axis. 
This restriction has a 
major influence on the 
three-dimensional shape 


the three-dimensional structure and chemistry of molecules. 


ALTERNATING DOUBLE BONDS 


A carbon chain can include double 
bonds. If these are on alternate carbon 
atoms, the bonding electrons move 
within the molecule, stabilizing the 
structure by a phenomenon called 
resonance. 


/ 

=C C 
\ / 
C=C 
/ \ 


the truth is somewhere between 


these two structures H 


Alternating double bonds in a ring 
can generate a very stable structure. 


often written as © 


also written as CO 


of many macromolecules. 


C-H COMPOUNDS 


Carbon and hydrogen 
together make stable 
compounds (or groups) 
called . These 
are nonpolar, do not form 
hydrogen bonds, and are 
generally insoluble in water. 


a T 
N N 
AONAN AON ASN 
o o fa fa) 
= ca 


E 
iS) 
a 


E aE I 
OE 
Noa Na 
E IL I 
N N N 


T 
Uo 
C 


part of the hydrocarbon “tail” 
of a fatty acid molecule 


C-O COMPOUNDS 


Many biological compounds contain a carbon covalently 


67 


C-N COMPOUNDS 


Amines and amides are two important examples of 


bonded to an oxygen. For example, compounds containing a carbon linked to a nitrogen. 


in water combine with an H* ion to become 


The -OH is called a positively charged. 


group. 


are formed by combining an acid and an 
The C=O is called a amine. Unlike amines, amides are uncharged in water. 
group. An example is the peptide bond that joins amino acids 
in a protein. 


The -COOH is called a acid 
group. In water, 
this loses an H* ion to 


= Nitrogen also occurs in several ring compounds, including 
become -COO . 


important constituents of nucleic acids: purines and 
Esters are formed by combining an pyrimidines. NH 
acid and an alcohol. 2 
AS 


ll cytosine (a pyrimidine) 


SULFHYDRYL GROUP The -¢- H is called a sulfhydryl group. In the amino acid cysteine, the sulfhydryl group may exist in 


l 
the reduced form, -Ç> H or more rarely in an oxidized, cross-bridging form, Lie — Ae 
i i 


PHOSPHATES 


Inorganic phosphate is a stable ion formed from 
phosphoric acid, H3PO4. It is also written as P; . 


Phosphate esters can form between a phosphate and a free hydroxyl group. 
are often covalently attached to proteins in this way. 


O | ©) also 
Il written as 


Te ee x 
—C— OH + HO—P—O => —C—O—P—O> +hHO| | 


i | i -¢-0o-@ 
O © | 


The combination of a phosphate and a carboxyl group, or two or more phosphate groups, produces an acid anhydride. 
Because compounds of this type release a large amount of free energy when the bond is broken by hydrolysis in the cell, 
they are often said to contain a “high-energy” bond. 


H,O 


“high-energy” acyl phosphate also written as 


bond (carboxylic-phosphoric 
acid anhydride) found in 
some metabolites 


“high-energy” phosphoanhydride 
bond found in molecules 
such as ATP 


also written as 
— O—P)}—P 
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HYDROGEN BONDS 


Because they are polarized, two 
adjacent H,O molecules can form 
a noncovalent linkage known as a 
hydrogen bond. Hydrogen bonds 
have only about 1/20 the strength 
of a covalent bond. 


öt 


b = o o 
hydrogen 
Hydrogen bonds are strongest when d bond 
the three atoms lie in a straight line. 


Two atoms connected by a covalent bond may exert different attractions for 
the electrons of the bond. In such cases, the bond is polar, with one end 
slightly negatively charged (5-) and the other slightly positively charged (8+). 


electropositive 
region 


electronegative 
region 


Although a water molecule has an overall neutral charge (having the same 
number of electrons and protons), the electrons are asymmetrically distributed, 
making the molecule polar. The oxygen nucleus draws electrons away from 

the hydrogen nuclei, leaving the hydrogen nuclei with a small net positive charge. 
The excess of electron density on the oxygen atom creates weakly negative 
regions at the other two corners of an imaginary tetrahedron. On these pages, 
we review the chemical properties of water and see how water influences the 
behavior of biological molecules. 


HYDROPHILIC MOLECULES 


Substances that dissolve readily in water are termed hydrophilic. They include 
ions and polar molecules that attract water molecules through electrical charge 
effects. Water molecules surround each ion or polar molecule and carry it 

into solution. 


H 
H ae 
H H / Hu Q 
Hego i A O o f 
H DE H6* HO `H \ \ UH 
+ H T 
Soe Na H H 5+ Ga A A HiO 
Oð "a iy 
§-O o—H pri O=C H 
/ ò` st ot K < 
H | s LH 
H7O H | | H N—H iO 
O H / fi \ 
i HI Q: H H 
H = 
lonic substances such as sodium chloride E 


dissolve because water molecules are 
attracted to the positive (Nat) or negative 
(CI) charge of each ion. 


Polar substances such as urea 
dissolve because their molecules 
form hydrogen bonds with the 
surrounding water molecules. 


bond lengths 


oF 
hydrogen 
bond 
H 0.17 nm 
Sena 
O ummm H — O — 
Z L 
H 0.10 nm 


covalent bond 


WATER STRUCTURE 


Molecules of water join together transiently 
in a hydrogen-bonded lattice. 


The cohesive nature of water is 
responsible for many of its unusual 
properties, such as high surface tension, 
high specific heat capacity, and high heat 
of vaporization. 


HYDROPHOBIC MOLECULES 


Substances that contain a preponderance 
of nonpolar bonds are usually insoluble 

in water and are termed hydrophobic. 
Water molecules are not attracted to such 
hydrophobic molecules and so have little 
tendency to surround them and bring them 
into solution. 


yu 
Hg 
Ho H a 
H H 
D7 \ | 
n O=O 
X Ve H H 
=C m | 
SY NP 
c Ay, / 
H / N H% x. 
| H H o” 
Ow | 
H H 


Hydrocarbons, which contain many 
C-H bonds, are especially hydrophobic. 


WATER AS A SOLVENT 


Many substances, such as household sugar (sucrose), dissolve in water. That is, their 
molecules separate from each other, each becoming surrounded by water molecules. 


ae 
sugar 
dissolves 


water 
molecule 


sugar crystal 


ACIDS 


Substances that release hydrogen ions (protons) into solution 
are called acids. 


H* + 
hydrogen ion 


Gl: 


chloride ion 


HCI 
hydrochloric acid 
(strong acid) 


— 


Many of the acids important in the cell are not completely 
dissociated, and they are therefore weak acids—for example, 
the carboxyl group (-COOH), which dissociates to give a 
hydrogen ion in solution. 


H* + 


carboxyl group 
(weak acid) 


Note that this is a reversible reaction. 


pH 
H* 

conc. pH 

The acidity of a moles/liter 
solution is defined 1 0 
by the concentration (conc.) 10"! — 1 
of hydronium ions (H30%*) it 102 2 
possesses, generally ACIDIC 49-3 3 

abbreviated as Ht. á 
: 10 4 

For convenience, we 3 
use the pH scale, where 10 5 
10°° 6 
107 7 


pH = -log;o[H*] 


For pure water 
[H*] = 107 moles/liter 


When a substance dissolves in a 

liquid, the mixture is termed a solution. 
The dissolved substance (in this case 
sugar) is the solute, and the liquid that 
does the dissolving (in this case water) 
is the solvent. Water is an excellent 
solvent for hydrophilic substances 
because of its polar bonds. 


sugar molecule 


HYDROGEN ION EXCHANGE 


Positively charged hydrogen ions (H*) can spontaneously 
move from one water molecule to another, thereby creating 
two ionic species. 


nN a ` i © ep 
— al 
ye O — O—H + O 
H H 
hydronium ion hydroxyl ion 
often written as: H,O — H* + OH” 
hydrogen hydroxyl 

ion ion 


Because the process is rapidly reversible, hydrogen ions are 
continually shuttling between water molecules. Pure water 
contains equal concentrations of hydronium ions and 
hydroxyl ions (both 107” M). 


BASES 


Substances that reduce the number of hydrogen ions in 
solution are called bases. Some bases, such as ammonia, 
combine directly with hydrogen ions. 


NH3 + Ht 
ammonia hydrogen ion 


—s 


NH," 


ammonium ion 
Other bases, such as sodium hydroxide, reduce the number of 


H* ions indirectly, by producing OH” ions that then combine 
directly with H* ions to make H30. 


NaOH Nay + OH” 
sodium hydroxide sodium hydroxyl 
(strong base) ion ion 


Many bases found in cells are partially associated with H* ions 
and are termed weak bases. This is true of compounds that 
contain an amino group (-NH3), which has a weak tendency 
to reversibly accept an H* ion from water, thereby 
increasing the concentration of free OH ions. 


-NH, + HË —— -NH;* 
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Organic molecules can interact with other molecules through 
three types of short-range attractive forces known as 
noncovalent bonds: van der Waals attractions, electrostatic 
attractions, and hydrogen bonds. The repulsion of 
hydrophobic groups from water is also important for these 


interactions and for the folding of biological macromolecules. 


k RS 
noncovalent > NS 
bond i 2 í ANN 
y wW™\ YY 
Tr Sa 
a — 
Ul 


Weak noncovalent bonds have less than 1/20 the strength of 
a strong covalent bond. They are strong enough to provide 
tight binding only when many of them are formed 
simultaneously. 


HYDROGEN BONDS 


As already described for water (see Panel 2-2, pp. 68-69), 
hydrogen bonds form when a hydrogen atom is 
“sandwiched” between two electron-attracting atoms 
(usually oxygen or nitrogen). 


Hydrogen bonds are strongest when the three atoms are 


in a straight line: 
Amino acids in a polypeptide chain can be hydrogen-bonded 


Examples in macromolecules: 
together in a folded protein. 


l \ 
` 
| 
| COMM] HN 
Ba. R—C—H H—C—R 
C=O IN H—N 
l 
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Si et: em Se 


Two bases, G and C, are hydrogen-bonded in a DNA double helix. 


H 


H 
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VAN DER WAALS ATTRACTIONS 


If two atoms are too close together, they repel each other 
very strongly. For this reason, an atom can often be 
treated as a sphere with a fixed radius. The characteristic 
“size” for each atom is specified by a unique van der 
Waals radius. The contact distance between any two 
noncovalently bonded atoms is the sum of their van der 
Waals radii. 


© @ee 


0.12 nm 0.2 nm 0.15 nm 0.14 nm 
radius radius radius radius 


At very short distances, any two atoms show a weak 
bonding interaction due to their fluctuating electrical 
charges. The two atoms will be attracted to each other 
in this way until the distance between their nuclei is 
approximately equal to the sum of their van der Waals 
radii. Although they are individually very weak, such 
van der Waals attractions can become important when 
two macromolecular surfaces fit together very closely, 
because many atoms are involved. 

Note that when two atoms form a covalent bond, the 
centers of the two atoms (the two atomic nuclei) are 
much closer together than the sum of the two van der 
Waals radii. Thus, 


Ld L_} LJ 
0.4nm 0.15 nm 0.13 nm 
two non-bonded two carbon two carbon 


carbon atoms atoms held by a atoms held by a 


single covalent double covalent 
bond bond 


HYDROGEN BONDS IN WATER 


Any two atoms that can form hydrogen bonds to each other 
can alternatively form hydrogen bonds to water molecules. 
Because of this competition with water molecules, the 
hydrogen bonds formed in water between two peptide bonds, 
for example, are relatively weak. 


peptide 
bond 


ec 20 | 
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ELECTROSTATIC ATTRACTIONS 


occur both between 
fully charged groups (ionic bond) and between 
partially charged groups on polar molecules. 


= Ji g— 


The force of attraction between the two partial 
charges, 5* and ð, falls off rapidly as the 
distance between the charges increases. 


In the absence of water, ionic bonds are very strong. 


They are responsible for the strength of such 
minerals as marble and agate, and for crystal 
formation in common table salt, NaCl. 


HYDROPHOBIC FORCES 
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ELECTROSTATIC ATTRACTIONS 
IN WATER 


Charged groups are shielded by their 
interactions with water molecules. 
Electrostatic attractions are therefore 
quite weak in water. 


Inorganic ions in solution can also cluster around 
charged groups and further weaken these electrostatic 
attractions. 


Despite being weakened by water and inorganic 
ions, electrostatic attractions are very important 
in biological systems. For example, an enzyme 
that binds a positively charged substrate will 
often have a negatively charged amino acid side 
chain at the appropriate place. 


substrate 


© 


Water forces hydrophobic groups together 

in order to minimize their disruptive effects on 
the water network formed by the hydrogen bonds 
between water molecules. Hydrophobic groups 
held together in this way are sometimes said 

to be held together by “hydrophobic 

bonds,” even though the attraction is 

actually caused by a repulsion from water. 
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MONOSACCHARIDES 


Monosaccharides usually have the general formula (CHO), where n can be 3, 4, 5, or 6, and have two or more hydroxyl groups. 
fe} 
They either contain an aldehyde group (—¢<,,) and are called aldoses, or a ketone group (>¢=0 ) and are called ketoses. 


3-carbon (TRIOSES) 5-carbon (PENTOSES) 6-carbon (HEXOSES) 


ALDOSES 


glyceraldehyde 


KETOSES 


dihydroxyacetone ribulose fructose 


RING FORMATION ISOMERS 


In aqueous solution, the aldehyde or ketone group of a sugar Many monosaccharides differ only in the spatial arrangement 

molecule tends to react with a hydroxyl group of the same of atoms—that is, they are isomers. For example, glucose, 

molecule, thereby closing the molecule into a ring. galactose, and mannose have the same formula (C6H41206) but 
differ in the arrangement of groups around one or two carbon 


atoms. 
ÇH2OH CHOH 


glucose 


ribose 


glucose mannose 


These small differences make only minor changes in the 
chemical properties of the sugars. But the differences are 


recognized by enzymes and other proteins and therefore can 
Note that each carbon atom has a number. have major biological effects. 
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a AND B LINKS SUGAR DERIVATIVES 

The hydroxyl group on the carbon that carries the The hydroxyl groups of 

aldehyde or ketone can rapidly change from one a simple monosaccharide, 

position to the other. These two positions are called such as glucose, can be 

a and B. replaced by other 
groups. 


As soon as one sugar is linked to another, the a or 
B form is frozen. 


DISACCHARIDES 


The carbon that carries the aldehyde 
or the ketone can react with any 
hydroxyl group on a second sugar 
molecule to form a i 
Three common disaccharides are 


maltose (glucose + glucose) 
lactose (galactose + glucose) 
sucrose (glucose + fructose) 


The reaction forming sucrose is 
shown here. 


OLIGOSACCHARIDES AND POLYSACCHARIDES 


Large linear and branched molecules can be made from simple repeating sugar subunits. 
Short chains are called , and long chains are called ; 
Glycogen, for example, is a polysaccharide made entirely of glucose subunits joined together. 


branch points 


COMPLEX OLIGOSACCHARIDES 


In many cases, a sugar sequence 

is nonrepetitive. Many different 
molecules are possible. Such 

complex oligosaccharides are 

usually linked to proteins or to lipids, 
as is this oligosaccharide, which is 
part of a cell-surface molecule 

that defines a particular blood group. 
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FATTY ACIDS 


All fatty acids have a 
at one end and a long 
hydrocarbon tail at the other. 


CARBOXYL GROUP 


If free, the carboxyl group of a 
fatty acid will be ionized. 


Hundreds of different kinds of fatty acids exist. Some have one or more double bonds in their 
hydrocarbon tail and are said to be . Fatty acids with no double bonds are 


O O 
N 
C 


This double bond 
is rigid and creates 
f? kink in the chain. 
| The rest of the chain 
is free to rotate 
about the other C-C 
bonds. 


space-filling model carbon skeleton 


TRIACYLGLYCEROLS Fatty acids are stored in cells as an energy reserve 
(fats and oils) through an ester linkage to 
to form triacylglycerols. 


glycerol 


Phospholipids are the major constituents 
PHOSPHOLIPIDS of cell membranes. 


choline 
hydrophilic Q 


O head 


PPPOE LOOP OP 


But more often it is linked to 
other groups to form either 


POS 0 SOP 


hydrophobic 
fatty acid tails 


In phospholipids, two of the -OH groups in 
glycerol are linked to fatty acids, while the third 
-OH group is linked to phosphoric acid. The 
phosphate, which carries a negative charge, is 
further linked to one of a variety of small polar 
groups, such as choline. 


general structure of 
a phospholipid 
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LIPID AGGREGATES POLYISOPRENOIDS 


Long-chain polymers 


Fatty acids have a hydrophilic head — ‘ i 00000: r— surface film of isoprene 


and a hydrophobic tail. ——_ 


m micelle 


In water, they can form either a surface 
film or small, spherical micelles. 


Their derivatives can form larger aggregates held together by hydrophobic forces: 


form large, spherical and form self-sealing lipid 
fat droplets in the cell cytoplasm. bilayers, which are the basis for all cell membranes. 


t 


200 nm 
or more 


| 


OTHER LIPIDS Lipids are defined as water-insoluble 
molecules that are soluble in organic 
solvents. Two other common types of lipids 
are steroids and polyisoprenoids. Both are 
made from isoprene units. 


STEROIDS Steroids have a common multiple-ring structure. 


—found in many cell membranes —male sex hormone 


GLYCOLIPIDS 


Like phospholipids, these compounds are composed of a hydrophobic 
region, containing two long hydrocarbon tails, and a polar region, 
which contains one or more sugars. Unlike phospholipids, there is 

no phosphate. 


—used 
to carry activated sugars 
in the membrane- 
associated synthesis of 
glycoproteins and some 
polysaccharides 
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FAMILIES OF BASIC SIDE CHAINS 
AMINO ACIDS 


lysine arginine histidine 


The common amino acids (Lys, or K) (Arg, or R) (His, or H) 
are grouped according to 
whether their side chains 
are 


acidic 
basic 
uncharged polar 


nonpolar This group is 


very basic 


These 20 amino acids because its 


are given both three-letter 7 positive charge These nitrogens have a 

and one-letter abbreviations. is stabilized by 3 relatively weak affinity for an 
, resonance (see H* and are only partly positive 

Thus: alanine = Ala = A Panel 2-1). at neutral pH. 


THE AMINO ACID OPTICAL ISOMERS The a-carbon atom is asymmetric, 


allowing for two mirror-image 
The general formula of an amino acid is (or stereo-) isomers, L and D. 


a-carbon atom 
H a 
f | 
amino carboxyl 
group HaN—¢ M group 
R 
Ps side chain 


R is commonly one of 20 different side chains. 
At pH 7, both the amino and carboxyl groups 
are ionized. 


H 
© | 
G C02 


R Proteins contain exclusively L-amino acids. 


PEPTIDE BONDS 


In proteins, amino acids are joined together by an The four atoms involved in each peptide bond form a rigid 
amide linkage, called a peptide bond. planar unit (red box). There is no rotation around the C-N bond. 


H2O 


Proteins are long polymers amino terminus, or 
of amino acids linked by N-terminus 
peptide bonds, and they 

are always written with the 

N-terminus toward the left. i H © g carboxyl terminus, or 
Peptides are shorter, usually 4 da N C-terminus 

fewer than 50 amino acids long. i 

The sequence of this tripeptide 
is histidine-cysteine-valine. 


These two single bonds allow rotation, so that long 
chains of amino acids are very flexible. 


ACIDIC SIDE CHAINS NONPOLAR SIDE CHAINS 


alanine 
| aspartic acid glutamic acid (Ala, or A) 
(Asp, or D) (Glu, or E) 


leucine 
(Leu, or L) 


UNCHARGED POLAR SIDE CHAINS 


proline 


asparagine glutamine (Pro, or P) 


(Asn, or N) (Gin, or Q) 


(actually an `CH3 
imino acid) 


methionine l 
(Met, or M) 


Although the amide N is not charged at 
neutral pH, it is polar. 


| 
5 —CH, 


serine threonine tyrosine 
(Ser, or S) (Thr, or T) (Tyr, or Y) 
glycine 
(Gly, or G) 


valine 
(Val, or V) 


isoleucine 


(Ile, or I) 


phenylalanine 


(Phe, or F) 


tryptophan 


(Trp, or W) 


cysteine 
(Cys, or C) 


A disulfide bond (red) can form between two cysteine side 


chains in proteins. 
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The bases are nitrogen-containing ring 
compounds, either pyrimidines or purines. 


SS 


C 
| 
HC 


thymine PURINE 


PHOSPHATES NUCLEOTIDES BASE-SUGAR 


T ; ; : oP LINKAGE 
The phosphates are normally joined to A nucleotide consists of a nitrogen-containing 
the C5 hydroxyl of the ribose or base, a five-carbon sugar, and one or more 
deoxyribose sugar (designated 5'). Mono-, phosphate groups. 


di-, and triphosphates are common. 
BASE 


© 
l NH3 N-glycosidic 


-O— P—O—CH, Anp bond BASE 


Or PHOSPHATE 


as in 
ep ATP Nucleotides The base is linked to 
| are the the same carbon (C1) 
subunits of used in sugar-sugar 
The phosphate makes a nucleotide the nucleic acids. bonds. 
negatively charged. 


SUGARS 


B-p-ribose 
used in ribonucleic acid (RNA) 


- SENSE two kinds of 
a five-carbon sugar pentoses are used 


HOCH, 


B-p-2-deoxyribose 


as used in deoxyribonucleic acid (DNA 
Each numbered carbon on the sugar of a nucleotide is oe ( ) 


followed by a prime mark; therefore, one speaks of the 
“5-prime carbon,” etc. 
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NOMENCLATURE The names can be confusing, but the abbreviations are clear. S 


BASE NUCLEOSIDE Nucleotides and their derivatives can be 
abbreviated to three capital letters. 
adenine adenosine Some examples follow: 
BASE + SUGAR = NUCLEOSIDE 
guanine guanosine AMP = adenosine monophosphate 
dAMP = deoxyadenosine monophosphate 
cytosine cytidine UDP = uridine diphosphate 
ATP = adenosine triphosphate 
uracil uridine 


thymine thymidine 


BASE + SUGAR + PHOSPHATE = NUCLEOTIDE 


NUCLEIC ACIDS NUCLEOTIDES AND THEIR DERIVATIVES HAVE 

To form nucleic acid polymers, nucleotides MANY OTHER FUNCTIONS 

are joined together by phosphodiester 

bonds between the 5’ and 3’ carbon @ As nucleoside di- and triphosphates, they carry chemical energy in their 
atoms of adjacent sugar rings. The linear easily hydrolyzed phosphoanhydride bonds. 

sequence of nucleotides in a nucleic acid 


chain is abbreviated using a one-letter : NH3 
code, such as AGCTT, starting with the phosphoanhydride bonds 
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example: ATP (or ) 
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@ They are used as small intracellular signaling molecules in the cell. 
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QUESTION 2-15 


Which of the following statements are correct? Explain your 
answers. 


A. Proteins are so remarkably diverse because each is made 
from a unique mixture of amino acids that are linked in 
random order. 


B. Lipid bilayers are macromolecules that are made up 
mostly of phospholipid subunits. 


C. Nucleic acids contain sugar groups. 
D. Many amino acids have hydrophobic side chains. 


E. The hydrophobic tails of phospholipid molecules are 
repelled from water. 


F. DNA contains the four different bases A, G, U, and C. 


QUESTION 2-16 


A. How many different molecules composed of (a) two, 
(b) three, and (c) four amino acids, linked together by 
peptide bonds, can be made from the set of 20 naturally 
occurring amino acids? 


B. Assume you were given a mixture consisting of one 
molecule each of all possible sequences of a smallish protein 
of molecular mass 4800 daltons. If the average molecular 
mass of an amino acid is, say, 120 daltons, how much would 
the sample weigh? How big a container would you need to 
hold it? 


C. What does this calculation tell you about the fraction 

of possible proteins that are currently in use by living 
organisms (the average molecular mass of proteins is about 
30,000 daltons)? 


QUESTION 2-17 


This is a biology textbook. Explain why the chemical 
principles that are described in this chapter are important in 
the context of modern cell biology. 


QUESTION 2-18 


A. Describe the similarities and differences between van der 
Waals attractions and hydrogen bonds. 


B. Which of the two bonds would form (a) between two 
hydrogens bound to carbon atoms, (b) between a nitrogen 
atom and a hydrogen bound to a carbon atom, and 

(c) between a nitrogen atom and a hydrogen bound to an 
oxygen atom? 


QUESTION 2-19 

What are the forces that determine the folding of a 
macromolecule into a unique shape? 

QUESTION 2-20 


Fatty acids are said to be “amphipathic.” What is meant by 
this term, and how does an amphipathic molecule behave in 
water? Draw a diagram to illustrate your answer. 


QUESTION 2-21 


Are the formulas in Figure Q2-21 correct or incorrect? 
Explain your answer in each case. 
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Energy, Catalysis, and 
Biosynthesis 


One property above all makes living things seem almost miraculously 
different from nonliving matter: they create and maintain order in a uni- 
verse that is tending always toward greater disorder. To accomplish this 
remarkable feat, the cells in a living organism must continuously carry 
out a never-ending stream of chemical reactions to maintain their struc- 
ture, meet their metabolic needs, and stave off unrelenting chemical 
decay. In these reactions, small organic molecules—amino acids, sug- 
ars, nucleotides, and lipids—can be taken apart or modified to supply 
the many other small molecules that the cell requires. These molecules 
are also used to construct an enormously diverse range of large mol- 
ecules, including the proteins, nucleic acids, and other macromolecules 
that constitute most of the mass of living systems and endow them with 
their distinctive properties. 


Each cell can be viewed as a tiny chemical factory, performing many mil- 
lions of reactions every second. This incessant activity requires both a 
source of atoms in the form of food molecules and a source of energy. 
Both the atoms and the energy must come, ultimately, from the nonliving 
environment. In this chapter, we discuss why cells require energy, and 
how they use energy and atoms from their environment to create and 
maintain the molecular order that makes life possible. 


Most of the chemical reactions that cells perform would normally occur 
only at temperatures that are much higher than those inside a cell. Each 
reaction therefore requires a major boost in chemical reactivity to enable 
it to proceed rapidly within the cell. This boost is provided by a large 
set of specialized proteins called enzymes, each of which accelerates, or 
catalyzes, just one of the many possible reactions that a particular 
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Figure 3—1 A series of enzyme-catalyzed molecule molecule molecule molecule molecule molecule 
reactions forms a linked pathway. Each A j A — Bo —____. g j @ = ® 
CATALYSIS CATALYSIS CATALYSIS CATALYSIS CATALYSIS 


chemical reaction is catalyzed by a distinct 
enzyme. Together, this set of enzymes, 
acting in series, converts molecule A to 
molecule F. 


BY ENZYME 1 BY ENZYME 2 BY ENZYME 3 BY ENZYME 4 BY ENZYME 5 


molecule could in principle undergo. These enzyme-catalyzed reac- 
tions are usually connected in series, so that the product of one reaction 
becomes the starting material for the next (Figure 3—1). The long, linear 
reaction pathways that result are in turn linked to one another, forming a 
complex web of interconnected reactions. 


Rather than being an inconvenience, the necessity for catalysis is a ben- 
efit, as it allows the cell to precisely control its metabolism—the sum 
total of all the chemical reactions it needs to carry out to survive, grow, 
and reproduce. This control is central to the chemistry of life. 


Two opposing streams of chemical reactions occur in cells: the catabolic 
pathways and the anabolic pathways. The catabolic pathways (catabo- 
lism) break down foodstuffs into smaller molecules, thereby generating 
both a useful form of energy for the cell and some of the small molecules 
that the cell needs as building blocks. The anabolic, or biosynthetic, path- 
ways (anabolism) use the energy harnessed by catabolism to drive the 
synthesis of the many molecules that form the cell. Together, these two 
sets of reactions constitute the metabolism of the cell (Figure 3-2). 


The details of the reactions that comprise cell metabolism are part of the 
subject matter of biochemistry, and they need not concern us here. But 
the general principles by which cells obtain energy from their environ- 
ment and use it to create order are central to cell biology. We therefore 
begin this chapter by explaining why a constant input of energy is needed 
to sustain living organisms. We then discuss how enzymes catalyze the 
reactions that produce biological order. Finally, we describe the mol- 
ecules inside cells that carry the energy that makes life possible. 


THE USE OF ENERGY BY CELLS 


Left to themselves, nonliving things eventually become disordered: build- 
ings crumble and dead organisms decay. Living cells, by contrast, not 
only maintain but actually generate order at every level, from the large- 
scale structure of a butterfly or a flower down to the organization of the 
molecules that make up such organisms (Figure 3-3). This property of life 
is made possible by elaborate molecular mechanisms that extract energy 
from the environment and convert it into the energy stored in chemical 
bonds. Biological structures are therefore able to maintain their form, 
even though the materials that form them are continually being broken 
down, replaced, and recycled. Your body has the same basic structure it 
had 10 years ago, even though you now contain atoms that, for the most 
part, were not part of your body then. 
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Biological Order Is Made Possible by the Release of 
Heat Energy from Cells 


The universal tendency of things to become disordered is expressed in 
a fundamental law of physics called the second law of thermodynamics. 
This law states that in the universe as a whole, or in any isolated sys- 
tem (a collection of matter that is completely cut off from the rest of the 
universe), the degree of disorder can only increase. The second law of 
thermodynamics has such profound implications for living things that it 
is worth restating in several ways. 


We can express the second law in terms of probability by stating that 
systems will change spontaneously toward those arrangements that have 
the greatest probability. Consider a box in which 100 coins are all lying 
heads up. A series of events that disturbs the box—for example, someone 
jiggling it a bit—will tend to move the arrangement toward a mixture of 
50 heads and 50 tails. The reason is simple: there is a huge number of 
possible arrangements of the individual coins that can achieve the 50-50 
result, but only one possible arrangement that keeps them all oriented 
heads up. Because the 50-50 mixture accommodates a greater number 
of possibilities and places fewer constraints on the orientation of each 
individual coin, we say that it is more “disordered.” For the same rea- 
son, one’s living space will become increasingly disordered without an 
intentional effort to keep it organized. Movement toward disorder is a 
spontaneous process, and requires a periodic input of energy to reverse 
it (Figure 3-4). 


“SPONTANEOUS” REACTION 


| as time elapses y 


ORGANIZED EFFORT REQUIRING ENERGY INPUT 
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Figure 3-3 Biological structures are 
highly ordered. Well-defined, ornate, 

and beautiful spatial patterns can be 

found at every level of organization in 

living organisms. Shown are: (A) protein 
molecules in the coat of a virus (a parasite 
that, although not technically alive, contains 
the same types of molecules as those 
found in living cells); (B) the regular array 
of microtubules seen in a cross section of a 
sperm tail; (C) surface contours of a pollen 
grain; (D) cross section of a fern stem, 
showing the patterned arrangement of 
cells; and (E) a spiral array of leaves, each 
made of millions of cells. (A, courtesy of 
Robert Grant, Stéphane Crainic, and James 
M. Hogle; B, courtesy of Lewis Tilney; 

C, courtesy of Colin MacFarlane and 

Chris Jeffree; D, courtesy of Jim Haseloff.) 


Figure 3-4 The spontaneous tendency 
toward disorder is an everyday 
experience. Reversing this natural tendency 
toward disorder requires an intentional 
effort and an input of energy. In fact, from 
the second law of thermodynamics, we 

can be certain that the human intervention 
required will release enough heat to the 
environment to more than compensate for 
the reestablishment of order in this room. 
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Figure 3-5 Living cells do not defy the 
second law of thermodynamics. In the 
diagram on the left, the molecules of both 
the cell and the rest of the universe (the 
environment) are depicted in a relatively 
disordered state. In addition, red arrows 
suggest the relative amount of thermal 
motion of the molecules both inside and 
outside the cell. In the diagram on the 
right, the cell has taken in energy from 
food molecules, carried out a reaction 
that gives order to the molecules that the 
cell contains, and released heat (yellow 
arrows) into the environment. The released 
heat increases the disorder in the cell's 
surroundings—as depicted here by the 
increase in thermal motion of the molecules 
in the environment and the distortion 

of those molecules due to enhanced 
vibration and rotation. The second law of 
thermodynamics is thereby satisfied, even 
as the cell grows and constructs larger 
molecules. 
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sea of matter cell 


increased order 


increased disorder 


The measure of a system's disorder is called the entropy of the system, 
and the greater the disorder, the greater the entropy. Thus another way 
to express the second law of thermodynamics is to say that systems 
will change spontaneously toward arrangements with greater entropy. 
Living cells—by surviving, growing, and forming complex communities 
and even whole organisms—generate order and thus might appear to 
defy the second law of thermodynamics. This is not the case, however, 
because a cell is not an isolated system. Rather, a cell takes in energy 
from its environment—in the form of food, inorganic molecules, or pho- 
tons of light from the sun—and uses this energy to generate order within 
itself, forging new chemical bonds and building large macromolecules. 
In the course of performing the chemical reactions that generate order, 
some energy is inevitably lost in the form of heat (see Figure 3-2). Heat 
is energy in its most disordered form—the random jostling of molecules 
(analogous to the random jostling of the coins in the box). Because the 
cell is not an isolated system, the heat energy produced by metabolic 
reactions is quickly dispersed into the cell’s surroundings. There, the 
heat increases the intensity of the thermal motions of nearby molecules, 
thereby increasing the entropy of the cell’s environment (Figure 3-5). 


To satisfy the second law of thermodynamics, the amount of heat released 
by a cell must be great enough that the increased order generated inside 
the cell is more than compensated for by the increased disorder gener- 
ated in the environment. In other words, the chemical reactions inside a 
cell must increase the total entropy of the entire system: that of the cell 
plus its environment. Thanks to the cell’s activity, the universe thereby 
becomes more disordered—and the second law of thermodynamics is 
obeyed. 


Cells Can Convert Energy from One Form to Another 


Where does the heat released by cells as they generate order come from? 
To understand that, we need to consider another important physical law. 
According to the first law of thermodynamics, energy cannot be created or 
destroyed—but it can be converted from one form to another (Figure 3—6). 
Cells take advantage of this law of thermodynamics, for example, when 
they convert the energy from sunlight into the energy in the chemical 
bonds of sugars and other small organic molecules during photosynthe- 
sis. Although the chemical reactions that power such energy conversions 
can change how much energy is present in one form or another, the first 
law tells us that the total amount of energy in the universe must always 
be the same. 


Heat, too, is a product of energy conversion. When an animal cell breaks 
down foodstuffs, some of the energy in the chemical bonds in the food 
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molecules (chemical-bond energy) is converted into the thermal motion 
of molecules (heat energy). This conversion of chemical energy into heat 
energy causes the universe as a whole to become more disordered—as 
required by the second law of thermodynamics. But a cell cannot derive 
any benefit from the heat energy it produces unless the heat-generating 
reactions are directly linked to processes that maintain molecular order 
inside the cell. It is the tight coupling of heat production to an increase 
in order that distinguishes the metabolism of a cell from the wasteful 
burning of fuel in a fire. Later in this chapter, we illustrate how this cou- 
pling occurs. For the moment, it is sufficient to recognize that—by directly 
linking the “burning” of food molecules to the generation of biological 
order—cells are able to create and maintain an island of order in a uni- 
verse tending toward chaos. 


Photosynthetic Organisms Use Sunlight to Synthesize 
Organic Molecules 


All animals live on energy stored in the chemical bonds of organic mol- 
ecules, which they take in as food. These food molecules also provide the 


photosynthesis 
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Figure 3-6 Different forms of energy are 
interconvertible, but the total amount 
of energy must be conserved. (A) We 
can use the height and weight of the brick 
to predict exactly how much heat will be 
released when it hits the floor. (B) The 
large amount of chemical-bond energy 
released when water (H20) is formed from 
H2 and Oz is initially converted to very 
rapid thermal motions in the two new HzO 
molecules; however, collisions with other 
H2O molecules almost instantaneously 
spread this kinetic energy evenly throughout 
the surroundings (heat transfer), making 
the new H20 molecules indistinguishable 
from all the rest. (C) Cells can convert 
chemical-bond energy into kinetic energy 
to drive, for example, molecular motor 
proteins; however, this occurs without 

the intermediate conversion of chemical 
energy to electrical energy that a man- 
made appliance such as this fan requires. 
(D) Some cells can also harvest the energy 
from sunlight to form chemical bonds via 
photosynthesis. 
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Figure 3-7 With few exceptions, the 
radiant energy of sunlight sustains 

all life. Trapped by plants and some 
microorganisms through photosynthesis, 
light from the sun is the ultimate source of 
all energy for humans and other animals. 
(Wheat Field Behind Saint-Paul Hospital 
with a Reaper by Vincent van Gogh. 
Courtesy of Museum Folkwang, Essen.) 


QUESTION 3-1 


Consider the equation 
light energy + CO2 + H20 —> 
sugars + O2 + heat energy 
Would you expect this reaction to 
occur in a single step? Why must 
heat be generated in the reaction? 
Explain your answers. 


Figure 3-8 Photosynthesis takes N 
in two stages. The activated carriers 
generated in the first stage, ATP and 
NADPH, are described in detail later in the 
chapter. 


atoms that animals need to construct new living matter. Some animals 
obtain their food by eating other animals, others by eating plants. Plants, 
by contrast, obtain their energy directly from sunlight. Thus, the energy 
animals obtain by eating plants—or by eating animals that have eaten 
plants—ultimately comes from the sun (Figure 3-7). 


Solar energy enters the living world through photosynthesis, a process 
that converts the electromagnetic energy in sunlight into chemical-bond 
energy in cells. Photosynthetic organisms—including plants, algae, and 
some bacteria—use the energy they derive from sunlight to synthesize 
small chemical building blocks such as sugars, amino acids, nucleotides, 
and fatty acids. These small molecules in turn are converted into the 
macromolecules—the proteins, nucleic acids, and polysaccharides—that 
form the plant. 


We describe the elegant mechanisms that underlie photosynthesis in 
detail in Chapter 14. Generally speaking, the reactions of photosynthe- 
sis take place in two stages. In the first stage, energy from sunlight is 
captured and transiently stored as chemical-bond energy in specialized 
molecules called activated carriers, which we discuss in more detail later 
in the chapter. All of the oxygen (O2) in the air we breathe is generated by 
the splitting of water molecules during this first stage of photosynthesis. 


In the second stage, the activated carriers are used to help drive a carbon- 

fixation process, in which sugars are manufactured from carbon dioxide 
gas (CO2). In this way, photosynthesis generates an essential source of 
stored chemical-bond energy and other organic materials—for the plant 
itself and for any animals that eat it. The two stages of photosynthesis are 
summarized in Figure 3-8. 


Cells Obtain Energy by the Oxidation of Organic 
Molecules 


To live, grow, and reproduce, all organisms rely on the energy stored in 
the chemical bonds of organic molecules—either the sugars that a plant 
has produced by photosynthesis as food for itself or the mixture of large 
and small molecules that an animal has eaten. In both plants and ani- 
mals, this chemical energy is extracted from food molecules by a process 
of gradual oxidation, or controlled burning. 


Earth’s atmosphere is about 21% oxygen. In the presence of oxygen, the 
most energetically stable form of carbon is CO2 and that of hydrogen is 
H20; the oxidation of carbon-containing molecules is therefore ener- 
getically very favorable. A cell is able to obtain energy from sugars or 
other organic molecules by allowing the carbon and hydrogen atoms in 
these molecules to combine with oxygen—that is, become oxidized—to 
produce CO? and H20, respectively. This complex step-wise process by 
which food molecules are broken down to produce energy is known as 
cell respiration. 


Photosynthesis and cell respiration are complementary processes (Figure 
3-9). Plants, animals, and microorganisms have existed together on this 
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planet for so long that they have become an essential part of each other's 
environments. The oxygen released by photosynthesis is consumed by 
nearly all organisms for the oxidative breakdown of organic molecules. 
And some of the COz molecules that today are incorporated into organic 
molecules by photosynthesis in a green leaf were released yesterday into 
the atmosphere by the respiration of an animal, a fungus, or the plant 
itself{—or by the burning of fossil fuels. Carbon atoms therefore pass 
through a huge cycle that involves the entire biosphere—the collection 
of living things on Earth—as they move between individual organisms 
(Figure 3-10). 


Oxidation and Reduction Involve Electron Transfers 


The cell does not oxidize organic molecules in one step, as occurs when 
organic material is burned in a fire. Through the use of enzyme catalysts, 
metabolism directs the molecules through a series of chemical reactions, 
few of which actually involve the direct addition of oxygen. Before we 
consider these reactions, we need to explain what is meant by oxidation. 


Although the term oxidation literally means the addition of oxygen 
atoms to a molecule, oxidation is said to occur in any reaction in which 
electrons are transferred between atoms. Oxidation, in this sense, 
involves the removal of electrons from an atom. Thus, Fe2* is oxidized 
when it loses an electron to become Fe*+. The converse reaction, called 
reduction, involves the addition of electrons to an atom. Fe** is reduced 
when it gains an electron to become Fe?*, and a chlorine atom is 
reduced when it gains an electron to become CI 


Because the number of electrons is conserved in a chemical reaction 
(there is no net loss or gain), oxidation and reduction always occur 
simultaneously: that is, if one molecule gains an electron in a reaction 
(reduction), a second molecule must lose the electron (oxidation). 


CO, in atmosphere and water 


animals 


sediments and 
fossil fuels 
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Figure 3-9 Photosynthesis and cell 
respiration are complementary processes 
in the living world. The left side of the 
diagram shows how photosynthesis— 
carried out by plants and photosynthetic 
microorganisms—uses the energy of 
sunlight to produce sugars and other 
organic molecules from the carbon 
atoms in COz in the atmosphere. In turn, 
these molecules serve as food for other 
organisms. The right side of the diagram 
shows how cell respiration in most 
organisms—including plants and other 
photosynthetic organisms—uses O2 to 
oxidize food molecules, releasing the same 
carbon atoms in the form of COz back to the 
atmosphere. In the process, the organisms 
obtain the useful chemical-bond energy that 
they need to survive. 

The first cells on Earth are thought to have 
been capable of neither photosynthesis 
nor cell respiration (discussed in Chapter 
14). However, photosynthesis must have 
preceded cell respiration on the Earth, 
because there is strong evidence that 
billions of years of photosynthesis were 
required to release enough O>2 to create an 
atmosphere that could support respiration. 


Figure 3-10 Carbon atoms cycle 
continuously through the biosphere. 
Individual carbon atoms are incorporated 
into organic molecules of the living world by 
the photosynthetic activity of plants, algae, 
and bacteria. They then pass to animals 
and microorganisms—as well as into 
organic material in soil and oceans—and 
are ultimately restored to the atmosphere 
in the form of CO2 when organic molecules 
are oxidized by cells during respiration 

or burned by humans as fossil fuels. In 

this diagram, the green arrow denotes an 
uptake of CO2, whereas the red arrows 
indicate CO? release. 
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Figure 3-11 Oxidation and reduction involve a shift in the balance of electrons. 
(A) When two atoms form a polar covalent bond, the atom that ends up with a greater 
share of electrons (represented by the blue clouds) is said to be reduced, while the 
other atom, with a lesser share of electrons, is said to be oxidized. Electrons are 
attracted to the atom that has greater electronegativity (as discussed in Chapter 2, 

p. 45). As a result, the reduced atom acquires a partial negative charge (87); conversely, 
the oxidized atom acquires a partial positive charge (6*), as the positive charge on 

the atomic nucleus now exceeds the total charge of the electrons surrounding it. (B) 

A simple reduced carbon compound, such as methane, can be oxidized in a stepwise 
fashion by the successive replacement of its covalently bonded hydrogen atoms with 

oxygen atoms. With each step, electrons are shifted away from the carbon, and the 

carbon atom becomes progressively more oxidized. Moving in the opposite direction, Cep 
carbon dioxide becomes progressively more reduced as its oxygen atoms are 

replaced by hydrogens to yield methane. carbon dioxide 


t formaldehyde 


| formic acid 


Why is a “gain” of electrons referred to as a “reduction”? The term arose 
before anything was known about the movement of electrons. Originally, 
reduction reactions involved a liberation of oxygen—for example, when 
metals are extracted from ores by heating—which caused the samples to 
become lighter; in other words, “reduced” in mass. 


It is important to recognize that the terms oxidation and reduction apply 
even when there is only a partial shift of electrons between atoms. When 
a carbon atom becomes covalently bonded to an atom with a strong 
affinity for electrons—oxygen, chlorine, or sulfur, for example—it gives up 
more than its equal share of electrons to form a polar covalent bond. The 
positive charge of the carbon nucleus now slightly exceeds the negative 
charge of its electrons, so that the carbon atom acquires a partial positive 
charge (65*) and is said to be oxidized. Conversely, the carbon atom in a 
C-H bond has somewhat more than its share of electrons; it acquires a 
partial negative charge (ò) and so is said to be reduced (Figure 3-11A). 


In such oxidation-reduction reactions, electrons generally do not travel 
alone. When a molecule in a cell picks up an electron (e), it often picks up 
a proton (H*) at the same time (protons being freely available in water). 
The net effect in this case is to add a hydrogen atom to the molecule: 


A +€ +H*— AH 


Even though a proton is involved (in addition to the electron), such 
hydrogenation reactions are reductions, and the reverse dehydrogenation 
reactions are oxidations. An easy way to tell whether an organic mol- 
ecule is being oxidized or reduced is to count its C-H bonds: an increase 
in the number of C-H bonds indicates a reduction, whereas a decrease 
indicates an oxidation (Figure 3-11B). 


As we will see later in this chapter—and again in Chapter 13—cells use 
enzymes to catalyze the oxidation of organic molecules in small steps, 
through a sequence of reactions that allows much of the energy that is 
released to be harvested in useful forms, instead of being liberated as heat. 


FREE ENERGY AND CATALYSIS 


Life depends on the highly specific chemical reactions that take place 
inside cells. The vast majority of these reactions are catalyzed by pro- 
teins called enzymes. Enzymes, like cells, must obey the second law of 


thermodynamics. Although an individual enzyme can greatly accelerate 
an energetically favorable reaction—one that produces disorder in the 
universe—it cannot force an energetically unfavorable reaction to occur. 
Cells, however, must do just that in order to grow and divide—or just to 
survive. They must build highly ordered and energy-rich molecules from 
small and simple ones—a process that requires an input of energy. 


To understand how enzymes promote the acceleration of the specific 
chemical reactions needed to sustain life, we first need to examine the 
energetics involved. In this section, we consider how the free energy of 
molecules contributes to their chemistry, and we see how free-energy 
changes—which reflect how much total disorder is generated in the uni- 
verse by a reaction—influence whether and how a reaction will proceed. 
Examining these energetic concepts will reveal how enzymes work- 
ing together can exploit the free-energy changes of different reactions 
to drive the energetically unfavorable reactions that produce biological 
order. This type of enzyme-assisted catalysis is crucial for cells: without 
it, life could not exist. 


Chemical Reactions Proceed in the Direction That 
Causes a Loss of Free Energy 


Paper burns readily, releasing into the atmosphere water and carbon 
dioxide as gases, while simultaneously releasing energy as heat: 


paper + O2 > smoke + ashes + heat + CO2 + H2O 


This reaction occurs in only one direction: smoke and ashes never spon- 
taneously gather carbon dioxide and water from the heated atmosphere 
and reconstitute themselves into paper. When paper burns, most of 
its chemical energy is dissipated as heat. This heat is not lost from the 
universe, since energy can never be created or destroyed; instead, it is 
irretrievably dispersed in the chaotic random thermal motions of mol- 
ecules. In the language of thermodynamics, there has been a release of 
free energy—that is, energy that can be harnessed to do work or drive 
chemical reactions. This release reflects a loss of orderliness in the way 
the energy and molecules had been stored in the paper; the greater the 
free-energy change, the greater the amount of disorder created in the 
universe when the reaction occurs. 


We will discuss free energy in more detail shortly, but a general principle 
can be summarized as follows: chemical reactions proceed only in the 
direction that leads to a loss of free energy. In other words, the sponta- 
neous direction for any reaction is the direction that goes “downhill.” A 
“downhill” reaction in this sense is said to be energetically favorable. 


Enzymes Reduce the Energy Needed to Initiate 
Spontaneous Reactions 


Although the most energetically favorable form of carbon under ordi- 
nary conditions is CO2, and that of hydrogen is H20, a living organism 
will not disappear in a puff of smoke, and the book in your hands will 
not burst spontaneously into flames. This is because the molecules in 
both the living organism and the book are in a relatively stable state, and 
they cannot be changed to lower-energy states without an initial input of 
energy. In other words, a molecule requires a boost over an energy barrier 
before it can undergo a chemical reaction that moves it to a lower-energy 
(more stable) state. This boost is known as the activation energy (Figure 
3-12A). In the case of a burning book, the activation energy is provided 
by the heat of a lighted match. But cells can’t raise their temperature to 
drive biological reactions. Inside cells, the push over the energy barrier is 
aided by enzymes. 


QUESTION 3-2 


Free Energy and Catalysis 


In which of the following reactions 
does the red atom undergo an 


oxidation? 
A. Na— Nat (Na atom — Nat ion) 
B. Cl> Cr (Cl atom — CI ion) 
C. CH3CH2OH — CH3CHO 
(ethanol — acetaldehyde) 

D. CH3CHO — CH3COO- 

(acetaldehyde — acetic acid) 
E. CH2=CH2 — CH3CH3 


(ethene — ethane) 
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Figure 3-12 Even energetically favorable 
reactions require activation energy to get 
them started. (A) Compound Y (a reactant) 
is in a relatively stable state; thus energy 

is required to convert it to compound X 

(a product), even though X is at a lower 
overall energy level than Y. This conversion 
will not take place, therefore, unless 
compound Y can acquire enough activation 
energy (energy a minus energy b) from its 
surroundings to undergo the reaction that 
converts it into compound X. This energy 
may be provided by means of an unusually 
energetic collision with other molecules. For 
the reverse reaction, X — Y, the activation 
energy required will be much larger (energy 
a minus energy œo); this reaction will therefore 
occur much more rarely. The total energy 
change for the energetically favorable 
reaction Y — X is energy c minus energy 

b, a negative number, which corresponds 

to a loss of free energy. (B) Energy barriers 
for specific reactions can be lowered by 
catalysts, as indicated by the line marked d. 
Enzymes are particularly effective catalysts 
because they greatly reduce the activation 
energy for the reactions they catalyze. Note 
that activation energies are always positive. 


Figure 3-13 Lowering the activation 
energy greatly increases the probability 
that a reaction will occur. At any given 
instant, a population of identical substrate 
molecules will have a range of energies, 
distributed as shown on the graph. The 
varying energies come from collisions with 
surrounding molecules, which make the 
substrate molecules jiggle, vibrate, and 
spin. For a molecule to undergo a chemical 
reaction, the energy of the molecule must 
exceed the activation-energy barrier for that 
reaction (dashed lines); for most biological 
reactions, this almost never happens 
without enzyme catalysis. Even with enzyme 
catalysis, only a small fraction of substrate 
molecules (red shaded area) will experience 
the highly energetic collisions needed to 
reach an energy state high enough for them 
to undergo a reaction. 
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activation enzyme lowers 
energy for activation 
reaction energy for 


YX 


catalyzed 


reaction 
Y>X 


total energy ——» 
total energy ———» 


(A) uncatalyzed (B) 
reaction pathway 


enzyme-catalyzed 
reaction pathway 


Each enzyme binds tightly to one or two molecules, called substrates, 
and holds them in a way that greatly reduces the activation energy needed 
to facilitate a specific chemical interaction between them (Figure 3-12B). 
A substance that can lower the activation energy of a reaction is termed 
a catalyst; catalysts increase the rate of chemical reactions because they 
allow a much larger proportion of the random collisions with surround- 
ing molecules to kick the substrates over the energy barrier, as illustrated 
in Figure 3-13 and Figure 3-14A. Enzymes are among the most effective 
catalysts known. They can speed up reactions by a factor of as much as 
10!4—that is, trillions of times faster than the same reactions would pro- 
ceed without an enzyme catalyst. Enzymes therefore allow reactions that 
would not otherwise occur to proceed rapidly at the normal temperature 
inside cells. 


Unlike the effects of temperature, enzymes are highly selective. Each 
enzyme usually speeds up—or catalyzes—only one particular reaction 
out of the several possible reactions that its substrate molecules could 
undergo. In this way, enzymes direct each of the many different mol- 
ecules in a cell along specific reaction pathways (Figure 3-14B and C), 
thereby producing the compounds that the cell actually needs. 


Like all catalysts, enzyme molecules themselves remain unchanged after 
participating in a reaction and can therefore act over and over again 
(Figure 3-15). In Chapter 4, we will discuss further how enzymes work, 
after we have looked in detail at the molecular structure of proteins. 


The Free-Energy Change for a Reaction Determines 
Whether It Can Occur 


According to the second law of thermodynamics, a chemical reaction 
can proceed only if it results in a net (overall) increase in the disorder of 


energy needed 
to undergo an 
uncatalyzed 
chemical reaction 


energy required 
to undergo 
the enzyme-catalyzed 
chemical reaction 


number of molecules —> 


Ld 
molecules with 
average energy 


energy per molecule —> 


uncatalyzed reaction—waves not large 
enough to surmount barrier 


(A) 


enzyme catalysis 


uncatalyzed 
(B) of reaction 1 © 


the universe (see Figure 3-5). Disorder increases when useful energy that 
could be harnessed to do work is dissipated as heat. The useful energy in 
a system is known as its free energy, or G. And because chemical reac- 
tions involve a transition from one molecular state to another, the term 
that is of most interest to chemists and cell biologists is the free-energy 
change, denoted AG (“Delta G”). 


Let’s consider a collection of molecules. AG measures the amount of dis- 
order created in the universe when a reaction involving these molecules 
takes place. Energetically favorable reactions, by definition, are those that 
create disorder in the universe by decreasing the free energy of the sys- 
tem to which they belong; in other words, they have a negative AG (Figure 
3-16). 


A reaction can occur spontaneously only if AG is negative. On a mac- 
roscopic scale, an energetically favorable reaction with a negative AG 
is the relaxation of a compressed spring into an expanded state, which 
releases its stored elastic energy as heat to its surroundings. On a micro- 
scopic scale, an energetically favorable reaction—one with a negative 
AG—occurs when salt (NaCl) dissolves in water. Note that just because 
a reaction can occur spontaneously does not mean it will occur quickly. 
The decay of diamonds into graphite is a spontaneous process—but it 
takes millions of years. 
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complex complex 
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Figure 3-14 Enzymes catalyze reactions 
by lowering the activation-energy barrier. 
(A) The dam represents the activation 
energy, which is lowered by enzyme 
catalysis. Each green ball represents 

a potential substrate molecule that is 
bouncing up and down in energy level 
owing to constant encounters with waves, 
an analogy for the thermal bombardment of 
substrate molecules by surrounding water 
molecules. When the barrier—the activation 
energy—is lowered significantly, the balls 
(substrate molecules) with sufficient energy 
can roll downhill, an energetically favorable 
movement. (B) The four walls of the box 
represent the activation-energy barriers 

for four different chemical reactions that 

are all energetically favorable because the 
products are at lower energy levels than 

the substrates. In the left-hand box, none 
of these reactions occurs because even 

the largest waves are not large enough to 
surmount any of the energy barriers. In the 
right-hand box, enzyme catalysis lowers 

the activation energy for reaction number 

1 only; now the jostling of the waves allows 
the substrate molecule to pass over this 
energy barrier, allowing reaction 1 to 
proceed (Movie 3.1). (C) A branching set 
of reactions with a selected set of enzymes 
(yellow boxes) serves to illustrate how a 
series of enzyme-catalyzed reactions—by 
controlling which reaction will take place 

at each junction—determines the exact 
reaction pathway followed by each molecule 
inside the cell. 


Figure 3-15 Enzymes convert substrates 
to products while remaining unchanged 
themselves. Catalysis takes place in a cycle 
in which a substrate molecule (rea) binds 
to an enzyme and undergoes a reaction to 
form a product molecule (yellow), which 
then gets released. Although the enzyme 
participates in the reaction, it remains 
unchanged. 
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The free energy of Y 
is greater than the free 
energy of X. Therefore 


ENERGETICALLY ) AG is negative (< 0), and 
FAVORABLE the disorder of the 


REACTION universe increases when 
Y is converted to X. 


this reaction can occur spontaneously 


®© If the reaction XY 
occurred, AG would 

be positive (> 0), and 
ENERGETICALLY the universe would 


UNFAVORABLE ® bema more 
REACTION ordered. 


this reaction can occur only if 
it is driven by being coupled to a second, 
energetically favorable reaction 


Figure 3-16 Energetically favorable 
reactions have a negative AG, whereas 
energetically unfavorable reactions have 
a positive AG. Imagine, for example, that 
molecule Y has a free energy (G) of 

10 kilojoules (kJ) per mole, whereas X has 
a free energy of 4 kJ/mole. The reaction 

Y — X therefore has a AG of —6 kJ/mole, 
making it energetically favorable. 


negative 
AG 


positive 
AG 


Figure 3-17 Reaction coupling can drive 
an energetically unfavorable reaction. The 
energetically unfavorable (AG > 0) reaction 
X — Y cannot occur unless it is coupled to 
an energetically favorable (AG < 0) reaction 
C — D, such that the net free-energy 
change for the pair of reactions is negative 
(less than 0). 


Energetically unfavorable reactions, by contrast, create order in the 
universe; they have a positive AG. Such reactions—for example, the 
formation of a peptide bond between two amino acids—cannot occur 
spontaneously; they take place only when they are coupled to a second 
reaction with a negative AG large enough that the net AG of the entire 
process is negative (Figure 3-17). Life is possible because enzymes can 
create biological order by coupling energetically unfavorable reactions 
with energetically favorable ones. These critical concepts are summa- 
rized, with examples, in Panel 3-1 (pp. 94-95). 


AG Changes as a Reaction Proceeds Toward Equilibrium 


It’s easy to see how a tensed spring, when left to itself, will relax and 
release its stored energy to the environment as heat. But chemical 
reactions are a bit more complex—and harder to intuit. That’s because 
whether a reaction will proceed in a particular direction depends not only 
on the energy stored in each individual molecule, but also on the con- 
centrations of the molecules in the reaction mixture. Going back to our 
jiggling box of coins, more coins will flip from a head to a tail orientation 
when the box contains 90 heads and 10 tails than when the box contains 
10 heads and 90 tails. 


The same is true for a chemical reaction. As the energetically favorable 
reaction Y > X proceeds, the concentration of the product X will increase 
and the concentration of the substrate Y will decrease. This change in 
relative concentrations of substrate and product will cause the ratio of Y 
to X to shrink, making the initially favorable AG less and less negative. 
Unless more Y is added, the reaction will slow and eventually stop. 


Because AG changes as products accumulate and substrates are depleted, 
chemical reactions will generally proceed until they reach a state of 
equilibrium. At that point, the rates of the forward and reverse reactions 
are equal, and there is no further net change in the concentrations of 
substrate or product (Figure 3-18). For reactions at chemical equilibrium, 
AG = 0, so the reaction will not proceed forward or backward, and no 
work can be done. 


Such a state of chemical inactivity would be incompatible with life, inevi- 
tably allowing chemical decay to overcome the cell. Living cells work 
hard to avoid reaching a state of complete chemical equilibrium. They 
are constantly exchanging materials with their environment: replenish- 
ing nutrients and eliminating waste products. In addition, many of the 
individual reactions in the cell’s complex metabolic network also exist 
in disequilibrium because the products of one reaction are continually 
being siphoned off to become the substrates in a subsequent reaction. 
Rarely do products and substrates reach concentrations at which the for- 
ward and reverse reaction rates are equal. 


The Standard Free-Energy Change, AC’, Makes It 
Possible to Compare the Energetics of Different Reactions 


Because AG depends on the concentrations of the molecules in the reac- 
tion mixture at any given time, it is not a particularly useful value for 
comparing the relative energies of different types of chemical reactions. 
But such energetic assessments are necessary, for example, to predict 
whether an energetically favorable reaction is likely to have a AG negative 
enough to drive an energetically unfavorable reaction. To compare reac- 
tions in this way, we need to turn to the standard free-energy change 
of a reaction, AG°. A reaction’s AG? is independent of concentration; it 
depends only on the intrinsic characters of the reacting molecules, based 


FOR THE ENERGETICALLY FAVORABLE REACTION Y — X, 
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Figure 3-18 Reactions will eventually 
reach a chemical equilibrium. At that 
point, the forward and the backward 

fluxes of reacting molecules are equal and 
opposite. The widths of the arrows indicate 
the relative rates at which an individual 
molecule converts. 
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when X and Y are at equal concentrations, [Y] = [X], the formation of X 

is energetically favored. In other words, the AG of Y — X is negative and 

the AG of X — Y is positive. Nevertheless because of thermal bombardments, 
there will always be some X converting to Y. 


THUS, FOR EACH INDIVIDUAL MOLECULE, 
U — Hy 
© x 
r— > 
x © 


Therefore, if one starts with an 
equal mixture, the ratio of X to Y 
molecules will increase 


conversion of 
Y to X will 
occur often. 


Conversion of X to Y 
will occur less often 
than the transition 
Y — X, because it 
requires a more 
energetic collision. 


EVENTUALLY, there will be a large enough excess of X over Y to just 
compensate for the slow rate of X — Y, such that the number of X molecules 
being converted to Y molecules each second is exactly equal to the number 
of Y molecules being converted to X molecules each second. At this point, 
the reaction will be at equilibrium. 


AT EQUILIBRIUM, there is no net change in the ratio of Y to X, and the 
AG for both forward and backward reactions is zero. 


on their behavior under ideal conditions where the concentrations of all 
the reactants are set to the same fixed value of 1 mole/liter in aqueous 
solution. 


A large body of thermodynamic data has been collected from which AG° 
can be calculated for most metabolic reactions. Some common reactions 
are compared in terms of their AG° in Panel 3-1 (pp. 94-95). 


The AG of a reaction can be calculated from AG? if the concentrations of 
the reactants and products are known. For the simple reaction Y > X, 


their relationship follows this equation: 
[X] 


AG = AG? + RT In — 
[Y] 


where AG is in kilojoules per mole, [Y] and [X] denote the concentrations 
of Y and X in moles/liter (a mole is 6 x 1023 molecules of a substance), In 
is the natural logarithm, and RT is the product of the gas constant, R, and 
the absolute temperature, T. At 37°C, RT = 2.58. 


From this equation, we can see that when the concentrations of reac- 
tants and products are equal—in other words, [X]/[Y] = 1—the value of AG 
equals the value of AG®° (because In 1 = 0). Thus when the reactants and 
products are present in equal concentrations, the direction of the reaction 
depends entirely on the intrinsic properties of the molecules. 


QUESTION 3-3 


Consider the analogy of the jiggling 
box containing coins that was 
described on page 83. The reaction, 
the flipping of coins that either 

face heads up (H) or tails up (T), is 
described by the equation 

H <> T, where the rate of the 
forward reaction equals the rate of 
the reverse reaction. 

A. What are AG and AG? in this 
analogy? 

B. What corresponds to the 
temperature at which the reaction 
proceeds? What corresponds to the 
activation energy of the reaction? 
Assume you have an “enzyme,” 
called jigglase, which catalyzes this 
reaction. What would the effect of 
jigglase be and what, mechanically, 
might jigglase do in this analogy? 
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This panel reviews the concept of free energy and offers 
examples showing how changes in free energy determine 
whether—and how—biological reactions occur. 


The molecules of a living cell possess energy because of their 
vibrations, rotations, and movement through space, and 
because of the energy that is stored in the bonds between 


individual atoms. 
= 


ed 
=. E 
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The free energy, G (in kJ/mole), measures the energy of a 
molecule that could in principle be used to do useful work at 
constant temperature, as in a living cell. Energy can also be 
expressed in calories (1 joule = 0.24 calories). 


= 
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REACTIONS CAUSE DISORDER 


Think of a chemical reaction occurring in a cell that has a 
constant temperature and volume. This reaction can produce 
disorder in two ways. 


K Changes of bond energy of the reacting molecules can 


cause heat to be released, which disorders the environment 


around the cell. 


heat 
— 
cell 


i The reaction can decrease the amount of order in the 
cell—for example, by breaking apart a long 
chain of molecules, or by disrupting an interaction that 
prevents bond rotations. 


C- ED 


PREDICTING REACTIONS 


To predict the outcome of a reaction (Will it proceed to the 
right or to the left? At what point will it stop?), we must 
determine its standard free-energy change (AG°). 

This quantity represents the gain or loss of free energy as one 
mole of reactant is converted to one mole of product under 
“standard conditions” (all molecules present in aqueous 
solution at a concentration of 1 M and pH 7.0). 


AG° for some reactions 


v 
= glucose 1-P —» glucose 6-P -7.3 kJ/mole 
E 

2 sucrose —> glucose + fructose -23 kJ/mole 
= 

= ATP —> ADP + ® -30.5 kJ/mole 


glucose + 60, —> 6CO,+6H,O -2867 kJ/mole 


AG ("DELTA G”) 


Changes in free energy occurring in a reaction are 
denoted by AG, where “A” indicates a difference. Thus, 
for the reaction 

A+B— C+D 


AG = free energy (C + D) minus free energy (A + B) 


AG measures the amount of disorder caused by a 
reaction: the change in order inside the cell, plus the 
change in order of the surroundings caused by the heat 
released. 


AG is useful because it measures how far away from 
equilibrium a reaction is. The reaction 


Di — Aor +O 


has a large negative AG because cells keep the reaction 
a long way from equilibrium by continually making fresh 
ATP. However, if the cell dies, then most of its ATP will be 
hydrolyzed until equilibrium is reached; at equilibrium, 
the forward and backward reactions occur at equal rates 
and AG = 0. 


SPONTANEOUS REACTIONS 


From the second law of thermodynamics, we know 
that the disorder of the universe can only increase. AG 
is negative if the disorder of the universe (reaction plus 
surroundings) increases. 


In other words, a chemical reaction that occurs 
spontaneously must have a negative AG: 


Gproducts -= Greactants = AG < 0 


The difference in free energy of 100 mL of 
10 mM sucrose (common sugar) and 100 mL of 10 mM 
glucose plus 10 mM fructose is about -23 joules. 
Therefore, the hydrolysis reaction that produces two 
monosaccharides from a disaccharide (sucrose > 
glucose + fructose) can proceed spontaneously. 


N a á 


es 
eo 


a 
o- -23 joules 
e 


sucrose glucose + 
fructose 


In contrast, the reverse reaction (glucose + fructose > 
sucrose), which has a AG of +23 joules, could not occur 
without an input of energy from a coupled reaction. 


REACTION RATES 


A spontaneous reaction is not necessarily a rapid reaction: 

a reaction with a negative free-energy change (AG) will not 
necessarily occur rapidly by itself. Consider, for example, the 
combustion of glucose in oxygen: 


l 
OH H © + 6O, —* 6CO, + 6H,0 


AG° = -2867 kJ/mole 


Even this highly favorable reaction may not occur for centuries 
unless enzymes are present to speed up the process. Enzymes 
are able to catalyze reactions and speed up their rate, but they 
cannot change the AG° of a reaction. 


CHEMICAL EQUILIBRIA 


A fixed relationship exists between the standard 
free-energy change of a reaction, AG°, and its equilibrium 
constant K. For example, the reversible reaction 


Yae=x 


will proceed until the ratio of concentrations [X]/[Y] is 
equal to K (note: square brackets [ ] indicate 
concentration). At this point, the free energy of the 
system will have its lowest value. 


t 


equilibrium 
point 


free 
energy lowest 
of system S <«— free 
$ energy 
Ix] > 
[Y] 
At 37°C, AG° = -5.94 logioK (see text, p. 96) 
K = 10746594 
For example, the reaction 
CH,OH CH,O-P) 
S o-® on 
— 
glucose 1-P glucose 6-P 


has AG° = -7.3 kJ/mole. Therefore, its equilibrium 
constant 
K = 107352 = 191-23) -17 


So the reaction will reach steady state when 
[glucose 6-P]/[glucose 1-P] = 17 
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COUPLED REACTIONS 


Reactions can be “coupled” together if they share one or 
more intermediates. In this case, the overall free-energy 
change is simply the sum of the individual AG° values. A 
reaction that is unfavorable (has a positive AG°) can for this 
reason be driven by a second, highly favorable reaction. 


SINGLE REACTION 


© + — @eoa% 5 
+23 kJ/mole 
glucose fructose sucrose 


NET RESULT: reaction will not occur 


ATP — ADP + @® AG°=-30.5 kJ/mole 


NET RESULT: reaction is highly favorable 


COUPLED REACTIONS 


@ + ar — @® +a 


AG°= 
glucose glucose 1-P 

23 -30.5= 
ee + > @ + ® -7.5 kJ/mole 
glucose 1-P fructose sucrose 


sucrose is made in a reaction driven 
by the hydrolysis of ATP 


HIGH-ENERGY BONDS 


One of the most common reactions in the cell is 
hydrolysis, in which a covalent bond is split by adding 
water. 


* hydrolysis 
B) ——> OH +H 


The AG° for this reaction is sometimes loosely termed 
the “bond energy.” Compounds such as acetyl 
phosphate and ATP, which have a large negative AG° 
of hydrolysis in an aqueous solution, are said to have 
“high-energy” bonds. 


AG° 

(kJ/mole) 
acetyl{P) —» acetate + @) -43.1 
ATP — ADP + (P) -30.5 
glucose 6-P —» glucose + (P) -13.8 


(Note that, for simplicity, HyO is omitted from the above 
equations.) 
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TABLE 3-1 RELATIONSHIP 
BETWEEN THE STANDARD FREE- 
ENERGY CHANGE, AG°, AND THE 
EQUILIBRIUM CONSTANT 


Equilibrium 


Standard Free-Energy 
Change (AG°) for 
Reaction 

Y — X (kJ/mole) 


Constant 


[X]/[Y] 


103 -17.8 
102 -11.9 

10 -5.9 

1 0 

O 5.9 
ie 11.9 
10 17.8 
Ona 23.8 
10-5 29.7 


Values of the equilibrium constant were 
calculated for the simple chemical 
reaction Y — X, using the equation given 
in the text. 


The AG” values given here are in 
kilojoules per mole at 37°C. As explained 
in the text, AG? represents the free- 
energy difference under standard 
conditions (where all components 

are present at a concentration of 

1 mole/liter). 


From this table, we see that if there 

is a favorable free-energy change 

of -17.8 kJ/mole for the transition Y > X, 
there will be 1000 times more molecules 
of X than of Y at equilibrium (K = 1000). 
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The Equilibrium Constant Is Directly Proportional to AG? 


As mentioned earlier, all chemical reactions tend to proceed toward 
equilibrium. Knowing where that equilibrium lies for any given reaction 
will reveal which way the reaction will proceed—and how far it will go. 
For example, if a reaction is at equilibrium when the concentration of the 
product is ten times the concentration of the substrate, and we begin with 
a surplus of substrate and little or no product, the reaction will continue 
to proceed forward. The ratio of substrate to product at this equilibrium 
point is called the reaction’s equilibrium constant, K. For the simple 
reaction Y > X, 
[X] 
K=— 
[Y] 
where [X] is the concentration of the product and [Y] is the concentration 
of the substrate at equilibrium. In the example we just described, K = 10. 


The equilibrium constant depends on the intrinsic properties of the mol- 
ecules involved, as expressed by AQ’. In fact, the equilibrium constant is 
directly proportional to AG°. Let’s see why. 


At equilibrium, the rate of the forward reaction is exactly balanced by 
the rate of the reverse reaction. At that point, AG = 0, and there is no net 
change of free energy to drive the reaction in either direction (see Panel 
3-1, pp. 94-95). 


Now, if we return to the equation presented on page 93, 
[X] 
AG = AG° + RT In — 
[Y] 


we can see that, at equilibrium at 37°C, where AG = 0 and the constant 
RT = 2.58, this equation becomes: 


[X] 
AG?° = -2.58 In — 
[Y] 
In other words, AG° is directly proportional to the equilibrium constant, K: 
AG° = -2.58 In K 


If we convert this equation from natural log (In) to the more commonly 
used base-10 logarithm (log), we get 


AG° = -5.94 log K 


This equation reveals how the equilibrium ratio of Y to X, expressed as 
the equilibrium constant K, depends on the intrinsic character of the 
molecules, as expressed in the value of AG°. Thus, for the reaction we 
presented, Y — X, where K = 10, AG? = -5.94 kJ/mole. In fact, for every 
5.94 kJ/mole difference in free energy at 37°C, the equilibrium constant 
for a reaction changes by a factor of 10, as shown in Table 3-1. Thus, the 
more energetically favorable the reaction, the more product will accu- 
mulate when the reaction proceeds to equilibrium. For a reaction with a 
AG” of -17.8 kJ/mole, K will equal 1000, which means that at equilibrium, 
there will be 1000 molecules of product for every molecule of substrate 
present. 


In Complex Reactions, the Equilibrium Constant Includes 
the Concentrations of All Reactants and Products 


We have so far discussed the simplest of reactions, Y > X, in which a sin- 
gle substrate is converted into a single product. But inside cells, it is more 
common for two reactants to combine to form a single product: A + B > 
AB. How can we predict how this reaction will proceed? 


The same principles apply, except that in this case the equilibrium con- 
stant K includes the concentrations of both of the reactants, in addition 


to the concentration of the product: 
K = [AB]/[A][B] 


The concentrations of both reactants are multiplied in the denominator 
because the formation of product AB depends on the collision of A and 
B, and these encounters occur at a rate that is proportional to [A] x [B] 
(Figure 3-19). As with single-substrate reactions, AG° = -5.94 log K at 
37°C. Thus, the relationship between K and AG? is the same as that 
shown in Table 3-1. 


The Equilibrium Constant Also Indicates the Strength of 
Noncovalent Binding Interactions 


The concept of free-energy change does not apply only to chemical reac- 
tions where covalent bonds are being broken and formed. It is also used 
to quantitate the strength of interactions in which one molecule binds to 
another by means of noncovalent interactions (discussed in Chapter 2, 
p. 48). Two molecules will bind to each other if the free-energy change for 
the interaction is negative; that is, the free energy of the resulting com- 
plex is lower than the sum of the free energies of the two partners when 
unbound. Noncovalent interactions are immensely important to cells. 
They include the binding of substrates to enzymes, the binding of tran- 
scription regulators to DNA, and the binding of one protein to another to 
make the many different structural and functional protein complexes that 
operate in a living cell. 


The equilibrium constant, K, used to describe reactions in which cova- 
lent bonds are formed and broken, also reflects the binding strength of a 
noncovalent interaction between two molecules. This binding strength 
is a very useful quantity because it indicates how specific the interaction 
is between the two molecules. When molecule A binds to molecule B 
to form the complex AB, the reaction proceeds until it reaches equilib- 
rium. At which point the number of association events precisely equals 
the number of dissociation events; at this point, the concentrations of 
reactants A and B, and of the complex AB, can be used to determine the 
equilibrium constant K (see Figure 3-19). 


K becomes larger as the binding energy—that is, the energy released in 
the binding interaction—increases. In other words, the larger K is, the 
greater is the drop in free energy between the dissociated and associ- 
ated states, and the more tightly the two molecules will bind. Even a 
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Figure 3-19 The equilibrium constant, 
K, for the reaction A + B — AB depends 
on the concentrations of A, B, and AB. 
Molecules A and B must collide in order 
to interact, and the association rate is 
therefore proportional to the product of 
their individual concentrations [A] x [B]. As 
shown, the ratio of the rate constants kon 
and kot for the association (bond formation) 
and the dissociation (bond breakage) 
reactions, respectively, is equal to the 
equilibrium constant, K. 
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CHAPTER 3 


Consider 1000 molecules of A and 1000 
molecules of B in the cytosol of a eukaryotic 
cell. The concentration of both will be 
about 10° M. 

If the equilibrium constant (K) for 
A + B —> AB is 10" liters/mole, then at 
equilibrium there will be 


270 270 730. 
AB 
molecules molecules complexes 


If the equilibrium constant is a little weaker, 
say 10° liters/mole—a value that represents a 
loss of 11.9 kJ/mole of binding energy from 
the example above, or 2-3 fewer hydrogen 
bonds—then there will be 
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Figure 3-20 Small changes in the 
number of weak bonds can have drastic 
effects on a binding interaction. This 
example illustrates the dramatic effect of 
the presence or absence of a few weak 
noncovalent bonds in the interaction 
between two cytosolic proteins. 


Figure 3-21 An energetically unfavorable 
reaction can be driven by an energetically 
favorable follow-on reaction that acts 

as a chemical siphon. (A) At equilibrium, 
there are twice as many X molecules as Y 
molecules. (B) At equilibrium, there are 25 
times more Z molecules than Y molecules. 
(C) If the reactions in (A) and (B) are 
coupled, nearly all of the X molecules will 
be converted to Z molecules, as shown. In 
terms of energetics, the AG? of the Y > Z 
reaction is so negative that, when coupled 
to the X > Y reaction, it lowers the AG of 
X — Y. This is because the AG of X > Y 
decreases as the ratio of Y to X declines 
(see Figure 3-18). 
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change of a few noncovalent bonds can have a striking effect on a bind- 
ing interaction, as illustrated in Figure 3-20. In this example, a loss of 
11.9 kJ/mole of binding energy, equivalent to eliminating a few hydro- 
gen bonds from a binding interaction, can be seen to cause a dramatic 
decrease in the amount of complex that exists at equilibrium. 


For Sequential Reactions, the Changes in Free Energy 
Are Additive 


Now we return to our original concern regarding how cells can generate 
and maintain order. And more specifically: how can enzymes catalyze 
reactions that are energetically unfavorable? 


One way they do so is by directly coupling energetically unfavorable 
reactions with energetically favorable ones. Consider, for example, two 
sequential reactions, 


X—YandY>Z 


where the AG° values are +21 and -54 kJ/mole, respectively. (Recall that 
a mole is 6 x 102° molecules of a substance.) The unfavorable reaction, 
X — Y, will not occur spontaneously. However, it can be driven by the 
favorable reaction Y — Z, provided that the second reaction follows the 
first. That’s because the overall free-energy change for the coupled reac- 
tion is equal to the sum of the free-energy changes for each individual 
reaction. In this case, the AG” for the coupled reaction, X — Y — Z, will be 
-33 kJ/mole, making the overall pathway energetically favorable. 


Cells can therefore cause the energetically unfavorable transition, X —> Y, 
to occur if an enzyme catalyzing the X — Y reaction is supplemented by a 
second enzyme that catalyzes the energetically favorable reaction, Y > Z. 
In effect, the reaction Y > Z acts as a “siphon,” pulling the conversion of 
all of molecule X to molecule Y, and then to molecule Z (Figure 3-21). 
Several of the reactions in the long pathway that converts sugars into 
CO and H20 are energetically unfavorable. This pathway nevertheless 
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Figure 3-22 The cytosol is crowded with various molecules. 
Only the macromolecules, which are drawn to scale and 
displayed in different colors, are shown. Enzymes and other 
macromolecules diffuse relatively slowly in the cytosol, in part 
because they interact with so many other macromolecules. Small 
molecules, by contrast, can diffuse nearly as rapidly as they do 
in water (see Movie 1.2). (From S.R. McGuffee and A.H. Elcock, 
PLoS Comput. Biol. 6(3): 1000694, 2010.) 


proceeds rapidly to completion because the total AG° for the series of 
sequential reactions has a large negative value. 


Forming a sequential pathway, however, is not the answer for many 
other metabolic needs. Often the desired reaction is simply X — Y, with- 
out further conversion of Y to some other product. Fortunately, there are 
other, more general ways of using enzymes to couple reactions together, 
involving the production of activated carriers that can shuttle energy 
from one reaction site to another, as we discuss shortly. 


Enzyme-catalyzed Reactions Depend on Rapid 
Molecular Collisions 


Thus far we have talked about chemical reactions as if they take place 
in isolation. But the cytosol of a cell is densely packed with molecules of 
various shapes and sizes (Figure 3-22). So how do enzymes and their 
substrates, which are present in relatively small amounts in the cytosol 
of a cell, manage to find each other? And how do they do it so quickly? 
Observations indicate that a typical enzyme can capture and process 
about a thousand substrate molecules every second. 


Rapid binding is possible because molecular motions are enormously 
fast—very much faster than the human mind can easily imagine. Because 
of heat energy, molecules are in constant motion and consequently 
will explore the cytosolic space very efficiently by wandering randomly 
through it—a process called diffusion. In this way, every molecule in the 
cytosol collides with a huge number of other molecules each second. 
As these molecules in solution collide and bounce off one another, an 
individual molecule moves first one way and then another, its path con- 
stituting a random walk (Figure 3-23). 


Although the cytosol of a cell is densely packed with molecules of vari- 
ous shapes and sizes, experiments in which fluorescent dyes and other 
labeled molecules are injected into the cell cytosol show that small 
organic molecules diffuse through this aqueous gel nearly as rapidly as 
they do through water. A small organic molecule, such as a substrate, 
takes only about one-fifth of a second on average to diffuse a distance of 
10 um. Diffusion is therefore an efficient way for small molecules to move 
limited distances in the cell. 


Because proteins diffuse through the cytosol much more slowly than do 
small molecules, the rate at which an enzyme will encounter its sub- 
strate depends on the concentration of the substrate. The most abundant 
substrates are present in the cell at a concentration of about 0.5 mM. 
Because pure water is 55 M, there is only about one such substrate mol- 
ecule in the cell for every 10° water molecules. Nevertheless, the site on 
an enzyme that binds this substrate will be bombarded by about 500,000 
random collisions with the substrate every second! For a substrate con- 
centration tenfold lower (0.05 mM), the number of collisions drops to 
50,000 per second, and so on. These incredibly numerous collisions play 
a critical role in life’s chemistry. 
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QUESTION 3-4 


For the reactions shown in Figure 
3-21, sketch an energy diagram 
similar to that in Figure 3—12 for 
the two reactions alone and for 
the combined reactions. Indicate 
the standard free-energy changes 
for the reactions X — Y, Y > Z, 
and X — Z in the graph. Indicate 
how enzymes that catalyze these 
reactions would change the energy 
diagram. 


net distance 
traveled 


Figure 3-23 A molecule traverses 

the cytosol by taking a random walk. 
Molecules in solution move in a random 
fashion due to the continual buffeting they 
receive in collisions with other molecules. 
This movement allows small molecules to 
diffuse rapidly throughout the cell cytosol 
(Movie 3.2). 
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The enzyme carbonic anhydrase 
is one of the speediest enzymes 
known. It catalyzes the rapid 
conversion of CO2 gas into the 
much more soluble bicarbonate ion 
(HCO3-). The reaction: 

CO2 + H20 — HCO3 + Abe 
is very important for the efficient 
transport of CO2 from tissues, where 
COz is produced by respiration, 
to the lungs, where it is exhaled. 
Carbonic anhydrase accelerates the 
reaction 107-fold, hydrating 10° CO2 
molecules per second at its maximal 
speed. What do you suppose limits 
the speed of the enzyme? Sketch 
a diagram analogous to the one 
shown in Figure 3-13 and indicate 
which portion of your diagram has 
been designed to display the 
107-fold acceleration. 


QUESTION 3-6 


In cells, an enzyme catalyzes 

the reaction AB — A + B. It was 
isolated, however, as an enzyme that 
carries out the opposite reaction 

A + B — AB. Explain the paradox. 
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QUESTION 3-5 


Noncovalent Interactions Allow Enzymes to Bind Specific 
Molecules 


The first step in any enzyme-catalyzed chemical reaction is the bind- 
ing of the substrate. Once this step has taken place, the substrate must 
remain bound to the enzyme long enough for the chemistry to occur. 
The association of enzyme and substrate is stabilized by the formation of 
multiple, weak bonds between the participating molecules. These weak 
interactions—which can include hydrogen bonds, van der Waals attrac- 
tions, and electrostatic attractions (discussed in Chapter 2)—persist until 
random thermal motion causes the molecules to dissociate again. 


When two colliding molecules have poorly matching surfaces, few non- 
covalent bonds are formed, and their total energy is negligible compared 
with that of thermal motion. In this case, the two molecules dissociate 
as rapidly as they come together (see Figure 2-35). As we saw in Figure 
3-20, even small changes in the number of noncovalent bonds made 
between two interacting molecules can have a dramatic effect on their 
ability to form a complex. Poor noncovalent bond formation is what 
prevents unwanted associations from forming between mismatched 
molecules, such as those between an enzyme and the wrong substrate. 
Only when the enzyme and substrate are well matched do they form 
many weak interactions. It is these numerous noncovalent bonds that 
keep them together long enough for a covalent bond in the substrate 
molecule to be formed or broken, converting substrate to product. 


Enzymes are remarkable catalysts, capturing substrates and releas- 
ing products in mere milliseconds. But though an enzyme can lower 
the activation energy for a reaction, such as Y — X (see Figure 3-12), 
it is important to note that the same enzyme will also lower the activa- 
tion energy for the reverse reaction X — Y to exactly the same degree. 
That’s because the same noncovalent bonds are formed with the enzyme 
whether the reaction goes forward or backward. The forward and back- 
ward reactions will therefore be accelerated by the same factor by an 
enzyme, and the equilibrium point for the reaction—and thus its AG°— 
remains unchanged (Figure 3-24). 
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Figure 3-24 Enzymes cannot change the equilibrium point for reactions. 
Enzymes, like all catalysts, speed up the forward and reverse rates of a reaction by 
the same amount. Therefore, for both the (A) uncatalyzed and (B) catalyzed reactions 
shown here, the number of molecules undergoing the transition Y —> X is equal 

to the number of molecules undergoing the transition X — Y when the ratio of X 
molecules to Y molecules is 7 to 1, as illustrated. In other words, both the catalyzed 
and uncatalyzed reactions will eventually reach the same equilibrium point, although 
the catalyzed reaction will reach equilibrium much faster. 


ACTIVATED CARRIERS AND BIOSYNTHESIS 


Much of the energy released by an energetically favorable reaction such as 
the oxidation of a food molecule must be stored temporarily before it can 
be used by cells to fuel energetically unfavorable reactions, such as the 
synthesis of all the other molecules needed by the cell. In most cases, the 
energy is stored as chemical-bond energy in a set of activated carriers, 
small organic molecules that contain one or more energy-rich covalent 
bonds. These molecules diffuse rapidly and carry their bond energy from 
the sites of energy generation to the sites where energy is used either for 
biosynthesis or for the many other energy-requiring activities that a cell 
must perform (Figure 3-25). In a sense, cells use activated carriers like 
money to pay for the energetically unfavorable reactions that otherwise 
would not take place. 


Activated carriers store energy in an easily exchangeable form, either as 
a readily transferable chemical group or as readily transferable (“high- 
energy”) electrons. They can serve a dual role as a source of both energy 
and chemical groups for biosynthetic reactions. As we shall discuss 
shortly, the most important activated carriers are ATP and two molecules 
that are close chemical cousins, NADH and NADPH. 


An understanding of how cells transform the energy locked in food mol- 
ecules into a form that can be used to do work required the dedicated 
effort of the world’s finest chemists (How We Know, pp. 102-103). Their 
discoveries, amassed over the first half of the twentieth century, marked 
the dawn of the study of biochemistry. 


The Formation of an Activated Carrier Is Coupled to an 
Energetically Favorable Reaction 


When a fuel molecule such as glucose is oxidized inside a cell, enzyme- 
catalyzed reactions ensure that a large part of the free energy released is 
captured in a chemically useful form, rather than being released waste- 
fully as heat. When your cells oxidize the sugar from a chocolate bar, 
that energy allows you to power metabolic reactions; burning that same 
chocolate bar in the street will get you nowhere, warming the environ- 
ment while producing no metabolically useful energy. 


In cells, energy capture is achieved by means of a special form of coupled 
reaction, in which an energetically favorable reaction is used to drive an 
energetically unfavorable one, so that an activated carrier or some other 
useful molecule is produced. Such coupling requires enzyme catalysis, 
which is fundamental to all of the energy transactions in the cell. 


ENERGY 


ENERGY 


inactive carrier 


=a 
ae 


activated carrier 


energetically 
favorable 
reaction 


energetically 
unfavorable 
reaction 


ANABOLISM 


oxidized food 
molecule 


CATABOLISM 


Activated Carriers and Biosynthesis 


Figure 3-25 Activated carriers can store 
and transfer energy in a form that cells 
can use. By serving as intracellular energy 
shuttles, activated carriers perform their 
function as go-betweens that link the 
release of energy from the breakdown of 
food molecules (catabolism) to the energy- 
requiring biosynthesis of small and large 
organic molecules (anabolism). 
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” HOW WE KNOW 


“HIGH-ENERGY” PHOSPHATE BONDS POWER CELL PROCESSES 


Cells require a continuous stream of energy to gener- 
ate and maintain order, while acquiring the materials 
they need to survive, grow, and reproduce. But even as 
late as 1921, very little was known about how energy— 
which for animal cells is derived from the breakdown 
of nutrients—is biochemically transformed, stored, and 
released for work in the cell. It would take the efforts of 
a handful of biochemists, many of whom worked with 
Otto Meyerhof—a pioneer in the field of cell metabo- 
lism—to get a handle on this fundamental problem. 


Muscling in 


Meyerhof was trained as a physician in Heidelberg, 
Germany, and he had a strong interest in physiologi- 
cal chemistry; in particular, he wondered how energy 
is transformed during chemical reactions in cells. He 
recognized that between its initial entry in the form of 
food and its final dissipation as heat, a large amount of 
energy must be made available by a series of interme- 
diate chemical steps that allow the cell or organism to 
maintain itself in a state of dynamic equilibrium. 


To explore how these mysterious chemical transforma- 
tions power the work done by cells, Meyerhof focused 
his attention on muscle. Muscle tissue could be isolated 
from an animal, such as a frog, and stimulated to con- 
tract with a pulse of electricity. And contraction provided 
a dramatic demonstration of the conversion of energy to 
a usable, mechanical form. 


When Meyerhof got started, all that was known about the 
chemistry of contraction is that, in active muscle tissue, 
lactic acid is generated by a process of fermentation. As 
Meyerhof's first order of business, he demonstrated that 
this lactic acid comes from the breakdown of glycogen— 
a branched polymer made of glucose units that serves 
as an energy store in animal cells, particularly in muscle 
(see Panel 2-4, p. 73). 


While Meyerhof focused on the chemistry, English phys- 
iologist Archibald “A.V.” Hill determined that working 
muscles give off heat, both as they contract and as they 
recover; further, he found that the amount of heat cor- 
relates with how hard the muscle is working. 


Hill and Meyerhof then showed that the heat produced 
during muscle relaxation was linked to the resynthesis 
of glycogen. A portion of the lactic acid made by the 
muscle would be completely oxidized to CO2 and water, 
and the energy from this oxidative breakdown would 
be used to convert the remaining lactic acid back to 
glycogen. This conversion of glycogen to lactic acid— 
and back again—provided the first evidence of cyclical 
energy transformation in cells (Figure 3-26). And in 
1922, it earned Meyerhof and Hill a Nobel Prize. 
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Figure 3-26 A "lactic acid cycle” was thought to supply the 
energy needed to power muscle contraction. Preparations 
of frog muscle were stimulated to contract while being held at 
constant length (isometric contraction). As shown, contraction 
was accompanied by the breakdown of glycogen and the 
formation of lactic acid. The energy released by this oxidation 
was thought to somehow power muscle contraction. Lactic acid 
is converted back to glycogen as the muscle recovers. 


In the mail 


But did the conversion of glycogen into lactic acid 
directly power the mechanical work of muscle contrac- 
tion? Meyerhof had thought so—until 1927, when a letter 
arrived from Danish physiologist Einar Lundsgaard. In 
it, Lundsgaard told Meyerhof of the surprising results of 
some experiments he had performed both on isolated 
muscles and in living rabbits and frogs. Lundsgaard had 
injected muscles with iodoacetate, a compound that 
inhibits an enzyme involved in the breakdown of sugars 
(as we discuss in Chapter 13). In these iodoacetate- 
treated muscles, fermentation was blocked and no lactic 
acid could be made. 


What Lundsgaard discovered was that the poisoned 
muscles continued to contract. Indeed, animals injected 
with the compound at first “behaved quite normally,” 
wrote Fritz Lipmann, a biochemist who was working 
in Meyerhof'’s laboratory. But after a few minutes, they 
suddenly keeled over, their muscles frozen in rigor. 


But if the formation of lactic acid was not providing fuel 
for muscle contraction, what was? Lundsgaard went on 
to show that the source of energy for muscle contraction 
in poisoned muscles appeared to be a recently discov- 
ered molecule called creatine phosphate. When lactic 
acid formation was blocked by iodoacetate, muscle con- 
traction was accompanied by the hydrolysis of creatine 
phosphate. When the supply of creatine phosphate was 
exhausted, muscles seized up permanently. 


“The turmoil that this news created in Meyerhof's labo- 
ratory is difficult to realize today,” wrote Lipmann. The 
finding contradicted Meyerhof's theory that lactic acid 
formation powered muscle contraction. And it pointed 
toward not just an alternative molecule, but a whole new 
idea: that certain phosphate bonds, when hydrolyzed, 
could provide energy. “Lundsgaard had discovered that 
the muscle machine can be driven by phosphate-bond 
energy, and he shrewdly realized that this type of energy 
was ‘nearer,’ as he expressed it, to the conversion of 
metabolic energy into mechanical energy than lactic 
acid,” wrote Lipmann. 


But rather than being upset, Meyerhof welcomed 
Lundsgaard to his lab in Heidelberg, where he was serv- 
ing as director of the Kaiser Wilhelm Physiology Institute. 
There, Lundsgaard made very careful measurements 
showing that the breakdown of creatine phosphate— 
and the heat it generated—closely tracked the amount 
of tension generated by intact muscle. 


The most direct conclusion that could be drawn from 
these observations is that the hydrolysis of creatine 
phosphate supplied the energy that powers muscle 
contraction. But in one of his papers published in 1930, 
Lundsgaard was careful to note that there was another 
possibility: that in normal muscle, both lactic acid for- 
mation and creatine phosphate hydrolysis transferred 
energy to a third, yet-to-be identified system. This is 
where ATP comes in. 


Squiggle P 


Even before Lundsgaard’s eye-opening observations, 
Meyerhof had an interest in the amount of energy 
contained in various metabolic compounds, particu- 
larly those that contained phosphate. He thought that 
metabolic energy sources might be identified by finding 
naturally occurring molecules that release unusually 
large amounts of heat when hydrolyzed. Creatine phos- 
phate was one of those compounds. Another was ATP, 
which had been discovered in 1929—by Meyerhof’s 
assistant, chemist Karl Lohmann, and, at the same time, 
by biochemists Cyrus Fiske and Yellapragada Subbarow 
working in America. 


By 1935, Lohmann had demonstrated that the hydrolytic 
breakdown of creatine phosphate occurs through the 
transfer of its phosphate group to ADP to form ATP. It is 
the hydrolysis of ATP that serves as the direct source of 
energy for muscle contraction; creatine phosphate pro- 
vides a reservoir of “high-energy” phosphate groups that 
replenish depleted ATP and maintain the needed ratio of 
ATP to ADP (Figure 3-27). 


In 1941, Lipmann published a 63-page review in the 
inaugural issue of Advances in Enzymology. Entitled “The 
metabolic generation and utilization of phosphate bond 
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Figure 3—27 Creatine phosphate serves as an intermediate 
energy store. An enzyme called creatine kinase transfers a 
phosphate group from creatine phosphate to ADP when ATP 
concentrations are low; the same enzyme can catalyze the 
reverse reaction to generate a pool of creatine phosphate 
when ATP concentrations are high. Here, the “high-energy” 
phosphate bonds are symbolized by ~P. AMP is adenosine 
monophosphate (see Figure 3-41). 


energy,” this article introduced the symbol ~P (or “squig- 
gle P”) to denote an energy-rich phosphate bond—one 
whose hydrolysis yields enough energy to drive ener- 
getically unfavorable reactions and processes (Figure 
3-28). 


Although several molecules contain such high-energy 
phosphate bonds (see Panel 3-1, p. 95), it is the hydrol- 
ysis of ATP that provides the driving force for most 
of the energy-requiring reactions in living systems, 
including the contraction of muscles, the transport of 
substances across membranes, and the synthesis of 
macromolecules including proteins, nucleic acids, and 
carbohydrates. Indeed, in a memorial written after 
the death of Meyerhof in 1951, Lipmann—who would 
shortly win his own Nobel Prize for work on a differ- 
ent activated carrier—wrote: “The discovery of ATP thus 
was the key that opened the gates to the understanding 
of the conversion mechanisms of metabolic energy.” 
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Figure 3—28 High-energy phosphate bonds generate an 
energy current (red) that powers cell reactions. This diagram, 
modeled on a figure published in Lipmann’s 1941 article in 
Advances in Enzymology, shows how energy released by the 
metabolism of food molecules (represented by the “metabolic 
wheel”) is captured in the form of high-energy phosphate bonds 
(~P) of ATP, which are used to power all other cell reactions. After 
the high-energy bonds are hydrolyzed, the inorganic phosphate 
released is recycled and reused, as indicated. 
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Figure 3—29 A mechanical model 
illustrates the principle of coupled 
chemical reactions. (A) The spontaneous 
reaction shown could serve as an analogy 
for the direct oxidation of glucose to CO2 
and H20, which produces only heat. 

(B) The same reaction is coupled to a 
second reaction, which could serve as 

an analogy for the synthesis of activated 
carriers. (C) The energy produced in (B) is 
in a more useful form than in (A) and can 
be used to drive a variety of otherwise 
energetically unfavorable reactions. 


QUESTION 3-7 


Use Figure 3—29B to illustrate the 
following reaction driven by the 
hydrolysis of ATP: 

X + ATP — Y + ADP + P; 
A. In this case, which molecule or 
molecules would be analogous to 
(i) rocks at the top of the cliff, 
(ii) broken debris at the bottom of 
the cliff, (iii) the bucket at its highest 
point, and (iv) the bucket on the 
ground? 
B. What would be analogous to 
(i) the rocks hitting the ground in 
the absence of the paddle wheel in 
Figure 3—29A and (ii) the hydraulic 
machine in Figure 3—29C? 
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hydraulic 
machine 


part of the kinetic energy is used to lift 
a bucket of water, and a correspondingly 
smaller amount is transformed into heat 


the potential energy stored in the 
raised bucket of water can be used to 
drive hydraulic machines that carry out 
a variety of useful tasks 


To provide an everyday representation of how coupled reactions work, 
let’s consider a mechanical analogy in which an energetically favorable 
chemical reaction is represented by rocks falling from a cliff. The kinetic 
energy of falling rocks would normally be entirely wasted in the form of 
heat generated by friction when the rocks hit the ground (Figure 3—29A). 
By careful design, however, part of this energy could be used to drive a 
paddle wheel that lifts a bucket of water (Figure 3—29B). Because the 
rocks can now reach the ground only after moving the paddle wheel, 
we Say that the energetically favorable reaction of rocks falling has been 
directly coupled to the energetically unfavorable reaction of lifting the 
bucket of water. Because part of the energy is used to do work in (B), the 
rocks hit the ground with less velocity than in (A), and correspondingly 
less energy is wasted as heat. The energy saved in the elevated bucket of 
water can then be used to do useful work (Figure 3—29C). 


Analogous processes occur in cells, where enzymes play the role of the 
paddle wheel in Figure 3-29B. By mechanisms that we discuss in Chapter 
13, enzymes couple an energetically favorable reaction, such as the oxi- 
dation of food molecules, to an energetically unfavorable reaction, such 
as the generation of activated carriers. As a result, the amount of heat 
released by the oxidation reaction is reduced by exactly the amount of 
energy that is stored in the energy-rich covalent bonds of the activated 
carrier. That saved energy can then be used to power a chemical reaction 
elsewhere in the cell. 


ATP Is the Most Widely Used Activated Carrier 


The most important and versatile of the activated carriers in cells is ATP 
(adenosine 5’-triphosphate). Just as the energy stored in the raised bucket 
of water in Figure 3-29B can be used to drive a wide variety of hydraulic 
machines, ATP serves as a convenient and versatile store, or currency, of 
energy that can be used to drive a variety of chemical reactions in cells. 
As shown in Figure 3-30, ATP is synthesized in an energetically unfa- 
vorable phosphorylation reaction, in which a phosphate group is added 
to ADP (adenosine 5'-diphosphate). When required, ATP gives up this 
energy packet in an energetically favorable hydrolysis to ADP and inor- 
ganic phosphate (Pj). The regenerated ADP is then available to be used 
for another round of the phosphorylation reaction that forms ATP, creat- 
ing an ATP cycle in the cell. 


i ATP 
phosphoanhydride bonds 


O-CH, 


energy from energy available 


sunlight or from. | AG° > 0 AG° < 0 | to drive energetically 
the breakdown unfavorable 
of food reactions 
o + O-CH, 
inorganic 
phosphate (®) 


The large negative AG? of the ATP hydrolysis reaction arises from a 
number of factors. Release of the terminal phosphate group removes an 
unfavorable repulsion between adjacent negative charges; in addition, 
the inorganic phosphate ion (Pj) released is stabilized by favorable hydro- 
gen-bond formation with water. 


The energetically favorable reaction of ATP hydrolysis is coupled to 
many otherwise unfavorable reactions through which other molecules 
are synthesized. We will encounter several of these reactions in this 
chapter, where we will see exactly how this coupling is carried out. ATP 
hydrolysis is often accompanied by a transfer of the terminal phosphate 
in ATP to another molecule, as illustrated in Figure 3-31. Any reaction 
that involves the transfer of a phosphate group to a molecule is termed 
a phosphorylation reaction. Phosphorylation reactions are examples of 
condensation reactions (see Figure 2-19), and they occur in many impor- 
tant cell processes: they activate substrates for a subsequent reaction, 
mediate movement, and serve as key constituents of intracellular signal- 
ing pathways (discussed in Chapter 16). 


ATP is the most abundant activated carrier in cells. It is used to sup- 
ply energy for many of the pumps that actively transport substances into 
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Figure 3-30 The interconversion of 

ATP and ADP occurs in a cycle. The two 
outermost phosphate groups in ATP are 
held to the rest of the molecule by “high- 
energy” phosphoanhydride bonds and 

are readily transferred to other organic 
molecules. Water can be added to ATP to 
form ADP and inorganic phosphate (P). 
Inside a cell, this hydrolysis of the terminal 
phosphate of ATP yields between 46 and 
54 kJ/mole of usable energy. (Although the 
AG° of this reaction is -30.5 kJ/mole, its AG 
inside cells is much more negative, because 
the ratio of ATP to the products ADP and P; 
is kept so high.) 

The formation of ATP from ADP and P; 
reverses the hydrolysis reaction; because 
this condensation reaction is energetically 
unfavorable, it must be coupled to a highly 
energetically favorable reaction to occur. 


QUESTION 3-8 


The phosphoanhydride bond that 
links two phosphate groups in ATP 
in a high-energy linkage has a AG? 
of -30.5 kJ/mole. Hydrolysis of this 
bond in a cell liberates from 46 to 
54 kJ/mole of usable energy. How 
can this be? Why do you think a 
range of energies is given, rather 
than a precise number as for AG°? 


Figure 3—31 The terminal phosphate 

of ATP can be readily transferred to 
other molecules. Because an energy- 

rich phosphoanhydride bond in ATP 

is converted to a less energy-rich 
phosphoester bond in the phosphate- 
accepting molecule, this reaction is 
energetically favorable, having a large 
negative AG? (see Panel 3-1, pp. 94-95). 
Phosphorylation reactions of this type are 
involved in the synthesis of phospholipids 
and in the initial steps of the breakdown of 
sugars, as well as in many other metabolic 
and intracellular signaling pathways. 
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Figure 3-32 An energetically unfavorable 
biosynthetic reaction can be driven by 
ATP hydrolysis. (A) Schematic illustration of 
the condensation reaction described in the 
text. In this set of reactions, a phosphate 
group is first donated by ATP to form a 
high-energy intermediate, A-O-POs, 
which then reacts with the other substrate, 
B-H, to form the product A-B. (B) Reaction 
showing the biosynthesis of the amino acid 
glutamine from glutamic acid. Glutamic 
acid, which corresponds to the A-OH 
shown in (A), is first converted to a high- 
energy phosphorylated intermediate, which 
corresponds to A-O-PO3, This intermediate 
then reacts with ammonia (which 
corresponds to B-H) to form glutamine. 

In this example, both steps occur on the 
surface of the same enzyme, glutamine 
synthetase (not shown). ATP hydrolysis can 
drive this energetically unfavorable reaction 
because it produces a favorable free-energy 
change (AG° of -30.5 kJ/mole) that is larger 
in magnitude than the energy required for 
the synthesis of glutamine from glutamic 
acid plus NH3 (AG° of +14.2 kJ/mole). For 
clarity, the glutamic acid side chain is shown 
in its uncharged form. 
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or out of the cell (discussed in Chapter 12) and to power the molecular 
motors that enable muscle cells to contract and nerve cells to transport 
materials along their lengthy axons (discussed in Chapter 17), to name 
just two important examples. Why evolution selected this particular 
nucleoside triphosphate over the others as the major carrier of energy, 
however, remains a mystery. GTP, although chemically similar to ATP, is 
involved in a different set of functions in the cell, as we discuss in later 
chapters. 


Energy Stored in ATP Is Often Harnessed to Join Two 
Molecules Together 


A common type of reaction that is needed for biosynthesis is one in which 
two molecules, A and B, are joined together by a covalent bond to pro- 
duce A-B in an energetically unfavorable condensation reaction: 


A-OH + B-H — A-B + H20 


ATP hydrolysis can be coupled indirectly to this reaction to make it go 
forward. In this case, energy from ATP hydrolysis is first used to con- 
vert A-OH to a higher-energy intermediate compound, which then reacts 
directly with B-H to give A-B. The simplest mechanism involves the 
transfer of a phosphate from ATP to A-OH to make A-O-PO3, in which 
case the reaction pathway contains only two steps (Figure 3—32A). The 
condensation reaction, which by itself is energetically unfavorable, has 
been forced to occur by being coupled to ATP hydrolysis in an enzyme- 
catalyzed reaction pathway. 


A biosynthetic reaction of exactly this type is employed to synthesize the 
amino acid glutamine, as illustrated in Figure 3—32B. We will see later 
in the chapter that very similar (but more complex) mechanisms are also 
used to produce nearly all of the large molecules of the cell. 


NADH and NADPH Are Both Activated Carriers of 
Electrons 


Other important activated carriers participate in oxidation-reduction 
reactions and are also commonly part of coupled reactions in cells. These 
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activated carriers are specialized to carry both high-energy electrons and 
hydrogen atoms. The most important of these electron carriers are NADH 
(nicotinamide adenine dinucleotide) and the closely related molecule 
NADPH (nicotinamide adenine dinucleotide phosphate). Both NADH and 
NADPH carry energy in the form of two high-energy electrons plus a pro- 
ton (H+), which together form a hydride ion (H-). When these activated 
carriers pass their hydride ion to a donor molecule, they become oxidized 
to form NAD* and NADP*, respectively. 


reduced electron 
carrier 


Like ATP, NADPH is an activated carrier that participates in many 
important biosynthetic reactions that would otherwise be energetically 
unfavorable. NADPH is produced according to the general scheme shown 
in Figure 3-33. During a special set of energy-yielding catabolic reac- 
tions, a hydride ion is removed from the substrate molecule and added to 
the nicotinamide ring of NADP* to form NADPH. This is a typical oxida- 
tion-reduction reaction: the substrate is oxidized and NADP* is reduced. 


The hydride ion carried by NADPH is given up readily in a subsequent 
oxidation-reduction reaction, because the nicotinamide ring can achieve 
a more stable arrangement of electrons without it (Figure 3-34). In this 
subsequent reaction, which regenerates NADP*, the NADPH becomes 
oxidized and the substrate becomes reduced—thus completing the 
NADPH cycle (see Figure 3-33). NADPH is efficient at donating its hydride 
ion to other molecules for the same reason that ATP readily transfers a 
phosphate: in both cases, the transfer is accompanied by a large negative 
free-energy change. One example of the use of NADPH in biosynthesis is 
shown in Figure 3-35. 
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Figure 3-33 NADPH is an activated 
carrier of electrons that participates in 
oxidation-reduction reactions. NADPH is 
produced in reactions of the general type 
shown on the left, in which two electrons 
are removed from a substrate (A-H). The 
oxidized form of the carrier molecule, 
NADP?, receives these two electrons as one 
hydrogen atom plus an electron (a hydride 
ion). Because NADPH holds its hydride ion 
in a high-energy linkage, this ion can easily 
be transferred to other molecules, such as 
B, as shown on the right. In this reaction, 
NADPH is re-oxidized to yield NADP*, thus 
completing the cycle. 


oxidized electron carrier 


N 


NH, 


Figure 3-34 NADPH accepts and 
donates electrons via its nicotinamide 
ring. NADPH donates its high-energy 
electrons together with a proton (the 
equivalent of a hydride ion, H7). This 
reaction, which oxidizes NADPH to 
NADP*, is energetically favorable 
because the nicotinamide ring is more 
stable when these electrons are absent. 
The ball-and-stick model on the left 
shows the structure of NADP+. NAD+ 
and NADH are identical in structure 

to NADP* and NADPH, respectively, 
except that they lack the phosphate 
group, as indicated. 
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Figure 3-35 NADPH participates in the final stage of one 
of the biosynthetic routes leading to cholesterol. As in many 
other biosynthetic reactions, the reduction of the C=C bond 

is achieved by the transfer of a hydride ion from the activated 
carrier NADPH, plus a proton (H+) from solution. 


NADPH and NADH Have Different Roles in Cells 


NADPH and NADH differ in a single phosphate group, which is located far 
from the region involved in electron transfer in NADPH (see Figure 3-34). 
Although this phosphate group has no effect on the electron-transfer 
properties of NADPH compared with NADH, it is nonetheless crucial 
for their distinctive roles, as it gives NADPH a slightly different shape 
from NADH. This subtle difference in conformation makes it possible for 
the two carriers to bind as substrates to different sets of enzymes and 
thereby deliver electrons (in the form of hydride ions) to different target 
molecules. 


Why should there be this division of labor? The answer lies in the need 
to regulate two sets of electron-transfer reactions independently. NADPH 
operates chiefly with enzymes that catalyze anabolic reactions, supply- 
ing the high-energy electrons needed to synthesize energy-rich biological 
molecules. NADH, by contrast, has a special role as an intermediate in 
the catabolic system of reactions that generate ATP through the oxida- 
tion of food molecules, as we discuss in Chapter 13. The genesis of NADH 
from NAD+ and that of NADPH from NADP* occurs by different pathways 
that are independently regulated, so that the cell can adjust the supply 
of electrons for these two contrasting purposes. Inside the cell, the ratio 
of NAD* to NADH is kept high, whereas the ratio of NADP* to NADPH is 
kept low. This arrangement provides plenty of NAD* to act as an oxidiz- 
ing agent and plenty of NADPH to act as a reducing agent—as required 
for their special roles in catabolism and anabolism, respectively (Figure 
3-36). 


Cells Make Use of Many Other Activated Carriers 


In addition to ATP (which transfers a phosphate) and NADPH and NADH 
(which transfer electrons and hydrogen), cells make use of other activated 
carriers that pick up and carry a chemical group in an easily transferred, 
high-energy linkage. FADH, like NADH and NADPH, carries hydrogen and 
high-energy electrons (see Figure 13-13B). But other important reactions 
involve the transfers of acetyl, methyl, carboxyl, and glucose groups from 
activated carriers for the purpose of biosynthesis (Table 3-2). Coenzyme 
A, for example, can carry an acetyl group in a readily transferable link- 
age. This activated carrier, called acetyl CoA (acetyl coenzyme A), is 
shown in Figure 3-37. It is used, for example, to sequentially add two- 
carbon units in the biosynthesis of the hydrocarbon tails of fatty acids. 


Figure 3-36 NADPH and NADH have different roles in the 
cell, and the relative concentrations of these carrier molecules 
influence their affinity for electrons. Keeping reduced NADPH 
at a higher concentration than its oxidized counterpart, NADP*, 
makes NADPH a stronger electron donor. This arrangement 
ensures that NADPH can serve as a reducing agent for anabolic 
reactions. The reverse is true for NADH. Cells keep the amount 
of reduced NADH lower than that of NAD*, which makes NAD* 
a better electron acceptor. Thus NAD* acts as an effective 
oxidizing agent, accepting electrons generated during oxidative 
breakdown of food molecules. 


TABLE 3-2 SOME ACTIVATED CARRIERS WIDELY USED IN METABOLISM 
Group Carried in High-Energy Linkage 


ATP phosphate 

NADH, NADPH, FADH2 electrons and hydrogens 
Acetyl CoA acetyl group 
Carboxylated biotin carboxyl group 
S-adenosylmethionine methyl group 

Uridine diphosphate glucose glucose 


In acetyl CoA and the other activated carriers in Table 3-2, the transfer- 
able group makes up only a small part of the molecule. The rest consists 
of a large organic portion that serves as a convenient “handle,” facilitat- 
ing the recognition of the carrier molecule by specific enzymes. As with 
acetyl CoA, this handle portion very often contains a nucleotide. This 
curious fact may be a relic from an early stage of cell evolution. It is 
thought that the main catalysts for early life-forms on Earth were RNA 
molecules (or their close relatives) and that proteins were a later evo- 
lutionary addition. It is therefore tempting to speculate that many of the 
activated carriers that we find today originated in an earlier RNA world, 
where their nucleotide portions would have been useful for binding these 
carriers to RNA-based catalysts, or ribozymes (discussed in Chapter 7). 


Activated carriers are usually generated in reactions coupled to ATP 
hydrolysis, as shown for biotin in Figure 3-38. Therefore, the energy that 
enables their groups to be used for biosynthesis ultimately comes from 
the catabolic reactions that generate ATP. The same principle applies to 
the synthesis of large macromolecules—nucleic acids, proteins, and poly- 
saccharides—as we discuss next. 
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Figure 3-37 Acetyl coenzyme A (CoA) is 
another important activated carrier. 

A ball-and-stick model is shown above the 
structure of acetyl CoA. The sulfur atom 


(orange) forms a thioester bond to acetate. 
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Because the thioester bond is a high-energy 


linkage, it releases a large amount of free 
energy when it is hydrolyzed. Thus the 
acetyl group carried by CoA can be readily 
transferred to other molecules. 
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Figure 3-38 Biotin transfers a carboxyl 
group to a substrate. Biotin is a vitamin 
that is used by a number of enzymes to 
transfer a carboxyl group to a substrate. 
Shown here is the reaction in which biotin, 
held by the enzyme pyruvate carboxylase, 
accepts a carboxyl group from bicarbonate 
and transfers it to pyruvate, producing 
oxaloacetate, a molecule required in the 
citric acid cycle (discussed in Chapter 

13). Other enzymes use biotin to transfer 
carboxyl groups to other molecules. 

Note that the synthesis of carboxylated 
biotin requires energy derived from ATP 


hydrolysis—a general feature that applies to 


many activated carriers. 


Figure 3-39 In cells, macromolecules are 
synthesized by condensation reactions 
and broken down by hydrolysis reactions. 
Condensation reactions are all energetically 
unfavorable, whereas hydrolysis reactions 
are all energetically favorable. 
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The Synthesis of Biological Polymers Requires 
an Energy Input 


The macromolecules of the cell constitute the vast majority of its dry 
mass—that is, the mass not due to water. These molecules are made 
from subunits (or monomers) that are linked together by bonds formed 
during an enzyme-catalyzed condensation reaction. The reverse reac- 
tion—the breakdown of polymers—occurs through enzyme-catalyzed 
hydrolysis reactions. These hydrolysis reactions are energetically favora- 
ble, whereas the corresponding biosynthetic reactions require an energy 
input and are more complex (Figure 3-39). 


The nucleic acids (DNA and RNA), proteins, and polysaccharides are all 
polymers that are produced by the repeated addition of a subunit onto 
one end of a growing chain. The mode of synthesis of each of these 
macromolecules is outlined in Figure 3—40. As indicated, the condensa- 
tion step in each case depends on energy provided by the hydrolysis of a 
nucleoside triphosphate. And yet, except for the nucleic acids, there are 
no phosphate groups left in the final product molecules. How, then, is the 
energy of ATP hydrolysis coupled to polymer synthesis? 


Each type of macromolecule is generated by an enzyme-catalyzed path- 
way that resembles the one discussed previously for the synthesis of 
the amino acid glutamine (see Figure 3-32). The principle is exactly the 
same, in that the -OH group that will be removed in the condensation 
reaction is first activated by forming a high-energy linkage to a second 
molecule. The mechanisms used to link ATP hydrolysis to the synthe- 
sis of proteins and polysaccharides, however, are more complex than 
that used for glutamine synthesis. In the biosynthetic pathways leading 
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H o Figure 3—40 The synthesis of macromolecules requires an input of energy. 
l Wa Synthesis of a portion of (A) a polysaccharide, (B) a nucleic acid, and (C) a protein is 
ae N—C—C shown here. In each case, synthesis involves a condensation reaction in which water 
i i ea is lost; the atoms involved are shaded in pink. Not shown is the consumption of 


protein 


high-energy nucleoside triphosphates that is required to activate each subunit prior 
to its addition. In contrast, the reverse reaction—the breakdown of all three types of 


polymers—occurs through the simple addition of water, or hydrolysis (not shown). 


to these macromolecules, several high-energy intermediates are con- 
sumed in series to generate the final high-energy bond that will be 
broken during the condensation step. One important example of such a 
biosynthetic reaction, that of protein synthesis, is discussed in detail in 
Chapter 7. 


There are limits to what each activated carrier can do in driving bio- 
synthesis. For example, the AG for the hydrolysis of ATP to ADP and 
inorganic phosphate (Pj) depends on the concentrations of all of the reac- 
tants, and under the usual conditions in a cell, it is between -46 and -54 
kJ/mole. In principle, this hydrolysis reaction can be used to drive an 
unfavorable reaction with a AG of, perhaps, +40 kJ/mole, provided that a 
suitable reaction path is available. For some biosynthetic reactions, how- 
ever, even -54 kJ/mole may be insufficient. In these cases, the path of 
ATP hydrolysis can be altered so that it initially produces AMP and pyro- 
phosphate (PP), which is itself then hydrolyzed in solution in a subsequent 
step (Figure 3—41). The whole process makes available a total AG of about 
-109 kJ/mole. The biosynthetic reaction involved in the synthesis of 
nucleic acids (polynucleotides) is driven in this way (Figure 3—42). 


QUESTION 3-9 


Which of the following reactions will 
occur only if coupled to a second, 
energetically favorable reaction? 

A. glucose + O2— CO2 + H20 

B. CO2 + H20 — glucose + O2 

C. nucleoside triphosphates —> 
DNA 

D. nucleotide bases — nucleoside 
triphosphates 

E. ADP + P; — ATP 
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Figure 3-41 In an alternative route for 
the hydrolysis of ATP, pyrophosphate 

is first formed and then hydrolyzed in 
solution. This route releases about twice 
as much free energy as the reaction shown 
earlier in Figure 3-30. (A) In each of the two 
successive hydrolysis reactions, an oxygen 
atom from the participating water molecule 
is retained in the products, whereas the 
hydrogen atoms from water form free 
hydrogen ions, H*. (B) The overall reaction 
shown in summary form. 


Energy, Catalysis, and Biosynthesis 


(A) (B) 
ATP 
adenosine triphosphate (ATP) 
H,O 
HO e 
OO + AMP 
pyrophosphate 
HO adenosine monophosphate (AMP) 
HO 
OH + OH ® + ® 

phosphate phosphate 


ATP will make many appearances throughout the book as a molecule 
that powers reactions in the cell. And in Chapters 13 and 14, we discuss 
how the cell uses the energy from food to generate ATP. In the next chap- 
ter, we learn more about the proteins that make such reactions possible. 
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Figure 3-42 Synthesis of a polynucleotide, RNA or DNA, is a multistep process 
driven by ATP hydrolysis. In the first step, a nucleoside monophosphate is 
activated by the sequential transfer of the terminal phosphate groups from two ATP 
molecules. The high-energy intermediate formed—a nucleoside triphosphate— 
exists free in solution until it reacts with the growing end of an RNA or a DNA 
chain, with release of pyrophosphate. Hydrolysis of the pyrophosphate to inorganic 
phosphate is highly favorable and helps to drive the overall reaction in the direction 
of polynucleotide synthesis. 
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ESSENTIAL CONCEPTS 


e Living organisms are able to exist because of a continual input of 
energy. Part of this energy is used to carry out essential reactions 
that support cell metabolism, growth, movement, and reproduction; 
the remainder is lost in the form of heat. 


e The ultimate source of energy for most living organisms is the sun. 
Plants, algae, and photosynthetic bacteria use solar energy to pro- 
duce organic molecules from carbon dioxide. Animals obtain food by 
eating plants or by eating animals that feed on plants. 


e Each of the many hundreds of chemical reactions that occur in a cell 
is specifically catalyzed by an enzyme. Large numbers of different 
enzymes work in sequence to form chains of reactions, called meta- 
bolic pathways, each performing a different function in the cell. 


e Catabolic reactions release energy by breaking down organic mol- 
ecules, including foods, through oxidative pathways. Anabolic 
reactions generate the many complex organic molecules needed by 
the cell, and they require an energy input. In animal cells, both the 
building blocks and the energy required for the anabolic reactions are 
obtained through catabolic reactions. 


e Enzymes catalyze reactions by binding to particular substrate mol- 
ecules in a way that lowers the activation energy required for making 
and breaking specific covalent bonds. 


e The rate at which an enzyme catalyzes a reaction depends on how 
rapidly it finds its substrates and how quickly the product forms and 
then diffuses away. These rates vary widely from one enzyme to 
another. 


e The only chemical reactions possible are those that increase the 
total amount of disorder in the universe. The free-energy change for 
a reaction, AG, measures this disorder, and it must be less than zero 
for a reaction to proceed spontaneously. 


e The AG for a chemical reaction depends on the concentrations of the 
reacting molecules, and it may be calculated from these concentra- 
tions if the equilibrium constant (K) of the reaction (or the standard 
free-energy change, AG”, for the reactants) is known. 


e Equilibrium constants govern all of the associations (and dissocia- 
tions) that occur between macromolecules and small molecules in 
the cell. The larger the binding energy between two molecules, the 
larger the equilibrium constant and the more likely that these mol- 
ecules will be found bound to each other. 


e By creating a reaction pathway that couples an energetically favora- 
ble reaction to an energetically unfavorable one, enzymes can make 
otherwise impossible chemical transformations occur. Large num- 
bers of such coupled reactions make life possible. 


e A small set of activated carriers, particularly ATP, NADH, and NADPH, 
plays a central part in these coupled reactions in cells. ATP carries 
high-energy phosphate groups, whereas NADH and NADPH carry 
high-energy electrons. 


e Food molecules provide the carbon skeletons for the formation of 
macromolecules. The covalent bonds of these larger molecules are 
produced by condensation reactions that are coupled to energeti- 
cally favorable bond changes in activated carriers such as ATP and 
NADPH. 
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KEY TERMS 
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activation energy 
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anabolism 


biosynthesis 
catabolism 
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cell respiration 
coupled reaction 
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QUESTIONS 


QUESTION 3-10 


Which of the following statements are correct? Explain your 
answers. 


A. Some enzyme-catalyzed reactions cease completely if 
their enzyme is absent. 


B. High-energy electrons (such as those found in the 
activated carriers NADH and NADPH) move faster around 
the atomic nucleus. 


C. Hydrolysis of ATP to AMP can provide about twice as 
much energy as hydrolysis of ATP to ADP. 


D. A partially oxidized carbon atom has a somewhat smaller 
diameter than a more reduced one. 


E. Some activated carrier molecules can transfer both 
energy and a chemical group to a second molecule. 


F. The rule that oxidations release energy, whereas 
reductions require energy input, applies to all chemical 
reactions, not just those that occur in living cells. 


G. Cold-blooded animals have an energetic disadvantage 
because they release less heat to the environment than 
warm-blooded animals do. This slows their ability to make 
ordered macromolecules. 


H. Linking the reaction X — Y to a second, energetically 
favorable reaction Y > Z will shift the equilibrium constant 
of the first reaction. 


QUESTION 3-11 


Consider a transition of X — Y. Assume that the only 
difference between X and Y is the presence of three 
hydrogen bonds in Y that are absent in X. What is the ratio 
of X to Y when the reaction is in equilibrium? Approximate 
your answer by using Table 3—1 (p. 96), with 4.2 kJ/mole 
as the energy of each hydrogen bond. If Y instead has six 
hydrogen bonds that distinguish it from X, how would that 
change the ratio? 
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free energy, G 
free-energy change, AG 
metabolism 
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substrate 


QUESTION 3-12 


Protein A binds to protein B to form a complex, AB. At 
equilibrium in a cell the concentrations of A, B, and AB are 
all at 1 uM. 


A. Referring to Figure 3—19, calculate the equilibrium 
constant for the reaction A + B + AB. 


B. What would the equilibrium constant be if A, B, and 
AB were each present in equilibrium at the much lower 
concentrations of 1 nM each? 


C. How many extra hydrogen bonds would be needed to 
hold A and B together at this lower concentration so that 

a similar proportion of the molecules are found in the AB 
complex? (Remember that each hydrogen bond contributes 
about 4.2 kJ/mole.) 


QUESTION 3-13 


Discuss the following statement: “Whether the AG for a 
reaction is larger, smaller, or the same as AG° depends on 
the concentration of the compounds that participate in the 
reaction.” 


QUESTION 3-14 


A. How many ATP molecules could maximally be generated 
from one molecule of glucose, if the complete oxidation of 
1 mole of glucose to CO2 and H20 yields 2867 kJ of free 
energy and the useful chemical energy available in the high- 
energy phosphate bond of 1 mole of ATP is 50 kJ? 


B. As we will see in Chapter 14 (Table 14-1), respiration 
produces 30 moles of ATP from 1 mole of glucose. Compare 
this number with your answer in part (A). What is the overall 
efficiency of ATP production from glucose? 


C. If the cells of your body oxidize 1 mole of glucose, by 
how much would the temperature of your body (assume 
that your body consists of 75 kg of water) increase if the 
heat were not dissipated into the environment? [Recall that 


a kilocalorie (kcal) is defined as that amount of energy that 
heats 1 kg of water by 1°C. And 1 kJ equals 0.24 kcal.] 


D. What would the consequences be if the cells of your 
body could convert the energy in food substances with 
only 20% efficiency? Would your body—as it is presently 
constructed—work just fine, overheat, or freeze? 


E. A resting human hydrolyzes about 40 kg of ATP every 24 
hours. The oxidation of how much glucose would produce 
this amount of energy? (Hint: Look up the structure of ATP 
in Figure 2—26 to calculate its molecular weight; the atomic 
weights of H, C, N, O, and P are 1, 12, 14, 16, and 31, 
respectively.) 


QUESTION 3-15 


A prominent scientist claims to have isolated mutant cells 
that can convert 1 molecule of glucose into 57 molecules 
of ATP. Should this discovery be celebrated, or do you 
suppose that something might be wrong with it? Explain 
your answer. 


QUESTION 3-16 


In a simple reaction A +> A*, a molecule is interconvertible 
between two forms that differ in standard free energy G° by 
18 kJ/mole, with A* having the higher G°. 


A. Use Table 3-1 (p. 96) to find how many more molecules 
will be in state A* compared with state A at equilibrium. 


B. If an enzyme lowered the activation energy of the 
reaction by 11.7 kJ/mole, how would the ratio of A to A* 
change? 


QUESTION 3-17 


In a mushroom, a reaction in a single-step biosynthetic 
pathway that converts a metabolite into a particularly 
vicious poison (metabolite +> poison) is energetically 
highly unfavorable. The reaction is normally driven by ATP 
hydrolysis. Assume that a mutation in the enzyme that 
catalyzes the reaction prevents it from utilizing ATP, but still 
allows it to catalyze the reaction. 


A. Do you suppose it might be safe for you to eat a 
mushroom that bears this mutation? Base your answer on an 
estimation of how much less poison the mutant mushroom 
would produce, assuming the reaction is in equilibrium 

and most of the energy stored in ATP is used to drive the 
unfavorable reaction in nonmutant mushrooms. 


B. Would your answer be different for another mutant 
mushroom whose enzyme couples the reaction to ATP 
hydrolysis but works 100 times more slowly? 


Questions 15 


QUESTION 3-18 


Consider the effects of two enzymes, A and B. Enzyme A 
catalyzes the reaction 


ATP + GDP <+ ADP + GTP 
and enzyme B catalyzes the reaction 
NADH + NADPt < NAD+ + NADPH 


Discuss whether the enzymes would be beneficial or 
detrimental to cells. 


QUESTION 3-19 


Discuss the following statement: “Enzymes and heat are 
alike in that both can speed up reactions that—although 
thermodynamically feasible—do not occur at an appreciable 
rate because they require a high activation energy. Diseases 
that seem to benefit from the careful application of heat—in 
the form of hot chicken soup, for example—are therefore 
likely to be due to the insufficient function of an enzyme.” 


Protein Structure and Function 


When we look at a cell in a microscope or analyze its electrical or bio- 
chemical activity, we are, in essence, observing the handiwork of proteins. 
Proteins are the main building blocks from which cells are assembled, 
and they constitute most of the cell’s dry mass. In addition to provid- 
ing the cell with shape and structure, proteins also execute nearly all its 
myriad functions. Enzymes promote intracellular chemical reactions by 
providing intricate molecular surfaces contoured with particular bumps 
and crevices that can cradle or exclude specific molecules. Transporters 
and channels embedded in the plasma membrane control the passage 
of nutrients and other small molecules into and out of the cell. Other 
proteins carry messages from one cell to another, or act as signal inte- 
grators that relay information from the plasma membrane to the nucleus 
of individual cells. Some proteins act as motors that propel organelles 
through the cytosol, and others function as components of tiny molecu- 
lar machines with precisely calibrated moving parts. Specialized proteins 
also act as antibodies, toxins, hormones, antifreeze molecules, elastic 
fibers, or luminescence generators. To understand how muscles contract, 
how nerves conduct electricity, how embryos develop, or how our bodies 
function, we must first understand how proteins operate. 


The multiplicity of functions carried out by these remarkable macromol- 
ecules, a few of which are represented in , p. 118, arises from 
the huge number of different shapes proteins adopt. We therefore begin 
our description of proteins by discussing their three-dimensional struc- 
tures and the properties that these structures confer. We next look at how 
proteins work: how enzymes catalyze chemical reactions, how some 
proteins act as molecular switches, and how others generate orderly 
movement. We then examine how cells control the activity and location 


THE SHAPE AND STRUCTURE 
OF PROTEINS 


HOW PROTEINS WORK 


HOW PROTEINS ARE 
CONTROLLED 


HOW PROTEINS ARE STUDIED 
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ENZYMES 


function: Catalyze covalent bond breakage 
or formation 


examples: Living cells contain thousands of 
different enzymes, each of which catalyzes 
(speeds up) one particular reaction. Examples 
include: alcohol dehydrogenase—makes the 
alcohol in wine; pepsin—degrades dietary 
proteins in the stomach; ribulose 
bisphosphate carboxylase—helps convert 
carbon dioxide into sugars in plants; DNA 
polymerase—copies DNA; protein kinase — 
adds a phosphate group to a protein 
molecule. 


MOTOR PROTEINS 


function: Generate movement in cells and 
tissues 


STRUCTURAL PROTEINS 


function: Provide mechanical support to 
cells and tissues 


examples: Outside cells, collagen and elastin 
are common constituents of extracellular 
matrix and form fibers in tendons and 
ligaments. Inside cells, tubulin forms long, stiff 
microtubules, and actin forms filaments that 
underlie and support the plasma membrane; 
keratin forms fibers that reinforce epithelial 
cells and is the major protein in hair and horn. 


STORAGE PROTEINS 


function: Store amino acids or ions 


examples: Myosin in skeletal muscle cells 
provides the motive force for humans to 
move; kinesin interacts with microtubules to 
move organelles around the cell; dynein 
enables eukaryotic cilia and flagella to beat. 


RECEPTOR PROTEINS 


function: Detect signals and transmit them 
to the cell's response machinery 


examples: Rhodopsin in the retina detects 


examples: Iron is stored in the liver by binding 
to the small protein ferritin; ovalbumin in egg 
white is used as a source of amino acids for 
the developing bird embryo; casein in milk is a 
source of amino acids for baby mammals. 


TRANSCRIPTION REGULATORS 


function: Bind to DNA to switch genes on 
or off 


light; the acetylcholine receptor in the 
membrane of a muscle cell is activated by 
acetylcholine released from a nerve ending; 
the insulin receptor allows a cell to respond to 
the hormone insulin by taking up glucose; the 
adrenergic receptor on heart muscle increases 
the rate of the heartbeat when it binds to 
epinephrine secreted by the adrenal gland. 


examples: The Lac repressor in bacteria 
silences the genes for the enzymes that 
degrade the sugar lactose; many different 
DNA-binding proteins act as genetic switches 
to control development in multicellular 
organisms, including humans. 


TRANSPORT PROTEINS 


function: Carry small molecules or ions 


examples: In the bloodstream, serum albumin 
carries lipids, hemoglobin carries oxygen, and 
transferrin carries iron. Many proteins embedded 
in cell membranes transport ions or small 
molecules across the membrane. For example, the 
bacterial protein bacteriorhodopsin is a 
light-activated proton pump that transports H+ 
ions out of the cell; glucose transporters shuttle 
glucose into and out of cells; and a Ca?* pump 
clears Ca% from a muscle cell's cytosol after the 
ions have triggered a contraction. 


SIGNAL PROTEINS 


function: Carry extracellular signals from 
cell to cell 


examples: Many of the hormones and growth 
factors that coordinate physiological functions 
in animals are proteins. /nsulin, for example, is 
a small protein that controls glucose levels in 
the blood; netrin attracts growing nerve cell 
axons to specific locations in the developing 
spinal cord; nerve growth factor (NGF) 
stimulates some types of nerve cells to grow 
axons; epidermal growth factor (EGF) 
stimulates the growth and division of 
epithelial cells. 


SPECIAL-PURPOSE PROTEINS 


function: Highly variable 


a 


examples: Organisms make many proteins with 
highly specialized properties. These molecules 
illustrate the amazing range of functions that 
proteins can perform. The antifreeze proteins of 
Arctic and Antarctic fishes protect their blood 
against freezing; green fluorescent protein from 
jellyfish emits a green light; monellin, a protein 
found in an African plant, has an intensely sweet 
taste; mussels and other marine organisms secrete 
glue proteins that attach them firmly to rocks, 
even when immersed in seawater. 


of the proteins they contain. Finally, we present a brief description of the 
techniques that biologists use to work with proteins, including methods 
for purifying them—from tissues or cultured cells—and for determining 
their structures. 


THE SHAPE AND STRUCTURE OF PROTEINS 


From a chemical point of view, proteins are by far the most structurally 
complex and functionally sophisticated molecules known. This is per- 
haps not surprising, considering that the structure and activity of each 
protein has developed and been fine-tuned over billions of years of evo- 
lution. We start by considering how the position of each amino acid in 
the long string of amino acids that forms a protein determines its three- 
dimensional conformation, a shape that is stabilized by noncovalent 
interactions between different parts of the molecule. Understanding the 
structure of a protein at the atomic level allows us to see how the precise 
shape of the protein determines its function. 


The Shape of a Protein Is Specified by Its Amino Acid 
Sequence 


Proteins, as you may recall from Chapter 2, are assembled mainly from a 
set of 20 different amino acids, each with different chemical properties. 
A protein molecule is made from a long chain of these amino acids, held 
together by covalent peptide bonds (Figure 4-1). Proteins are therefore 
referred to as polypeptides, or polypeptide chains. In each type of pro- 
tein, the amino acids are present in a unique order, called the amino acid 
sequence, which is exactly the same from one molecule of that protein 
to the next. One molecule of human insulin, for example, should have 
the same amino acid sequence as every other molecule of human insulin. 
Many thousands of different proteins have been identified, each with its 
own distinct amino acid sequence. 


Each polypeptide chain consists of a backbone that is adorned with a 
variety of chemical side chains. The polypeptide backbone is formed 
from a repeating sequence of the core atoms (-N-C-C-) found in every 


carboxyl 
group 


glycine alanine 


PEPTIDE BOND 
FORMATION WITH 
REMOVAL OF WATER water 


peptide bond in glycylalanine 
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Figure 4-1 Amino acids are linked together 
by peptide bonds. A covalent peptide bond 
forms when the carbon atom of the carboxyl 
group of one amino acid (such as glycine) 
shares electrons with the nitrogen atom from 
the amino group of a second amino acid 
(such as alanine). Because a molecule of 
water is eliminated, peptide bond formation 
is classified as a condensation reaction (see 
Figure 2-31). In this diagram, carbon atoms 
are black, nitrogen blue, oxygen red, and 
hydrogen white. 
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Figure 4-2 A protein is made of 
amino acids linked together into a 
polypeptide chain. The amino acids 
are linked by peptide bonds (see 


Figure 4-1) to form a polypeptide polypeptide backbone 

backbone of repeating structure (gray x 

boxes), from which the side chain H H O o0 

of each amino acid projects. The amino terminus @| | | A carboxyl terminus 


i T eK (C-terminus) 
H i O 


sequence of these chemically distinct (N-terminus) FNC CaN CE 
side chains—which can be nonpolar | Il 
H 
(green), polar uncharged (yellow), 
positively charged (red), or negatively 
charged (blue)—gives each protein its 
distinct, individual properties. A small 
polypeptide of just four amino acids 
is shown here. Proteins are typically 
made up of chains of several hundred 
amino acids, whose sequence is always 


presented starting with the N-terminus 
and read from left to right. Aspartic acid 
(Asp) 


amino acid (Figure 4-2). Because the two ends of each amino acid are 
chemically different—one sports an amino group (NH3*, also written NH2) 
and the other a carboxyl group (COO, also written COOH)—each poly- 
peptide chain has a directionality: the end carrying the amino group is 
called the amino terminus, or N-terminus, and the end carrying the free 
carboxyl group is the carboxyl terminus, or C-terminus. 


peptide peptide bond 


bonds 


side chains 


Projecting from the polypeptide backbone are the amino acid side 
chains—the part of the amino acid that is not involved in forming peptide 
bonds (see Figure 4-2). The side chains give each amino acid its unique 
properties: some are nonpolar and hydrophobic (“water-fearing”), some 
are negatively or positively charged, some can be chemically reactive, 
and so on. The atomic formula for each of the 20 amino acids in proteins 
is presented in Panel 2-6 (pp. 76-77), and a brief list of the 20 common 
amino acids, with their abbreviations, is provided in Figure 4-3. 


Long polypeptide chains are very flexible, as many of the covalent bonds 
that link the carbon atoms in the polypeptide backbone allow free rota- 
tion of the atoms they join. Thus, proteins can in principle fold in an 


AMINO ACID SIDE CHAIN AMINO ACID SIDE CHAIN 


Aspartic acid Asp D negatively charged 
Glutamic acid Glu E negatively charged 


|_________ POLAR AMINO ACIDS NONPOLAR AMINO ACIDS ——— 


Figure 4-3 Twenty different amino acids are commonly found in proteins. Both three-letter and one-letter abbreviations are given, 
as well as the character of the side chain. There are equal numbers of polar (hydrophilic) and nonpolar (hydrophobic) side chains, and 
half of the polar side chains are charged at neutral pH in an aqueous solution. The structures of all of these amino acids are shown in 
Panel 2-6, pp. 76-77. 


glutamic acid 


H | electrostatic 
CH, attractions 


valine 


alanine 


enormous number of ways. The shape of each of these folded chains, 
however, is constrained by many sets of weak noncovalent bonds that 
form within proteins. These bonds involve atoms in the polypeptide 
backbone, as well as atoms within the amino acid side chains. The non- 
covalent bonds that help proteins fold up and maintain their shape include 
hydrogen bonds, electrostatic attractions, and van der Waals attractions, 
which are described in Chapter 2 (see Panel 2-3, pp. 70-71). Because a 
noncovalent bond is much weaker than a covalent bond, it takes many 
noncovalent bonds to hold two regions of a polypeptide chain tightly 
together. The stability of each folded shape is largely determined by the 
combined strength of large numbers of noncovalent bonds (Figure 4-4). 


A fourth weak interaction, the hydrophobic force, also has a central role 
in determining the shape of a protein. In an aqueous environment, hydro- 
phobic molecules, including the nonpolar side chains of particular amino 
acids, tend to be forced together to minimize their disruptive effect on 
the hydrogen-bonded network of the surrounding water molecules (see 
Panel 2-3, pp. 70-71). Therefore, an important factor governing the fold- 
ing of any protein is the distribution of its polar and nonpolar amino 
acids. The nonpolar (hydrophobic) side chains—which belong to amino 
acids such as phenylalanine, leucine, valine, and tryptophan (see Figure 
4-3)—tend to cluster in the interior of the folded protein (just as hydro- 
phobic oil droplets coalesce to form one large drop). Tucked away inside 
the folded protein, hydrophobic side chains can avoid contact with the 
aqueous environment that surrounds them inside a cell. In contrast, polar 
side chains—such as those belonging to arginine, glutamine, and histi- 
dine—tend to arrange themselves near the outside of the folded protein, 
where they can form hydrogen bonds with water and with other polar 
molecules (Figure 4-5). When polar amino acids are buried within the 
protein, they are usually hydrogen-bonded to other polar amino acids or 
to the polypeptide backbone (Figure 4-6). 
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Figure 4-4 Three types of noncovalent 
bonds help proteins fold. Although a 
single one of any of these bonds is quite 
weak, many of them together can create a 
strong bonding arrangement that stabilizes 
a particular three-dimensional structure, 

as in the small polypeptide shown in 

the center. R is often used as a general 
designation for an amino acid side chain. 
Protein folding is also aided by hydrophobic 
forces, as shown in Figure 4-5. 
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Figure 4-5 Hydrophobic forces help 

proteins fold into compact conformations. 

In a folded protein, polar amino acid side 

chains tend to be displayed on the surface, 

where they can interact with water; nonpolar 

amino acid side chains are buried on the 

inside to form a tightly packed hydrophobic \ 
core of atoms that are hidden from water. 


unfolded polypeptide 


Figure 4-6 Hydrogen bonds within a 
protein molecule help stabilize its folded 
shape. Large numbers of hydrogen bonds 
form between adjacent regions of a folded 
polypeptide chain. The structure shown is a 
portion of the enzyme lysozyme, between 
amino acids 42 and 63. Hydrogen bonds 
between two atoms in the polypeptide 
backbone are shown in red; those between 
the backbone and a side chain are shown 
in yellow; and those between atoms of two 
side chains are shown in blue. Note that 
the same amino acid side chain can make 
multiple hydrogen bonds (red arrow). In this 
diagram, nitrogen atoms are blue, oxygen 
atoms are red, and carbon atoms are gray; 
hydrogen atoms are not shown. (After 
C.K. Mathews, K.E. van Holde, and K.G. 
Ahern, Biochemistry, 3rd ed. San Francisco: 
Benjamin Cummings, 2000.) 
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Proteins Fold into a Conformation of Lowest Energy 


Each type of protein has a particular three-dimensional structure, which 
is determined by the order of the amino acids in its polypeptide chain. 
The final folded structure, or conformation, adopted by any polypeptide 
chain is determined by energetic considerations: a protein generally folds 
into the shape in which its free energy (G) is minimized. The folding pro- 
cess is thus energetically favorable, as it releases heat and increases the 
disorder of the universe (see Panel 3-1, pp. 94-95). 


` 
s 


A 
\ 
= 
5 
4 
a // A 
> i "a 


s TO : 


RRA 


> at" 


ws 
D 


- 


hydrogen bond between 
atoms of two amino 
acid side chains 


J EXPOSE TO A HIGH J 
CONCENTRATION REMOVE 

O OF UREA UREA È 
NY —— > — e 


purified protein protein refolds into its 
isolated from cells original conformation 
denatured protein 


Protein folding has been studied in the laboratory using highly purified 
proteins. A protein can be unfolded, or denatured, by treatment with sol- 
vents that disrupt the noncovalent interactions holding the folded chain 
together. This treatment converts the protein into a flexible polypeptide 
chain that has lost its natural shape. Under the right conditions, when the 
denaturing solvent is removed, the protein often refolds spontaneously 
into its original conformation—a process called renaturation (Figure 4-7). 
The fact that a denatured protein can, on its own, refold into the cor- 
rect conformation indicates that all the information necessary to specify 
the three-dimensional shape of a protein is contained in its amino acid 
sequence. 


Although a protein chain can fold into its correct conformation without 
outside help, protein folding in a living cell is generally assisted by a large 
set of special proteins called chaperone proteins. Some of these chaper- 
ones bind to partly folded chains and help them to fold along the most 
energetically favorable pathway (Figure 4-8). Others form “isolation 
chambers” in which single polypeptide chains can fold without the risk of 
forming aggregates in the crowded conditions of the cytoplasm (Figure 
4-9). In either case, the final three-dimensional shape of the protein is 
still specified by its amino acid sequence; chaperones merely make the 
folding process more efficient and reliable. 


Each protein normally folds into a single, stable conformation. This con- 
formation, however, often changes slightly when the protein interacts 
with other molecules in the cell. Such changes in shape are crucial to the 
function of the protein, as we discuss later. 


newly synthesized, 
partially folded protein 
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Figure 4-8 Chaperone proteins can guide the folding of a newly synthesized 
polypeptide chain. The chaperones bind to newly synthesized or partially folded 
chains and help them to fold along the most energetically favorable pathway. The 
function of these chaperones requires ATP binding and hydrolysis. 
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Figure 4-7 Denatured proteins can 

often recover their natural shapes. This 
type of experiment demonstrates that the 
conformation of a protein is determined 
solely by its amino acid sequence. 
Renaturation requires the correct conditions 
and works best for small proteins. 


QUESTION 4-1 


Urea, used in the experiment shown 
in Figure 4-7, is a molecule that 
disrupts the hydrogen-bonded 
network of water molecules. Why 
might high concentrations of urea 
unfold proteins? The structure of 
urea is shown here. 
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by the chaperone chain folds when cap 
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Figure 4-9 Some chaperone proteins act as isolation chambers that help a 
polypeptide fold. In this case, the barrel of the chaperone provides an enclosed 
chamber in which a newly synthesized polypeptide chain can fold without the risk of 
aggregating with other polypeptides in the crowded conditions of the cytoplasm. 
This system also requires an input of energy from ATP hydrolysis, mainly for the 
association and subsequent dissociation of the cap that closes off the chamber. 


Proteins Come in a Wide Variety of Complicated Shapes 


Proteins are the most structurally diverse macromolecules in the cell. 
Although they range in size from about 30 amino acids to more than 
10,000, the vast majority are between 50 and 2000 amino acids long. 
Proteins can be globular or fibrous, and they can form filaments, sheets, 
rings, or spheres (Figure 4—10). We will encounter many of these struc- 
tures throughout the book. 


To date, the structures of about 100,000 different proteins have been 
determined (using techniques we discuss later in the chapter). Most pro- 
teins have a three-dimensional conformation so intricate and irregular 
that their structure would require the rest of the chapter to describe in 
detail. But we can get some sense of the intricacies of polypeptide struc- 
ture by looking at the conformation of a relatively small protein, such as 
the bacterial transport protein HPr. 


This small protein, only 88 amino acids long, facilitates the transport 
of sugar into bacterial cells. In Figure 4-11, we present HPr’s three- 
dimensional structure in four different ways, each of which emphasizes 
different features of the protein. The backbone model (see Figure 4—11A) 
shows the overall organization of the polypeptide chain and provides a 
straightforward way to compare the structures of related proteins. The 
ribbon model (see Figure 4-11B) shows the polypeptide backbone in a 
way that emphasizes its most conspicuous folding patterns, which we 
describe in detail shortly. The wire model (see Figure 4—11C) includes the 
positions of all the amino acid side chains; this view is especially useful 
for predicting which amino acids might be involved in the protein’s activ- 
ity. Finally, the space-filling model (see Figure 4-11D) provides a contour 
map of the protein surface, which reveals which amino acids are exposed 
on the surface and shows how the protein might look to a small molecule 
such as water or to another macromolecule in the cell. 


The structures of larger proteins—or of multiprotein complexes—are even 
more complicated. To visualize such detailed and intricate structures, 
scientists have developed various computer-based tools to empha- 
size different features of a protein, only some of which are depicted in 
Figure 4-11. All of these images can be displayed on a computer screen 
and readily rotated and magnified to view all aspects of the structure 
(Movie 4.1). 


When the three-dimensional structures of many different protein mol- 
ecules are compared, it becomes clear that, although the overall 
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Figure 4—10 Proteins come in a wide variety of shapes and sizes. Each folded polypeptide is shown as a space-filling model, 
represented at the same scale. In the top-left corner is HPr, the small transport protein featured in detail in Figure 4-11. The protein 
deoxyribonuclease is shown bound to a portion of a DNA molecule (gray) for comparison. 
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(A) backbone model 


(B) ribbon model 


(C) wire model 


(D) space-filling model 
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Figure 4—11 Protein conformation can be represented in a variety 
of ways. Shown here is the structure of the small bacterial transport 
protein HPr. The images are colored to make it easier to trace the path 
of the polypeptide chain. In these models, the region of polypeptide 
chain carrying the protein’s N-terminus is purple and that near its 
C-terminus is red. 


conformation of each protein is unique, some regular folding patterns 
can be detected, as we discuss next. 


The a Helix and the B Sheet Are Common Folding 
Patterns 


More than 60 years ago, scientists studying hair and silk discovered two 
regular folding patterns that are present in many different proteins. The 
first to be discovered, called the a helix, was found in the protein a-Keratin, 
which is abundant in skin and its derivatives—such as hair, nails, and 
horns. Within a year of that discovery, a second folded structure, called 
a B sheet, was found in the protein fibroin, the major constituent of silk. 
(Biologists often use Greek letters to name their discoveries, with the first 
example receiving the designation a, the second B, and so on.) 


These two folding patterns are particularly common because they result 
from hydrogen bonds that form between the N-H and C=O groups in 
the polypeptide backbone (see Figure 4-6). Because the amino acid side 
chains are not involved in forming these hydrogen bonds, a helices and B 
sheets can be generated by many different amino acid sequences. In each 
case, the protein chain adopts a regular, repeating form. These structural 
features, and the shorthand cartoon symbols that are often used to repre- 
sent them in models of protein structures, are presented in Figures 4—12 
and 4-13. 
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Figure 4-12 Some polypeptide chains fold into an orderly repeating form 
known as an @ helix. (A) In an a helix, the N-H of every peptide bond is hydrogen- 
bonded to the C=O of a neighboring peptide bond located four amino acids away 
in the same chain. All of the atoms in the polypeptide backbone are shown; the 
amino acid side chains are denoted by R. (B) The same polypeptide, showing only 
the carbon (black and gray) and nitrogen (blue) atoms. (C) Cartoon symbol used to 
represent an @ helix in ribbon models of proteins (see Figure 4—11B). 
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Helices Form Readily in Biological Structures 
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The abundance of helices in proteins is, in a way, not surprising. A helix 
is generated simply by placing many similar subunits next to one another, 
each in the same strictly repeated relationship to the one before. Because 
it is very rare for subunits to join up in a straight line, this arrangement 
will generally result in a structure that resembles a spiral staircase 
(Figure 4-14). Depending on the way it twists, a helix is said to be either 
right-handed or left-handed (see Figure 4-14E). Handedness is not 
affected by turning the helix upside down, but it is reversed if the helix is 
reflected in a mirror. 
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Figure 4-13 Some polypeptide chains 
fold into an orderly pattern called a 

B sheet. (A) In a B sheet, several segments 
(strands) of an individual polypeptide chain 
are held together by hydrogen-bonding 
between peptide bonds in adjacent 
strands. The amino acid side chains in 

each strand project alternately above 

and below the plane of the sheet. In the 
example shown, the adjacent chains run in 
opposite directions, forming an antiparallel 
B sheet. All of the atoms in the polypeptide 
backbone are shown; the amino acid side 
chains are denoted by R. (B) The same 
polypeptide, showing only the carbon 
(black and gray) and nitrogen (blue) atoms. 
(C) Cartoon symbol used to represent 

B sheets in ribbon models of proteins (see 
Figure 4—-11B). 


QUESTION 4-2 


Remembering that the amino 

acid side chains projecting from 
each polypeptide backbone in a 

B sheet point alternately above 
and below the plane of the sheet 
(see Figure 4—13A), consider 

the following protein sequence: 
Leu-Lys-Val-Asp-lle-Ser-Leu-Arg- 
Leu-Lys-lle-Arg-Phe-Glu. Do you 
find anything remarkable about the 
arrangement of the amino acids in 
this sequence when incorporated 
into a B sheet? Can you make any 
predictions as to how the f sheet 
might be arranged in a protein? 
(Hint: consult the properties of the 
amino acids listed in Figure 4—3.) 


Figure 4—14 A helix is a common, regular, 
biological structure. A helix will form when 
a series of similar subunits bind to each 
other in a regular way. At the bottom, the 
interaction between two subunits is shown; 
behind them are the helices that result. 
These helices have (A) two, (B) three, or 

(C and D) six subunits per helical turn. At 
the top, the arrangement of subunits has 
been photographed from directly above the 
helix. Note that the helix in (D) has a wider 
path than that in (C), but the same number 
of subunits per turn. (E) A helix can be either 
right-handed or left-handed. As a reference, 
it is useful to remember that standard 

metal screws, which advance when turned 
clockwise, are right-handed. So to judge the 
handedness of a helix, imagine screwing it 
into a wall. Note that a helix preserves the 
same handedness when it is turned upside 
down. In proteins, a helices are almost 
always right-handed. 
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hydrophobic amino Figure 4-15 Many membrane-bound proteins cross the lipid 
acid side chain bilayer as an a helix. The hydrophobic side chains of the amino acids 
hydrogen bond that form the a helix make contact with the hydrophobic hydrocarbon 


tails of the phospholipid molecules, while the hydrophilic parts of the 
polypeptide backbone form hydrogen bonds with one another along 
the interior of the helix. About 20 amino acids are required to span a 
membrane in this way. Note that, despite the appearance of a space 
along the interior of the helix in this schematic diagram, the helix is not 
a channel: no ions or small molecules can pass through it. 


An a helix is generated when a single polypeptide chain turns around itself 
to form a structurally rigid cylinder. A hydrogen bond is made between 
every fourth amino acid, linking the C=O of one peptide bond to the N-H 
of another (see Figure 4-12A). This pattern gives rise to a regular right- 
handed helix with a complete turn every 3.6 amino acids (Movie 4.2). 


Short regions of a helix are especially abundant in proteins that are 
embedded in cell membranes, such as transport proteins and receptors. 
We see in Chapter 11 that the portions of a transmembrane protein that 
phospholipid cross the lipid bilayer usually form an a helix, composed largely of amino 
acids with nonpolar side chains. The polypeptide backbone, which is 
hydrophilic, is hydrogen-bonded to itself inside the a helix, where it is 
shielded from the hydrophobic lipid environment of the membrane by the 
protruding nonpolar side chains (Figure 4—15). 


Sometimes two (or three) a helices will wrap around one another to form 
a particularly stable structure called a coiled-coil. This structure forms 
when the a helices have most of their nonpolar (hydrophobic) side chains 
along one side, so they can twist around each other with their hydro- 
phobic side chains facing inward—minimizing contact with the aqueous 
cytosol (Figure 4-16). Long, rodlike coiled-coils form the structural 
framework for many elongated proteins, including the a-keratin found in 
hair and the outer layer of the skin, as well as myosin, the motor protein 
responsible for muscle contraction (discussed in Chapter 17). 


Figure 4—16 Intertwined a helices can 
form a stiff coiled-coil. (A) A single a helix 
is shown, with successive amino acid side 
chains labeled in a sevenfold repeating 
sequence “abcdefg.” Amino acids “a” and 
“d” in such a sequence lie close together 
on the cylinder surface, forming a stripe 
(shaded in green) that winds slowly around 
the a helix. Proteins that form coiled- 

coils typically have nonpolar amino acids 
at positions “a” and “d.” Consequently, 
as shown in (B), two a helices can wrap 
around each other, with the nonpolar side 
chains of one a helix interacting with the 
nonpolar side chains of the other, while 
the more hydrophilic amino acid side 
chains (shaded in red) are left exposed to 
the aqueous environment. (C) A portion 
of the atomic structure of a coiled-coil 
made by two a helices, as determined by 
x-ray crystallography. In this structure, the 


stripe of 
hydrophobic 
“a” and “d” 


amino acids eal D 


helices wrap around each other to minimize 
exposure of hydrophobic amino acid 
side chains to aqueous environment 


backbones of the helices are shown in red, COOH 

3 k k i HOOC COOH 
the interacting, nonpolar side chains are 
green, and the remaining side chains are L 


light gray. Coiled-coils can also form from 
three a helices (Movie 4.3). (A) (B) (© 


B Sheets Form Rigid Structures at the Core of Many 
Proteins 


A B sheet is made when hydrogen bonds form between segments of a 
polypeptide chain that lie side by side (see Figure 4—13A). When the neigh- 
boring segments run in the same orientation (say, from the N-terminus 
to the C-terminus), the structure forms a parallel p sheet; when they 
run in opposite directions, the structure forms an antiparallel p sheet 
(Figure 4-17). Both types of B sheet produce a very rigid, pleated struc- 
ture, and they form the core of many proteins. Even the small bacterial 
transport protein HPr (see Figure 4-11) contains several p sheets. 


B sheets have remarkable properties. They give silk fibers their extraor- 
dinary tensile strength. They also form the basis of amyloid structures, in 
which p sheets are stacked together in long rows with their amino acid 
side chains interdigitated like the teeth of a zipper (Figure 4—18). Such 
structures play an important role in cells, as we discuss later in this chap- 
ter. However, they can also precipitate disease, as we see next. 


Misfolded Proteins Can Form Amyloid Structures 
That Cause Disease 


When proteins fold incorrectly, they sometimes form amyloid structures 
that can damage cells and even whole tissues. These amyloid struc-tures 
are thought to contribute to a number of neurodegenerative disorders, such 
as Alzheimer’s disease and Huntington's disease. Some infectious neuro- 
degenerative diseases—including scrapie in sheep, bovine spongiform 
encephalopathy (BSE, or “mad cow” disease) in cattle, and Creutzfeldt- 
Jakob disease (CJD) in humans—are caused by misfolded proteins called 
prions. The misfolded prion form of a protein can convert the properly 
folded version of the protein in an infected brain into the abnormal confor- 
mation. This allows the misfolded prions to form aggregates (Figure 4-19), 
which can spread rapidly from cell to cell, eventually causing the death of 
the affected animal or human. Prions are considered “infectious” because 
they can also spread from an affected individual to a normal individual via 
contaminated food, blood, or surgical instruments, for example. 


Proteins Have Several Levels of Organization 


A protein’s structure does not begin and end with a helices and 
B sheets. Its complete conformation includes several interdependent 
levels of organization, which build one upon the next. Because a pro- 
tein’s structure begins with its amino acid sequence, this is considered its 
primary structure. The next level of organization includes the a helices 
and p sheets that form within certain segments of the polypeptide chain; 
these folds are elements of the protein’s secondary structure. The full, 
three-dimensional conformation formed by an entire polypeptide chain— 
including the a helices, B sheets, and all other loops and folds that form 
between the N- and C-termini—is sometimes referred to as the tertiary 
structure. Finally, if the protein molecule exists as a complex of more 
than one polypeptide chain, then these interacting polypeptides form its 
quaternary structure. 


Figure 4-18 B sheets can stack to form an amyloid structure. 

(A) Electron micrograph showing an amyloid structure from a yeast. 
This structure resembles the type of insoluble aggregates observed in 
the neurons of individuals with different neurodegenerative diseases 
(see Figure 4-19). (B) Schematic representation shows the stacking of 
P sheets that stabilizes an individual amyloid strand. (A, from M.R. 
Sawaya et al., Nature 447:453-457, 2007. With permission from 
Macmillan Publishers Ltd.) 
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(A) 


Figure 4—17 B sheets come in two 
varieties. (A) Antiparallel B sheet (see also 
Figure 4—13A). (B) Parallel B sheet. Both of 
these structures are common in proteins. 
By convention, the arrows point toward 
the C-terminus of the polypeptide chain 
(Movie 4.4). 
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(A) normal protein can, on occasion, adopt 
an abnormal, misfolded prion form 


Q ” d 
normal 
protein 


abnormal prion form 
of protein 


(B) the prion form of the protein can bind 
to the normal form, inducing conversion 
to the abnormal conformation 


binding 


le-x 


heterodimer 


protein to abnormal 


conversion of normal 
prion form 


(C) abnormal prion proteins propagate 
and aggregate to form amyloid fibrils 


amyloid fibril 


Figure 4—20 Many proteins are composed 
of separate functional domains. Elements 
of secondary structure such as a helices 

and B sheets pack together into stable, 
independently folding, globular elements 
called protein domains. A typical protein 
molecule is built from one or more domains, 
linked by a region of polypeptide chain 

that is often relatively unstructured. The 
ribbon diagram on the right represents the 
bacterial transcription regulatory protein 
CAP, which consists of one large cyclic 
AMP-binding domain (outlined in blue) and 
one small DNA-binding domain (outlined 

in yellow). The function of this protein is 
described in Chapter 8 (see Figure 8-9). 


secondary 
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Figure 4-19 Prion diseases are caused by proteins whose 
misfolding is infectious. (A) A protein undergoes a rare 
conformational change to produce an abnormally folded prion form. 
(B) The abnormal form causes the conversion of normal proteins 

in the host's brain into the misfolded prion form. (C) The prions 
aggregate into amyloid fibrils, which can disrupt brain-cell function, 
causing a neurodegenerative disorder (see also Figure 4-18). Some 
of the abnormal amyloid fibrils that form in major neurodegenerative 
disorders such as Alzheimer's disease may be able to propagate from 
cell to cell in this way. 


Studies of the conformation, function, and evolution of proteins have 
also revealed the importance of a level of organization distinct from 
the four just described. This organizational unit is the protein domain, 
which is defined as any segment of a polypeptide chain that can fold 
independently into a compact, stable structure. A protein domain usu- 
ally contains between 40 and 350 amino acids—folded into a helices and 
B sheets and other elements of structure—and it is the modular unit from 
which many larger proteins are constructed (Figure 4—20). 


Different domains of a protein are often associated with different func- 
tions. For example, the bacterial catabolite activator protein (CAP), 
illustrated in Figure 4-20, has two domains: a small domain that binds 
to DNA and a large domain that binds cyclic AMP, a small intracellular 
signaling molecule. When the large domain binds cyclic AMP, it causes a 
conformational change in the protein that enables the small domain to 
bind to a specific DNA sequence and thereby promote the expression of 
an adjacent gene. To provide a sense of the many different domain struc- 
tures observed in proteins, ribbon models of three different domains are 
shown in Figure 4-21. 


Proteins Also Contain Unstructured Regions 


Small protein molecules, such as the oxygen-carrying muscle protein 
myoglobin, contain only a single domain (see Figure 4-10). Larger pro- 
teins can contain as many as several dozen domains, which are often 


a helix 


B sheet 


single protein 
domain 


protein molecule 
made of two 
different domains 


structure 


(A) (B) (©) 


connected by relatively short, unstructured lengths of polypeptide chain. 
The ubiquity of such intrinsically disordered sequences, which con- 
tinually bend and flex due to thermal buffeting, became appreciated only 
after bioinformatics methods were developed that could recognize them 
from their amino acid sequences. Present estimates suggest that a third 
of all eukaryotic proteins also possess longer, unstructured regions— 
greater than 30 amino acids in length—in their polypeptide chains. These 
unstructured sequences can have a variety of important functions in 
cells, as we discuss later in the chapter. 


Few of the Many Possible Polypeptide Chains Will Be 
Useful 


In theory, a vast number of different polypeptide chains could be made 
from 20 different amino acids. Because each amino acid is chemically 
distinct and could, in principle, occur at any position, a polypeptide chain 
four amino acids long has 20 x 20 x 20 x 20 = 160,000 different possible 
sequences. For a typical protein with a length of 300 amino acids, that 
means that more than 20300 (that’s 10320) different polypeptide chains 
could theoretically be produced. And that’s just one protein. 


Of the unimaginably large collection of potential polypeptide sequences, 
only a minuscule fraction is actually made by cells. That’s because most 
biological functions depend on proteins with stable, well-defined three- 
dimensional conformations. This requirement greatly restricts the list of 
polypeptide sequences present in living cells. Another constraint is that 
functional proteins must be “well-behaved” and not engage in unwanted 
associations with other proteins in the cell—forming insoluble pro- 
tein aggregates, for example. Many potential protein sequences would 
therefore have been eliminated by natural selection through the long 
trial-and-error process that underlies evolution (discussed in Chapter 9). 


Thanks to natural selection, the amino acid sequences of many present- 
day polypeptides have evolved to adopt a stable conformation—one that 
bestows upon the protein the exact chemical properties that will enable it 
to perform a particular function. Such proteins are so precisely built that 
a change in even a few atoms in one amino acid can sometimes disrupt 
the structure of a protein and thereby eliminate its function. In fact, the 
conformations of many proteins—and their constituent domains—are so 
stable and effective that they have been conserved throughout the evolu- 
tion of a diverse array of organisms. For example, the three-dimensional 
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Figure 4-21 Ribbon models show three 
different protein domains. (A) Cytochrome 
bs62 is a single-domain protein involved in 
electron transfer in E. coli. It is composed 
almost entirely of a helices. (B) The 
NAD-binding domain of the enzyme 
lactate dehydrogenase is composed of a 
mixture of a helices and B sheets. (C) An 
immunoglobulin domain of an antibody 
molecule is composed of a sandwich of two 
antiparallel B sheets. In these examples, 
the a helices are shown in green, while 
strands organized as B sheets are red. The 
protruding loop regions (yellow) are often 
unstructured and can provide binding sites 
for other molecules. 
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QUESTION 4-3 


Random mutations only very rarely 
result in changes that improve a 
protein's usefulness for the cell, yet 
useful mutations are selected in 
evolution. Because these changes 
are so rare, for each useful mutation 
there are innumerable mutations 
that lead to either no improvement 
or inactive proteins. Why, then, do 
cells not contain millions of proteins 
that are of no use? 


Figure 4—22 Serine proteases constitute a 
family of proteolytic enzymes. Backbone 
models of two serine proteases, elastase 
and chymotrypsin, are illustrated. Although 
only those amino acid sequences in the 
polypeptide chain shaded in green are 
the same in the two proteins, the two 
conformations are very similar nearly 
everywhere. Nonetheless, the two proteases 
act on different substrates. 

The active site of each enzyme—where 
its substrates are bound and cleaved—is 
circled in red. The amino acid serine directly 
participates in the cleavage reaction, 
which is why the enzymes are called serine 
proteases. The black dots on the right side 
of the chymotrypsin molecule mark the two 
ends created where the enzyme has cleaved 
its own backbone. 
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structures of the DNA-binding domains of some transcription regulators 
from yeast, animals, and plants are almost completely superimposable, 
even though the organisms are separated by more than a billion years 
of evolution. Other proteins, however, have changed their structure and 
function over evolutionary time, as we now discuss. 


Proteins Can Be Classified into Families 


Once a protein has evolved a stable conformation with useful properties, 
its structure can be modified over time to enable it to perform new func- 
tions. We know that this occurred quite often during evolution, because 
many present-day proteins can be grouped into protein families, in which 
each family member has an amino acid sequence and a three-dimensional 
conformation that closely resemble those of the other family members. 


Consider, for example, the serine proteases, a family of protein-cleaving 
(proteolytic) enzymes that includes the digestive enzymes chymotrypsin, 
trypsin, and elastase, as well as several proteases involved in blood clot- 
ting. When any two of these enzymes are compared, portions of their 
amino acid sequences are found to be nearly the same. The similarity 
of their three-dimensional conformations is even more striking: most of 
the detailed twists and turns in their polypeptide chains, which are sev- 
eral hundred amino acids long, are virtually identical (Figure 4—22). The 
various serine proteases nevertheless have distinct enzymatic activities, 
each cleaving different proteins or the peptide bonds between different 
types of amino acids. 


Large Protein Molecules Often Contain More than 
One Polypeptide Chain 


The same type of weak noncovalent bonds that enable a polypeptide 
chain to fold into a specific conformation also allow proteins to bind 
to each other to produce larger structures in the cell. Any region on a 
protein’s surface that interacts with another molecule through sets of 
noncovalent bonds is termed a binding site. A protein can contain binding 
sites for a variety of molecules, large and small. If a binding site recog- 
nizes the surface of a second protein, the tight binding of two folded 
polypeptide chains at this site will create a larger protein, whose quater- 
nary structure has a precisely defined geometry. Each polypeptide chain 
in such a protein is called a subunit, and each of these subunits may 
contain more than one domain. 


dimer of the CAP protein tetramer of neuraminidase protein 


dimer formed by 

interaction between 

a single, identical 

binding site on each 
(A) 


monomer (B) 


tetramer formed by 
interactions between 


sites on each monomer 


In the simplest case, two identical, folded polypeptide chains form a sym- 
metrical complex of two protein subunits (called a dimer) that is held 
together by interactions between two identical binding sites. CAP, the 
bacterial protein we discussed earlier, is such a dimer (Figure 4—23A),; it 
is composed of two identical copies of the protein subunit, each of which 
contains two domains, as shown previously in Figure 4-20. Many other 
symmetrical protein complexes, formed from multiple copies of the same 
polypeptide chain, are commonly found in cells. The enzyme neurami- 
nidase, for example, consists of a ring of four identical protein subunits 
(Figure 4—23B). 


Other proteins contain two or more different polypeptide chains. 
Hemoglobin, the protein that carries oxygen in red blood cells, is a particu- 
larly well-studied example. The protein contains two identical a-globin 
subunits and two identical B-globin subunits, symmetrically arranged 
(Figure 4—24). Many proteins contain multiple subunits, and they can be 
very large (Movie 4.5). 
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two nonidentical binding 


Figure 4—23 Many protein molecules 
contain multiple copies of the same 
protein subunit. (A) A symmetrical 
dimer. The protein CAP is a complex of 
two identical polypeptide chains (see 
also Figure 4-20). (B) A symmetrical 
homotetramer. The enzyme 
neuraminidase exists as a ring of four 
identical polypeptide chains. For both 
(A) and (B), a small schematic below the 
structure emphasizes how the repeated 
use of the same binding interaction 
forms the structure. In (A), the use of the 
same binding site on each monomer 
(represented by brown and green ovals) 
causes the formation of a symmetrical 
dimer. In (B), a pair of nonidentical 
binding sites (represented by orange 
circles and blue squares) causes the 
formation of a symmetrical tetramer. 


133 


Figure 4—24 Some proteins are formed as 


a symmetrical assembly of two different 
subunits. Hemoglobin, an oxygen-carrying 


protein abundant in red blood cells, 


contains two copies of a-globin (green) and 
two copies of B-globin (blue). Each of these 
four polypeptide chains cradles a molecule 
of heme (red), where oxygen (O2) is bound. 
Thus, each hemoglobin protein can carry 


four molecules of oxygen. 
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Figure 4-26 An actin filament is 
composed of identical protein subunits. 
(A) Transmission electron micrograph of an 
actin filament. (B) The helical array of actin 
molecules in an actin filament often contains 
thousands of molecules and extends for 
micrometers in the cell; 1 micrometer = 
1000 nanometers. (A, courtesy of Roger 
Craig.) 


Figure 4-25 Identical protein subunits can assemble into complex 
structures. (A) A protein with just one binding site can form a dimer 
with another identical protein. (B) Identical proteins with two different 
binding sites will often form a long, helical filament. (C) If the two 
binding sites are positioned appropriately in relation to each other, 
the protein subunits will form a closed ring instead of a helix (see also 
Figure 4—23B). 


Proteins Can Assemble into Filaments, Sheets, or Soheres 


Proteins can form even larger assemblies than those discussed so far. 
Most simply, a chain of identical protein molecules can be formed if 
the binding site on one protein molecule is complementary to another 
region on the surface of another protein molecule of the same type. 
Because each protein molecule is bound to its neighbor in an identical 
way (see Figure 4-14), the molecules will often be arranged in a helix 
that can be extended indefinitely in either direction (Figure 4-25). This 
type of arrangement can produce an extended protein filament. An actin 
filament, for example, is a long, helical structure formed from many mol- 
ecules of the protein actin (Figure 4-26). Actin is extremely abundant 
in eukaryotic cells, where it forms one of the major filament systems of 
the cytoskeleton (discussed in Chapter 17). Other sets of identical pro- 
teins associate to form tubes, as in the microtubules of the cytoskeleton 
(Figure 4-27), or cagelike spherical shells, as in the protein coats of virus 
particles (Figure 4-28). 


Many large structures, such as viruses and ribosomes, are built from a 
mixture of one or more types of protein plus RNA or DNA molecules. 
These structures can be isolated in pure form and dissociated into their 
constituent macromolecules. It is often possible to mix the isolated com- 
ponents back together and watch them reassemble spontaneously into 
the original structure. This demonstrates that all the information needed 
for assembly of the complicated structure is contained in the macro- 
molecules themselves. Experiments of this type show that much of the 
structure of a cell is self-organizing: if the required proteins are produced 
in the right amounts, the appropriate structures will form automatically. 


Some Types of Proteins Have Elongated Fibrous Shapes 


Most of the proteins we have discussed so far are globular proteins, in 
which the polypeptide chain folds up into a compact shape like a ball with 
an irregular surface. Enzymes, for example, tend to be globular proteins: 
even though many are large and complicated, with multiple subunits, 
most have a quaternary structure with an overall rounded shape (see 
Figure 4-10). In contrast, other proteins have roles in the cell that require 
them to span a large distance. These proteins generally have a rela- 
tively simple, elongated three-dimensional structure and are commonly 
referred to as fibrous proteins. 


Figure 4-27 A single type of protein subunit can pack together 
to form a filament, a hollow tube, or a spherical shell. Actin 
subunits, for example, form actin filaments (see Figure 4-26), whereas 
tubulin subunits form hollow microtubules, and some virus proteins 
form a spherical shell (capsid) that encloses the viral genome 

(see Figure 4-28). 


One large class of intracellular fibrous proteins resembles a-keratin, 
which we met earlier when we introduced the a helix. Keratin filaments 
are extremely stable: long-lived structures such as hair, horns, and nails 
are composed mainly of this protein. An a-Keratin molecule is a dimer 
of two identical subunits, with the long a helices of each subunit form- 
ing a coiled-coil (see Figure 4-16). These coiled-coil regions are capped 
at either end by globular domains containing binding sites that allow 
them to assemble into ropelike intermediate filaments—a component 
of the cytoskeleton that gives cells mechanical strength (discussed in 
Chapter 17). 


Fibrous proteins are especially abundant outside the cell, where they form 
the gel-like extracellular matrix that helps bind cells together to form tis- 
sues. These proteins are secreted by the cells into their surroundings, 
where they often assemble into sheets or long fibrils. Collagen is the most 
abundant of these fibrous extracellular proteins in animal tissues. A col- 
lagen molecule consists of three long polypeptide chains, each containing 
the nonpolar amino acid glycine at every third position. This regular struc- 
ture allows the chains to wind around one another to generate a long, 
regular, triple helix with glycine at its core (Figure 4—29A). Many such 
collagen molecules bind to one another, side-by-side and end-to-end, to 
create long, overlapping arrays called collagen fibrils, which are extremely 
strong and help hold tissues together, as described in Chapter 20. 


In complete contrast to collagen is another fibrous protein in the extracel- 
lular matrix, elastin. Elastin molecules are formed from relatively loose 
and unstructured polypeptide chains that are covalently cross-linked into 
a rubberlike elastic meshwork. The resulting elastic fibers enable skin and 
other tissues, such as arteries and lungs, to stretch and recoil without 
tearing. As illustrated in Figure 4—29B, the elasticity is due to the ability 
of the individual protein molecules to uncoil reversibly whenever they 
are stretched. 


Extracellular Proteins Are Often Stabilized by Covalent 
Cross-Linkages 


Many protein molecules are attached to the surface of a cell’s plasma 
membrane or secreted as part of the extracellular matrix, which exposes 
them to the potentially harsh conditions outside the cell. To help maintain 
their structures, the polypeptide chains in such proteins are often stabi- 
lized by covalent cross-linkages. These linkages can either tie together 
two amino acids in the same polypeptide chain or join together many 
polypeptide chains in a large protein complex—as for the collagen fibrils 
and elastic fibers just described. A variety of different types of cross-links 
exist. 


Figure 4—28 Many viral capsids are essentially spherical protein 
assemblies. They are formed from many copies of a small set of 
protein subunits. The nucleic acid of the virus (DNA or RNA) is 
packaged inside. The structure of the simian virus SV40, shown here, 
was determined by x-ray crystallography and is known in atomic detail. 
(Courtesy of Robert Grant, Stephan Crainic, and James M. Hogle.) 
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Figure 4—29 Fibrous proteins collagen and elastin form very different structures. (A) A collagen molecule is a 
triple helix formed by three extended protein chains that wrap around one another. Many rodlike collagen molecules 
are cross-linked together in the extracellular space to form collagen fibrils (top), which have the tensile strength of 
steel. The striping on the collagen fibril is caused by the regular repeating arrangement of the collagen molecules 
within the fibril. (B) Elastin molecules are cross-linked together by covalent bonds (red) to form rubberlike, elastic 
fibers. Each elastin polypeptide chain uncoils into a more extended conformation when the fiber is stretched, and 
recoils spontaneously as soon as the stretching force is relaxed. 


Figure 4-30 Disulfide bonds help stabilize 
a favored protein conformation. This 
diagram illustrates how covalent disulfide 
bonds form between adjacent cysteine side 
chains by the oxidation of their -SH groups. 


As indicated, these cross-links can join 


either two parts of the same polypeptide 
chain or two different polypeptide chains. 
Because the energy required to break one 
covalent bond is much larger than the 
energy required to break even a whole 
set of noncovalent bonds (see Table 2-1, 
p. 48), a disulfide bond can have a major 
stabilizing effect on a protein's folded 
structure (Movie 4.6). 


The most common covalent cross-links in proteins are sulfur-sulfur 
bonds. These disulfide bonds (also called S-S bonds) are formed, before 
a protein is secreted, by an enzyme in the endoplasmic reticulum that 
links together two -SH groups from cysteine side chains that are adja- 
cent in the folded protein (Figure 4-30). Disulfide bonds do not change a 
protein's conformation, but instead act as a sort of “atomic staple” to re- 
inforce the protein’s most favored conformation. Lysozyme—an enzyme 
in tears, saliva, and other secretions that can disrupt bacterial cell walls— 
retains its antibacterial activity for a long time because it is stabilized by 
such disulfide cross-links. 


Disulfide bonds generally do not form in the cell cytosol, where a high 
concentration of reducing agents converts such bonds back to cysteine 
-SH groups. Apparently, proteins do not require this type of structural 
reinforcement in the relatively mild conditions inside the cell. 
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HOW PROTEINS WORK 


For proteins, form and function are inextricably linked. Dictated by the 
surface topography of a protein’s side chains, this union of structure, 
chemistry, and activity gives proteins the extraordinary ability to orches- 
trate the large number of dynamic processes that occur in cells. But the 
fundamental question remains: How do proteins actually work? In this 
section, we will see that the activity of proteins depends on their ability 
to bind specifically to other molecules, allowing them to act as catalysts, 
structural supports, tiny motors, and so on. The examples we review here 
by no means exhaust the vast functional repertoire of proteins. However, 
the specialized functions of the proteins you will encounter elsewhere in 
this book are based on the same principles. 


All Proteins Bind to Other Molecules 


The biological properties of a protein molecule depend on its physical 
interaction with other molecules. Antibodies attach to viruses or bacteria 
as part of the body's defenses; the enzyme hexokinase binds glucose and 
ATP to catalyze a reaction between them; actin molecules bind to one 
another to assemble into long filaments; and so on. Indeed, all proteins 
stick, or bind, to other molecules in a specific manner. In some cases, this 
binding is very tight; in others, it is weak and short-lived. 


The binding of a protein to other biological molecules always shows great 
specificity: each protein molecule can bind to just one or a few molecules 
out of the many thousands of different molecules it encounters. Any sub- 
stance that is bound by a protein—whether it is an ion, a small organic 
molecule, or a macromolecule—is referred to as a ligand for that protein 
(from the Latin ligare, “to bind”). 


The ability of a protein to bind selectively and with high affinity to a ligand 
is due to the formation of a set of weak, noncovalent interactions—hydro- 
gen bonds, electrostatic attractions, and van der Waals attractions—plus 
favorable hydrophobic forces (see Panel 2-3, pp. 70-71). Each individ- 
ual noncovalent interaction is weak, so that effective binding requires 
many such bonds to be formed simultaneously. This is possible only if 
the surface contours of the ligand molecule fit very closely to the protein, 
matching it like a hand in a glove (Figure 4-31). 


When molecules have poorly matching surfaces, few noncovalent inter- 
actions occur, and the two molecules dissociate as rapidly as they come 
together. This is what prevents incorrect and unwanted associations 
from forming between mismatched molecules. At the other extreme, 
when many noncovalent interactions are formed, the association will 
persist (see Movie 2.4). Strong binding between molecules occurs in cells 
whenever a biological function requires that the molecules remain tightly 
associated for a long time—for example, when a group of macromol- 
ecules come together to form a functional subcellular structure such as 
a ribosome. 


The region of a protein that associates with a ligand, known as its bind- 
ing site, usually consists of a cavity in the protein surface formed by 
a particular arrangement of amino acid side chains. These side chains 
can belong to amino acids that are widely separated on the linear poly- 
peptide chain, but are brought together when the protein folds (Figure 
4-32). Other regions on the surface often provide binding sites for dif- 
ferent ligands that regulate the protein’s activity, as we discuss later. Still 
other parts of the protein may be required to attract or attach the protein 
to a particular location in the cell—for example, the hydrophobic a helix 
of a membrane-spanning protein allows it to be inserted into the lipid 
bilayer of a cell membrane (see Figure 4-15 and discussed in Chapter 11). 


How Proteins Work 


QUESTION 4-4 


Hair is composed largely of fibers 
of the protein keratin. Individual 
keratin fibers are covalently cross- 
linked to one another by many 
disulfide (S-S) bonds. If curly hair is 
treated with mild reducing agents 
that break a few of the cross-links, 
pulled straight, and then oxidized 
again, it remains straight. Draw a 
diagram that illustrates the three 
different stages of this chemical and 
mechanical process at the level of 
the keratin filaments, focusing on 
the disulfide bonds. What do you 
think would happen if hair were 
treated with strong reducing agents 
that break all the disulfide bonds? 


noncovalent bonds 


(A) protein 


(B) 


Figure 4-31 The binding of a protein to 
another molecule is highly selective. 
Many weak interactions are needed to 


enable a protein to bind tightly to a second 


molecule (a ligand). The ligand must 
therefore fit precisely into the protein’s 
binding site, so that a large number of 
noncovalent interactions can be formed 
between the protein and the ligand. 


(A) Schematic drawing showing the binding 


of a hypothetical protein and ligand; 


(B) space-filling model of the ligand—protein 


interaction shown in Figure 4-32. 
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Figure 4-32 Binding sites allow proteins to interact with specific ligands. (A) The folding of the polypeptide 
chain typically creates a crevice or cavity on the folded protein's surface, where specific amino acid side chains are 
brought together in such a way that they can form a set of noncovalent bonds only with certain ligands. (B) Close-up 
view of an actual binding site showing the hydrogen bonds and an electrostatic interaction formed between a 
protein and its ligand (in this example, the bound ligand is cyclic AMP, shown in dark yellow). 


Although the atoms buried in the interior of a protein have no direct con- 
tact with the ligand, they provide an essential framework that gives the 
surface its contours and chemical properties. Even tiny changes to the 
amino acids in the interior of a protein can change the protein's three- 
dimensional shape and destroy its function. 


Humans Produce Billions of Different Antibodies, Each 
with a Different Binding Site 


All proteins must bind to specific ligands to carry out their various func- 
tions. For antibodies, the universe of possible ligands is limitless and 
includes molecules found on bacteria, viruses, and other agents of 
infection. How does the body manage to produce antibodies capable of 
recognizing and binding tightly to such a diverse collection of ligands? 


Antibodies are immunoglobulin proteins produced by the immune sys- 
tem in response to foreign molecules, especially those on the surface of 
an invading microorganism. Each antibody binds to a particular target 
molecule extremely tightly, either inactivating the target directly or mark- 
ing it for destruction. An antibody recognizes its target molecule, called 
an antigen, with remarkable specificity. And because there are poten- 
tially billions of different antigens we might encounter, humans must be 
able to produce billions of different antibodies—one of which will be spe- 
cific for almost any antigen imaginable. 


Antibodies are Y-shaped molecules with two identical antigen-binding 
sites, each of which is complementary to a small portion of the surface 
of the antigen molecule. A detailed examination of antibody structure 
reveals that the antigen-binding sites are formed from several loops 
of polypeptide chain that protrude from the ends of a pair of closely 
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Figure 4-33 An antibody is Y-shaped and has two identical antigen-binding sites, one on each arm of the Y. 
(A) Schematic drawing of a typical antibody molecule. The protein is composed of four polypeptide chains (two 
identical heavy chains and two identical, smaller light chains), stabilized and held together by disulfide bonds (red). 
Each chain is made up of several similar domains, here shaded with blue, for the variable domains, or gray, for the 
constant domains. The antigen-binding site is formed where a heavy-chain variable domain (Vy) and a light-chain 
variable domain (V1) come close together. These are the domains that differ most in their amino acid sequence in 
different antibodies—hence their name. (B) Ribbon drawing of a single light chain showing that the most variable 
parts of the polypeptide chain (orange) extend as loops at one end of the variable domain (V1) to form half of one 
antigen-binding site of the antibody molecule shown in (A). Note that both the constant and variable domains are 
composed of a sandwich of two antiparallel B sheets connected by a disulfide bond (red). 


juxtaposed protein domains (Figure 4-33). The amino acid sequence in 
these loops can vary greatly without altering the basic structure of the 
antibody. An enormous diversity of antigen-binding sites can therefore 
be generated by changing only the length and amino acid sequence of 
these “hypervariable loops,” which is how the wide variety of different 
antibodies is formed (Movie 4.7). 


With their unique combination of specificity and diversity, antibodies are 
not only indispensable for fighting off infections, they are also invaluable 
in the laboratory, where they can be used to identify, purify, and study 
other molecules (Panel 4-2, pp. 140-141). 


Enzymes Are Powerful and Highly Specific Catalysts 


For many proteins, binding to another molecule is their main function. 
An actin molecule, for example, need only associate with other actin 
molecules to form a filament. There are proteins, however, for which 
ligand binding is simply a necessary first step in their function. This is the 
case for the large and very important class of proteins called enzymes. 
These remarkable molecules are responsible for nearly all of the chemical 
transformations that occur in cells. Enzymes bind to one or more ligands, 
called substrates, and convert them into chemically modified products, 


139 


140 


Antibodies are proteins 
that bind very tightly to 
their targets (antigens). 
They are produced in 
vertebrates as a defense 
against infection. Each 
antibody molecule is 
made of two identical 
light chains and two 
identical heavy chains. 
Its two antigen-binding 
sites are therefore 
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can make billions of 
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molecules, each with a 
distinct antigen-binding 
site. Each antibody 
recognizes its antigen 
with great specificity. 
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Special proteins in 
blood kill antibody- 
coated bacteria 
or viruses. 


Antibody-—antigen 
aggregates are ingested 
by phagocytic cells. 


identical. (See Figure 4-33). 


B CELLS PRODUCE ANTIBODIES 


Antibodies are made by a class of white blood cells called B 
lymphocytes, or B cells. Each resting B cell carries a different 
membrane-bound antibody molecule on its surface that serves 
as a receptor for recognizing a specific antigen. When antigen 
binds to this receptor, the B cell is stimulated to divide and to 
secrete large amounts of the same antibody in a soluble form. 
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The B cell is stimulated both to proliferate and to make 
and secrete more of the same antibody. 


RAISING ANTIBODIES IN ANIMALS 


Antibodies can be made in the laboratory by injecting an animal 
(usually a mouse, rabbit, sheep, or goat) with antigen A. 
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inject antigen A 


take blood later 


Repeated injections of the same antigen at intervals of several 
weeks stimulate specific B cells to secrete large amounts of 
anti-A antibodies into the bloodstream. 
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Because many different B cells are stimulated by antigen A, the 
blood will contain a variety of anti-A antibodies, each of which 
binds A in a slightly different way. 
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USING ANTIBODIES TO PURIFY MOLECULES 
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MONOCLONAL ANTIBODIES USING ANTIBODIES AS MOLECULAR TAGS 


Large quantities of a single type of antibody 
molecule can be obtained by fusing a B cell Y + 
(taken from an animal injected with antigen A) = 
with a tumor cell. The resulting hybrid cell 
divides indefinitely and secretes anti-A 
antibodies of a single (monoclonal) type. 
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antibody. 
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injected with antigen 
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FUSE ANTIBODY-SECRETING 
B CELL WITH TUMOR CELL 


Hybrid cell 
makes and 
secretes anti-A 
antibody and 
divides 
indefinitely. 


Fluorescent antibody binds to 
antigen A in tissue and is detected 
in a fluorescence microscope. The 
antigen here is pectin in the cell 
walls of a slice of plant tissue. 


MICROSCOPIC DETECTION 


Gold-labeled antibody binds to 
antigen A in tissue and is detected 
in an electron microscope. The 
antigen is pectin in the cell wall 
of a single plant cell. 


Antigen A is 
separated from 
other molecules 
by electrophoresis. 


BIOCHEMICAL DETECTION 


Incubation with the 

labeled antibodies 

that bind to antigen A 

allows the position of the 
< antigen to be determined. 


Labeled second antibody 
(blue) binds to first 


Note: In all cases, the sensitivity can 


be greatly increased by using multiple 


layers of antibodies. This “sandwich” 
method enables smaller numbers of 
antigen molecules to be detected. 
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Figure 4—34 Enzymes convert substrates Ai N 
to products while remaining unchanged 
themselves. Each enzyme has a site to ene enzyme 
which substrate molecules bind, forming 


an enzyme-substrate complex. There, a substrate- V1 
covalent bond making and/or breaking binding site | CATALYSIS” | 
reaction occurs, generating an enzyme- @ & = a 


product complex. The product is then 


| A allows h bind molecule A enzyme- enzyme- molecule B 

re Case , allowing the enzyme to bin (substrate) substrate product (product) 
additional substrate molecules and repeat complex complex 

the reaction. An enzyme thus serves as 

a catalyst, and it usually forms or breaks doing this over and over again without themselves being changed 


a single covalent bond in a substrate 


fear (Figure 4-34). Thus, enzymes act as catalysts that permit cells to make or 
molecule. 


break covalent bonds at will. This catalysis of organized sets of chemical 
reactions by enzymes creates and maintains all cell components, making 
life possible. 


Enzymes can be grouped into functional classes based on the chemical 
reactions they catalyze (Table 4—1). Each type of enzyme is highly spe- 
cific, catalyzing only a single type of reaction. Thus, hexokinase adds a 
phosphate group to D-glucose but not to its optical isomer L-glucose; the 
blood-clotting enzyme thrombin cuts one type of blood-clotting protein 
between a particular arginine and its adjacent glycine and nowhere else. 
As discussed in detail in Chapter 3, enzymes often work in sets, with the 
product of one enzyme becoming the substrate for the next. The result 
is an elaborate network of metabolic pathways that provides the cell with 
energy and generates the many large and small molecules that the cell 
needs. 


Enzymes Greatly Accelerate the Speed of Chemical 
Reactions 

The affinities of enzymes for their substrates, and the rates at which they 
convert bound substrate to product, vary widely from one enzyme to 


another. Both values can be determined experimentally by mixing purified 
enzymes and substrates together in a test tube. At a low concentration 


TABLE 4-1 SOME COMMON FUNCTIONAL CLASSES OF ENZYMES 


Hydrolase General term for enzymes that catalyze a hydrolytic cleavage reaction 

Nuclease Breaks down nucleic acids by hydrolyzing bonds between nucleotides 

Protease Breaks down proteins by hydrolyzing peptide bonds between amino acids 

Ligase Joins two molecules together; DNA ligase joins two DNA strands together end-to-end 

Isomerase Catalyzes the rearrangement of bonds within a single molecule 

Polymerase Catalyzes polymerization reactions such as the synthesis of DNA and RNA 

Kinase Catalyzes the addition of phosphate groups to molecules. Protein kinases are an important group of 
kinases that attach phosphate groups to proteins 

Phosphatase Catalyzes the hydrolytic removal of a phosphate group from a molecule 

Oxido-reductase General name for enzymes that catalyze reactions in which one molecule is oxidized while the other is 


reduced. Enzymes of this type are often called oxidases, reductases, or dehydrogenases 


ATPase Hydrolyzes ATP. Many proteins have an energy-harnessing ATPase activity as part of their function, 
including motor proteins such as myosin (discussed in Chapter 17) and membrane transport proteins such 
as the Na* pump (discussed in Chapter 12) 


Enzyme names typically end in “-ase,” with the exception of some enzymes, such as pepsin, trypsin, thrombin, lysozyme, and so on, 
which were discovered and named before the convention became generally accepted, at the end of the nineteenth century. The 
name of an enzyme usually indicates the nature of the reaction catalyzed. For example, citrate synthase catalyzes the synthesis of 
citrate by a reaction between acetyl CoA and oxaloacetate. 
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of substrate, the amount of enzyme-substrate complex—and the rate at 
which product is formed—will depend solely on the concentration of the 
substrate. If the concentration of substrate added is large enough, how- 
ever, all of the enzyme molecules will be filled with substrate. When this 
happens, the rate of product formation depends on how rapidly the sub- 
strate molecule can undergo the reaction that will convert it to product. 
At this point, the enzymes are working as fast as they can, a value termed 
Vmax. For many enzymes operating at Vmax, the number of substrate mol- 
ecules converted to product is in the vicinity of 1000 per second, although 
turnover numbers ranging from 1 to 100,000 molecules per second have 
been measured for different enzymes. Enzymes can speed up the rate of 
a chemical reaction by a factor of a million or more. 


The same type of experiment can be used to gauge how tightly an enzyme 
interacts with its substrate, a value that is related to how much substrate 
it takes to fully saturate a sample of enzyme. Because it is difficult to 
determine at what point an enzyme sample is “fully occupied,” biochem- 
ists instead determine the concentration of substrate at which an enzyme 
works at half its maximum speed. This value, called the Michaelis con- 
stant, Ky, was named after one of the biochemists who worked out 
the relationship (Figure 4-35). In general, a small Ky indicates that a 
substrate binds very tightly to the enzyme—due to a large number of 
noncovalent interactions (see Figure 4-31A); a large Km, on the other 
hand, indicates weak binding. We describe the methods used to analyze 
enzyme performance in How We Know, pp. 144-145. 


Lysozyme Illustrates How an Enzyme Works 


We have discussed how enzymes recognize their substrates. But how do 
they catalyze the chemical conversion of these substrates into products? 
To find out, we take a closer look at lysozyme—an enzyme that acts 
as a natural antibiotic in egg white, saliva, tears, and other secretions. 
Lysozyme severs the polysaccharide chains that form the cell walls of 
bacteria. Because the bacterial cell is under pressure due to intracellular 
osmotic forces, cutting even a small number of polysaccharide chains 
causes the cell wall to rupture and the bacterium to burst, or lyse—hence 
the enzyme’s name. Because lysozyme is a relatively small and stable 
protein, and can be isolated easily in large quantities, it has been studied 
intensively. It was the first enzyme to have its structure worked out at 
the atomic level by x-ray crystallography, and its mechanism of action is 
understood in great detail. 


The reaction catalyzed by lysozyme is a hydrolysis: the enzyme adds a 
molecule of water to a single bond between two adjacent sugar groups in 
the polysaccharide chain, thereby causing the bond to break (see Figure 
2-19). This reaction is energetically favorable because the free energy of 
the severed polysaccharide chains is lower than the free energy of the 
intact chain. However, the pure polysaccharide can sit for years in water 
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Figure 4-35 An enzyme’s performance 
depends on how rapidly it can process its 
substrate. The rate of an enzyme reaction 
(V) increases as the substrate concentration 
increases, until a maximum value (Vmax) is 
reached. At this point, all substrate-binding 
sites on the enzyme molecules are fully 
occupied, and the rate of the reaction is 
limited by the rate of the catalytic process 
on the enzyme surface. For most enzymes, 
the concentration of substrate at which 

the reaction rate is half-maximal (Km) is a 
direct measure of how tightly the substrate 
is bound, with a large value of Ky (a large 
amount of substrate needed) corresponding 
to weak binding. 


QUESTION 4-5 


Use drawings to explain how 

an enzyme (such as hexokinase, 
mentioned in the text) can 
distinguish its normal substrate 
(here, D-glucose) from the optical 
isomer L-glucose, which is not a 
substrate. (Hint: remembering 

that a carbon atom forms four 
single bonds that are tetrahedrally 
arranged and that the optical 
isomers are mirror images of each 
other around such a bond, draw the 
substrate as a simple tetrahedron 
with four different corners and 

then draw its mirror image. Using 
this drawing, indicate why only 

one optical isomer might bind to a 
schematic active site of an enzyme.) 


™ HOW WE KNOW 


MEASURING ENZYME PERFORMANCE 


At first glance, it seems that a cell’s metabolic pathways 
have been pretty well mapped out, with each reac- 
tion proceeding predictably to the next. So why would 
anyone need to know exactly how tightly a particular 
enzyme clutches its substrate or whether it can process 
100 or 1000 substrate molecules every second? 


In reality, metabolic maps merely suggest which path- 
ways a cell might follow as it converts nutrients into 
small molecules, chemical energy, and the larger build- 
ing blocks of life. Like a road map, they do not predict 
the density of traffic under a particular set of conditions; 
that is, which pathways the cell will use when it is starv- 
ing, when it is well fed, when oxygen is scarce, when it 
is stressed, or when it decides to divide. The study of an 
enzyme'’s kinetics—how fast it operates, how it handles 
its substrate, how its activity is controlled—allows us to 
predict how an individual catalyst will perform, and how 
it will interact with other enzymes in a network. Such 
knowledge leads to a deeper understanding of cell biol- 
ogy, and it opens the door to learning how to harness 
enzymes to perform desired reactions, including the 
large-scale production of specific chemicals. 


Speed 


The first step to understanding how an enzyme performs 
involves determining the maximal velocity, Vmax, for the 
reaction it catalyzes. This is accomplished by measur- 
ing, in a test tube, how rapidly the reaction proceeds in 
the presence of a fixed amount of enzyme and differ- 
ent concentrations of substrate (Figure 4-36A): the rate 
should increase as the amount of substrate rises until 
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the reaction reaches its Vmax (Figure 4-36B). The veloc- 
ity of the reaction can be measured by monitoring either 
how quickly the substrate is consumed or how rapidly 
the product accumulates. In many cases, the appear- 
ance of product or the disappearance of substrate can be 
observed directly with a spectrophotometer. This instru- 
ment detects the presence of molecules that absorb light 
at a particular wavelength; NADH, for example, absorbs 
light at 340 nm, while its oxidized counterpart, NAD*, 
does not. So, a reaction that generates NADH (by reduc- 
ing NAD*) can be monitored by following the formation 
of NADH at 340 nm in a spectrophotometer. 


Looking at the plot in Figure 4-36B, however, it is dif- 
ficult to determine the exact value of Vmax, as it is not 
clear where the reaction rate will reach its plateau. To 
get around this problem, the data are converted to their 
reciprocals and graphed in a “double-reciprocal plot,” 
where the inverse of the velocity (1/v) appears on the 
y axis and the inverse of the substrate concentration 
(1/[S]) on the x axis (Figure 4-36C). This graph yields 
a straight line whose y intercept (the point where the 
line crosses the y axis) represents 1/Vmax and whose x 
intercept corresponds to -1/Km. These values are then 
converted to values for Vmax and Km. 


Control 


Substrates are not the only molecules that can influ- 
ence how well or how quickly an enzyme works. In 
many cases, products, substrate lookalikes, inhibitors, 
and other small molecules can also increase or decrease 
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Figure 4-36 Measured reaction rates are plotted to determine the Vmax and Km of an enzyme-catalyzed reaction. (A) Test 
tubes containing a series of increasing substrate concentrations are prepared, a fixed amount of enzyme is added, and initial reaction 
rates (velocities) are determined. (B) The initial velocities (v) plotted against the substrate concentrations [S] give a curve described 

by the general equation y = ax/(b + x). Substituting our kinetic terms, the equation becomes v = VmaxlS]/(Km + [S]), where Vmax is the 
asymptote of the curve (the value of y at an infinite value of x), and Ky is equal to the substrate concentration where v is one-half Vmax- 
This is called the Michaelis-Menten equation, named for the biochemists who provided evidence for this enzymatic relationship. (C) In 
a double-reciprocal plot, 1/v is plotted against 1/[S]. The equation describing this straight line is 1/v = (Km/Vmax)}(1/[S]) + 1/Vmax. When 
1/[S] = 0, the y intercept (1/v) is 1/Vmax. When 1/v = 0, the x intercept (1/[S]) is —1/Ky. Plotting the data this way allows Vmax and Ky to 
be calculated more precisely. By convention, lowercase letters are used for variables (hence v for velocity) and uppercase letters are 
used for constants (hence Vma). 


enzyme activity. Such regulation allows cells to control 
when and how rapidly various reactions occur, a pro- 
cess we discuss in detail in this chapter. 


The effect of an inhibitor on an enzyme'’s activity is mon- 
itored in the same way that we measured the enzyme’s 
kinetics. A curve is first generated showing the velocity 
of the uninhibited reaction between enzyme and sub- 
strate. Additional curves are then produced for reactions 
in which the inhibitor molecule has been included in the 
mix. 


Comparing these curves, with and without inhibitor, can 
also reveal how a particular inhibitor impedes enzyme 
activity. For example, some inhibitors bind to the same 
site on an enzyme as its substrate. These competitive 
inhibitors block enzyme activity by competing directly 
with the substrate for the enzyme’s attention. They 
resemble the substrate enough to tie up the enzyme, 
but they differ enough in structure to avoid getting con- 
verted to product. This blockage can be overcome by 
adding enough substrate so that enzymes are more 
likely to encounter a substrate molecule than an inhib- 
itor molecule. From the kinetic data, we can see that 
competitive inhibitors do not change the Vinax of a reac- 
tion; in other words, add enough substrate and the 
enzyme will encounter mostly substrate molecules and 
will reach its maximum velocity (Figure 4-37). 


Competitive inhibitors can be used to treat patients who 
have been poisoned by ethylene glycol, an ingredient in 
commercially available antifreeze. Although ethylene 
glycol is itself not fatally toxic, a by-product of its metab- 
olism—oxalic acid—can be lethal. To prevent oxalic acid 
from forming, the patient is given a large (though not 
quite intoxicating) dose of ethanol. Ethanol competes 
with the ethylene glycol for binding to alcohol dehydro- 
genase, the first enzyme in the pathway to oxalic acid 
formation. As a result, the ethylene glycol remains mostly 
unmetabolized and is safely eliminated from the body. 
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Other types of inhibitors may interact with sites on the 
enzyme distant from where the substrate binds. Many 
biosynthetic enzymes are regulated by feedback inhibi- 
tion, whereby an enzyme early in a pathway will be shut 
down by a product generated later in the pathway (see, 
for example, Figure 4-43). Because this type of inhibitor 
binds to a separate, regulatory site on the enzyme, the 
substrate can still bind, but it might do so more slowly 
than it would in the absence of inhibitor. Such noncom- 
petitive inhibition is not overcome by the addition of 
more substrate. 


Design 


With the kinetic data in hand, we can use computer 
modeling programs to predict how an enzyme will per- 
form, and even how a cell will respond, when exposed 
to different conditions—such as the addition of a par- 
ticular sugar or amino acid to the culture medium, or 
the addition of a poison or a pollutant. Seeing how a 
cell manages its resources—which pathways it favors 
for dealing with particular biochemical challenges—can 
also suggest strategies for designing better catalysts for 
reactions of medical or commercial importance (e.g., for 
producing drugs or detoxifying industrial waste). Using 
such tactics, bacteria have even been genetically engi- 
neered to produce large amounts of indigo—the dye, 
originally extracted from plants, that makes your blue 
jeans blue. We discuss the methods that enable such 
genetic manipulation in detail in Chapter 10. 


Harnessing the power of cell biology for commercial 
purposes—even to produce something as simple as the 
amino acid tryptophan—is currently a multibillion-dollar 
industry. And, as more genome data come in, presenting 
us with more enzymes to exploit, vats of custom-made 
bacteria are increasingly churning out drugs and chemi- 
cals that represent the biological equivalent of pure gold. 
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substrate binding to an enzyme. (A) The active site of 
the enzyme can bind either the competitive inhibitor or 
the substrate, but not both together. (B) The upper plot 
shows that inhibition by a competitive inhibitor can be 
overcome by increasing the substrate concentration. 
The double-reciprocal plot below shows that the Vmax 
of the reaction is not changed in the presence of the 
competitive inhibitor: the y intercept is identical for 
both the curves. 
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Figure 4-38 Lysozyme cleaves a polysaccharide chain. (A) Schematic view of the enzyme lysozyme (E), which 


catalyzes the cutting of a polysaccharide substrate molecule (S). The enzyme first binds to the polysaccharide to 
form an enzyme-substrate complex (ES), then it catalyzes the cleavage of a specific covalent bond in the backbone 
of the polysaccharide. The resulting enzyme-product complex (EP) rapidly dissociates, releasing the products (P) 


and leaving the enzyme free to act on another substrate molecule. (B) A space-filling model of lysozyme bound to 
a short length of polysaccharide chain prior to cleavage. 


without being hydrolyzed to any detectable degree. This is because there 
is an energy barrier to such reactions, called the activation energy (dis- 
cussed in Chapter 3, pp. 89-90). For a colliding water molecule to break a 
bond linking two sugars, the polysaccharide molecule has to be distorted 
into a particular shape—the transition state—in which the atoms around 
the bond have an altered geometry and electron distribution. To distort 
the polysaccharide in this way requires a large input of energy—which is 
where the enzyme comes in. 


Like all enzymes, lysozyme has a binding site on its surface, termed an 
active site, which is where catalysis takes place. Because its substrate is 
a polymer, lysozyme’s active site is a long groove that cradles six of the 
linked sugars in the polysaccharide chain at the same time. Once this 
enzyme-substrate complex forms, the enzyme cuts the polysaccharide 
by catalyzing the addition of a water molecule to one of its sugar-sugar 
bonds, and the severed chains are then quickly released, freeing the 
enzyme for further cycles of cleavage (Figure 4-38). 


Like any protein binding to its ligand, lysosome recognizes its substrate 
through the formation of multiple noncovalent bonds (see Figure 4-32). 
However, lysozyme holds its polysaccharide substrate in such a way that 
one of the two sugars involved in the bond to be broken is distorted from 
its normal, most stable conformation. Conditions are thereby created 
in the microenvironment of the lysozyme active site that greatly reduce 
the activation energy necessary for the hydrolysis to take place (Figure 
4-39). Because the activation energy is so low, the overall chemical reac- 
tion—from the initial binding of the polysaccharide to the final release of 
the severed chains—occurs many millions of times faster in the presence 
of lysozyme than it would in its absence. In the absence of lysozyme, the 
energy of random molecular collisions almost never exceeds the acti- 
vation energy required for the reaction to occur; the hydrolysis of such 
polysaccharides thus occurs extremely slowly, if at all. 


Other enzymes use similar mechanisms to lower the activation energies 
and speed up the reactions they catalyze. In reactions involving two or 
more substrates, the active site acts like a template or mold that brings 
the reactants together in the proper orientation for the reaction to occur 
(Figure 4—40A). As we saw for lysozyme, the active site can also con- 
tain precisely positioned chemical groups that speed up the reaction by 
altering the distribution of electrons in the substrates (Figure 4—40B). 


How Proteins Work 

SUBSTRATE PRODUCTS 
This substrate is an oligosaccharide of six sugars, The final products are an oligosaccharide of four sugars 
labeled A through F. Only sugars D and E are shown in detail. (left) and a disaccharide (right), produced by hydrolysis. 
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(H+) to sugar E; Asp 52 is poised to attack the product complex (EP). 


C1 carbon atom of sugar D. 


Figure 4-39 Enzymes bind to, and chemically alter, substrate molecules. In the active site of lysozyme, a 
covalent bond in a polysaccharide molecule is bent and then broken. The top row shows the free substrate and 
the free products. The three lower panels depict sequential events at the enzyme active site, during which a 
sugar-sugar covalent bond is broken. Note the change in the conformation of sugar D in the enzyme-substrate 
complex compared with the free substrate. This conformation favors the formation of the transition state shown 
in the middle panel, greatly lowering the activation energy required for the reaction. The reaction, and the 
structure of lysozyme bound to its product, are shown in Movie 4.8 and Movie 4.9. (Based on D.J. Vocadlo et al., 
Nature 412:835-838, 2001.) 


Binding to the enzyme also changes the shape of the substrate, bending 
bonds so as to drive the bound molecule toward a particular transition 
state (Figure 4—40C). Finally, like lysozyme, many enzymes participate 
intimately in the reaction by briefly forming a covalent bond between 
the substrate and an amino acid side chain in the active site. Subsequent 
steps in the reaction restore the side chain to its original state, so that the 
enzyme remains unchanged after the reaction and can go on to catalyze 
many more reactions. 


Many Drugs Inhibit Enzymes 


Many of the drugs we take to treat or prevent illness work by blocking the 
activity of a particular enzyme. Cholesterol-lowering statins inhibit HMG- 
CoA reductase, an enzyme involved in the synthesis of cholesterol by 
the liver. Methotrexate kills some types of cancer cells by shutting down 
dihydrofolate reductase, an enzyme that produces a compound required 
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Figure 4—40 Enzymes can encourage 

a reaction in several ways. (A) Holding 
reacting substrates together in a precise 
alignment. (B) Rearranging the distribution 
of charge in a reaction intermediate. 

(C) Altering bond angles in the substrate 
to increase the rate of a particular reaction. 
A single enzyme may use any of these 
mechanisms in combination. 


Figure 4—41 Retinal and heme are 
required for the function of certain 
proteins. (A) The structure of retinal, the 
light-sensitive molecule covalently attached 
to the rhodopsin protein in our eyes. (B) The 
structure of a heme group, shown with the 
carbon-containing heme ring colored red 
and the iron atom at its center in orange. 

A heme group is tightly, but noncovalently, 
bound to each of the four polypeptide 
chains in hemoglobin, the oxygen-carrying 
protein whose structure was shown in 
Figure 4-24. 


(A) enzyme binds to two (B) binding of substrate (C) enzyme strains the 
substrate molecules and to enzyme rearranges bound substrate 
orients them precisely to electrons in the substrate, molecule, forcing it 
encourage a reaction to creating partial negative toward a transition 
occur between them and positive charges state that favors a 

that favor a reaction reaction 


for DNA synthesis during cell division. Because cancer cells have lost 
important intracellular control systems, some of them are unusually sen- 
sitive to treatments that interrupt chromosome replication, making them 
susceptible to methotrexate. 


Pharmaceutical companies often develop drugs by first using automated 
methods to screen massive libraries of compounds to find chemicals that 
are able to inhibit the activity of an enzyme of interest. They can then 
chemically modify the most promising compounds to make them even 
more effective, enhancing their binding affinity, specificity for the target 
enzyme, and persistence in the human body. As we discuss in Chapter 
20, the anticancer drug Gleevec® was designed to specifically inhibit an 
enzyme whose aberrant behavior is required for the growth of a type of 
cancer called chronic myeloid leukemia. The drug binds tightly in the 
substrate-binding pocket of that enzyme, blocking its activity. 


Tightly Bound Small Molecules Add Extra Functions to 
Proteins 


Although the precise order of their amino acids gives proteins their shape 
and functional versatility, sometimes amino acids by themselves are not 
enough for a protein to do its job. Just as we use tools to enhance and 
extend the capabilities of our hands, so proteins often employ small, 
nonprotein molecules to perform functions that would be difficult or 
impossible using amino acids alone. Thus, the photoreceptor protein 
rhodopsin, which is the light-sensitive protein made by the rod cells in 
the retina of our eyes, detects light by means of a small molecule, retinal, 
which is attached to the protein by a covalent bond to a lysine side chain 
(Figure 4—41A). Retinal changes its shape when it absorbs a photon 
of light, and this change is amplified by rhodopsin to trigger a cascade 
of reactions that eventually leads to an electrical signal being carried to 
the brain. 


COOH COOH 
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Another example of a protein that contains a nonprotein portion essen- 
tial for its function is hemoglobin (see Figure 4-24). A molecule of 
hemoglobin carries four noncovalently bound heme groups, ring-shaped 
molecules each with a single central iron atom (Figure 4—41B). Heme 
gives hemoglobin—and blood—its red color. By binding reversibly to dis- 
solved oxygen gas through its iron atom, heme enables hemoglobin to 
pick up oxygen in the lungs and release it in tissues that need it. 


Enzymes, too, make use of nonprotein molecules: they frequently have a 
small molecule or metal atom associated with their active site that assists 
with their catalytic function. Carboxypeptidase, an enzyme that cuts poly- 
peptide chains, carries a tightly bound zinc ion in its active site. During 
the cleavage of a peptide bond by carboxypeptidase, the zinc ion forms 
a transient bond with one of the substrate atoms, thereby assisting the 
hydrolysis reaction. In other enzymes, a small organic molecule—often 
referred to as a coenzyme—serves a similar purpose. Biotin, for exam- 
ple, is found in enzymes that transfer a carboxyl group (-COO’) from one 
molecule to another (see Figure 3-38). Biotin participates in these reac- 
tions by forming a covalent bond to the -COO group to be transferred, 
thereby producing an activated carrier (see Table 3-2, p. 109). This small 
molecule is better suited for this function than any of the amino acids 
used to make proteins. 


Because biotin cannot be synthesized by humans, it must be provided in 
the diet; thus biotin is classified as a vitamin. Other vitamins are similarly 
needed to make small molecules that are essential components of our 
proteins; vitamin A, for example, is needed in the diet to make retinal, the 
light-sensitive part of rhodopsin. 


HOW PROTEINS ARE CONTROLLED 


Thus far, we have examined how binding to other molecules allows pro- 
teins to perform their specific functions. But inside the cell, most proteins 
and enzymes do not work continuously, or at full speed. Instead, their 
activities are regulated in a coordinated fashion so the cell can main- 
tain itself in an optimal state, producing only those molecules it requires 
to thrive under current conditions. By coordinating not only when—and 
how vigorously—proteins perform, but also where in the cell they act, the 
cell ensures that it does not deplete its energy reserves by accumulating 
molecules it does not need or waste its stockpiles of critical substrates. 
We now consider how cells control the activity of their enzymes and 
other proteins. 


The regulation of protein activity occurs at many levels. At the most fun- 
damental level, the cell controls the amount of each protein it contains. 
It can do so by controlling the expression of the gene that encodes that 
protein (discussed in Chapter 8). It can also regulate the rate at which 
the protein is degraded (discussed in Chapter 7). The cell also controls 
protein activities by confining the participating proteins to particular sub- 
cellular compartments. Some of these compartments are enclosed by 
membranes (as discussed in Chapters 11, 12, 14, and 15); others are cre- 
ated by the proteins that are drawn there, as we discuss shortly. Finally, 
the activity of an individual protein can be rapidly adjusted at the level of 
the protein itself. 


All of these mechanisms rely on the ability of proteins to interact with 
other molecules—including other proteins. These interactions can cause 
proteins to adopt different conformations, and thereby alter their func- 
tion, as we see next. 


How Proteins Are Controlled 
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Figure 4—42 Feedback inhibition 
regulates the flow through biosynthetic 
pathways. B is the first metabolite in a 
pathway that gives the end product Z. 

Z inhibits the first enzyme that is specific 
to its own synthesis and thereby limits its 
own concentration in the cell. This form 
of negative regulation is called feedback 
inhibition. 


QUESTION 4-6 


Consider the drawing in Figure 
4—42. What will happen if, instead of 
the indicated feedback, 

A. feedback inhibition from 

Z affects the step B — C only? 

B. feedback inhibition from 

Z affects the step Y — Z only? 

C. Zis a positive regulator of the 
step B — X? 

D. Zis a positive regulator of the 
step B — C? 

For each case, discuss how useful 
these regulatory schemes would be 
for a cell. 


Figure 4-43 Feedback inhibition at 
multiple points regulates connected 
metabolic pathways. The biosynthetic 
pathways for four different amino acids in 
bacteria are shown, starting from the amino 
acid aspartate. The red lines indicate points 
at which products feed back to inhibit 
enzymes and the blank boxes represent 
intermediates in each pathway. In this 
example, each amino acid controls the first 
enzyme specific to its own synthesis, thereby 
limiting its own concentration and avoiding 
a wasteful buildup of intermediates. Some 
of the products also separately inhibit the 
initial set of reactions common to all the 
syntheses. Three different enzymes catalyze 
the initial reaction from aspartate to aspartyl 
phosphate, and each of these enzymes is 
inhibited by a different product. 


The Catalytic Activities of Enzymes Are Often Regulated 
by Other Molecules 


A living cell contains thousands of different enzymes, many of which 
are operating at the same time in the same small volume of the cytosol. 
By their catalytic action, enzymes generate a complex web of metabolic 
pathways, each composed of chains of chemical reactions in which the 
product of one enzyme becomes the substrate of the next. In this maze of 
pathways, there are many branch points where different enzymes com- 
pete for the same substrate. The system is so complex that elaborate 
controls are required to regulate when and how rapidly each reaction 
occurs. 


A common type of control occurs when a molecule other than a substrate 
specifically binds to an enzyme at a special regulatory site, altering the 
rate at which the enzyme converts its substrate to product. In feedback 
inhibition, for example, an enzyme acting early in a reaction pathway 
is inhibited by a molecule produced later in that pathway. Thus, when- 
ever large quantities of the final product begin to accumulate, the product 
binds to an earlier enzyme and slows down its catalytic action, limit- 
ing further entry of substrates into that reaction pathway (Figure 4—42). 
Where pathways branch or intersect, there are usually multiple points 
of control by different final products, each of which regulates its own 
synthesis (Figure 4-43). Feedback inhibition can work almost instanta- 
neously and is rapidly reversed when product levels fall. 
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Feedback inhibition is a form of negative regulation: it prevents an 
enzyme from acting. Enzymes can also be subject to positive regulation, 
in which the enzyme’s activity is stimulated by a regulatory molecule 
rather than being suppressed. Positive regulation occurs when a product 
in one branch of the metabolic maze stimulates the activity of an enzyme 
in another pathway. But how do these regulatory molecules change an 
enzyme'’s activity? 


Allosteric Enzymes Have Two or More Binding Sites That 
Influence One Another 


Feedback inhibition was initially puzzling to those who discovered it, in 
part because the regulatory molecule often has a shape that is totally dif- 
ferent from the shape of the enzyme’s preferred substrate. Indeed, when 
this form of regulation was discovered in the 1960s, it was termed allos- 
tery (from the Greek allo, “other,” and stere, “solid” or “shape”). Given the 
numerous, specific, noncovalent interactions that allow enzymes to inter- 
act with their substrates within the active site, it seemed likely that these 
regulatory molecules were binding somewhere else on the surface of 
the protein. As more was learned about feedback inhibition, researchers 
realized that many enzymes must contain at least two different binding 
sites: an active site that recognizes the substrates and one or more sites 
that recognize regulatory molecules. These sites must somehow “com- 
municate” to allow the catalytic events at the active site to be influenced 
by the binding of the regulatory molecule at a separate location. 


The interaction between sites that are located in different regions on a 
protein molecule is now known to depend on a conformational change 
in the protein. The binding of a ligand to one of the sites causes a shift 
in the protein's structure from one folded shape to a slightly different 
folded shape, and this alters the shape of a second binding site that can 
be far away. Many enzymes have two conformations that differ in activ- 
ity, each of which can be stabilized by the binding of a different ligand. 
During feedback inhibition, for example, the binding of an inhibitor at a 
regulatory site on a protein causes the protein to spend more time in a 
conformation in which its active site—located elsewhere in the protein— 
becomes less accommodating to the substrate molecule (Figure 4—44). 


As schematically illustrated in Figure 4-45A, many—if not most—protein 
molecules are allosteric: they can adopt two or more slightly different 
conformations, and their activity can be regulated by a shift from one 
to another. This is true not only for enzymes, but also for many other 
proteins as well. The chemistry involved here is extremely simple in con- 
cept. Because each protein conformation will have somewhat different 
contours on its surface, the protein’s binding sites for ligands will be 
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Figure 4-44 Feedback inhibition triggers 
a conformational change in an enzyme. 
Aspartate transcarbamoylase from E. coli, 
a large multisubunit enzyme used in early 
studies of allosteric regulation, catalyzes 
an important reaction that begins the 
synthesis of the pyrimidine ring of 

C, U, and T nucleotides (see Panel 2-7, 

pp. 78-79). One of the final products of 
this pathway, cytidine triphosphate (CTP), 
binds to the enzyme to turn it off whenever 
CTP is plentiful. This diagram shows the 
conformational change that occurs when 
the enzyme is turned off by CTP binding to 
its four regulatory sites, which are distinct 
from the active site where the substrate 
binds. Figure 4-10 shows the structure of 
aspartate transcarbamoylase as seen from 
the top. This figure depicts the enzyme as 
seen from the side. 
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Figure 4—45 The binding of a regulatory 
ligand can change the equilibrium 
between two protein conformations. 

(A) Schematic diagram of a hypothetical, 
allosterically regulated enzyme for which a 
rise in the concentration of ADP molecules 
(red wedges) increases the rate at which 
the enzyme catalyzes the oxidation of 
sugar molecules (blue hexagons). 

(B) Due to thermal motions, the enzyme 
will spontaneously interconvert between 
the open (inactive) and closed (active) 
conformations shown in (A). But when 
ADP is absent, only a small fraction of 

the enzyme molecules will be present 

in the active conformation at any given 
time. As illustrated, most remain in the 
inactive conformation. (C) Because ADP 
can bind to the protein only in its closed, 
active conformation, an increase in ADP 
concentration locks nearly all of the enzyme 
molecules in the active form—an example 
of positive regulation. In cells, rising 
concentrations of ADP signal a depletion 
of ATP reserves; increased oxidation of 
sugars—in the presence of ADP—thus 
provides more energy for the synthesis of 
ATP from ADP. 
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altered when the protein changes shape. Each ligand will stabilize the 
conformation that it binds to most strongly. Therefore, at high enough 
concentrations, a ligand will tend to “switch” the population of proteins 
to the conformation that it favors (Figure 4—45B and C). 


Phosphorylation Can Control Protein Activity by Causing 
a Conformational Change 


Another method that eukaryotic cells use to regulate protein activity 
involves attaching a phosphate group covalently to one or more of the 
protein's amino acid side chains. Because each phosphate group carries 
two negative charges, the enzyme-catalyzed addition of a phosphate 
group can cause a conformational change by, for example, attracting a 
cluster of positively charged amino acid side chains from somewhere else 
in the same protein. This structural shift can, in turn, affect the binding 
of ligands elsewhere on the protein surface, thereby altering the protein’s 
activity. Removal of the phosphate group by a second enzyme will return 
the protein to its original conformation and restore its initial activity. 


Reversible protein phosphorylation controls the activity of many types 
of proteins in eukaryotic cells. This form of regulation is used so exten- 
sively that more than one-third of the 10,000 or so proteins in a typical 
mammalian cell are phosphorylated at any one time. The addition and 
removal of phosphate groups from specific proteins often occur in 
response to signals that specify some change in a cell's state. For exam- 
ple, the complicated series of events that takes place as a eukaryotic cell 
divides is timed largely in this way (discussed in Chapter 18). And many 
of the intracellular signaling pathways activated by extracellular signals 
depend on a network of protein phosphorylation events (discussed in 
Chapter 16). 


Protein phosphorylation involves the enzyme-catalyzed transfer of the 
terminal phosphate group of ATP to the hydroxyl group on a serine, thre- 
onine, or tyrosine side chain of the protein. This reaction is catalyzed 
by a protein kinase. The reverse reaction—removal of the phosphate 
group, or dephosphorylation—is catalyzed by a protein phosphatase 
(Figure 4—46A). Phosphorylation can either stimulate protein activity or 
inhibit it, depending on the protein involved and the site of phosphoryla- 
tion (Figure 4—46B). Cells contain hundreds of different protein kinases, 
each responsible for phosphorylating a different protein or set of pro- 
teins. Cells also contain a smaller set of different protein phosphatases; 
some of these are highly specific and remove phosphate groups from only 
one or a few proteins, whereas others act on a broad range of proteins. 
The state of phosphorylation of a protein at any moment in time, and thus 


Figure 4—46 Protein phosphorylation is a very common mechanism 
for regulating protein activity. Many thousands of proteins in a typical 
eukaryotic cell are modified by the covalent addition of one or more 
phosphate groups. (A) The general reaction, shown here, entails transfer 
of a phosphate group from ATP to an amino acid side chain of the target 
protein by a protein kinase. Removal of the phosphate group is catalyzed 
by a second enzyme, a protein phosphatase. In this example, the 
phosphate is added to a serine side chain; in other cases, the phosphate 
is instead linked to the -OH group of a threonine or tyrosine side chain. 
(B) Phosphorylation can either increase or decrease the protein's activity, 
depending on the site of phosphorylation and the structure of the protein. 


its activity, will depend on the relative activities of the protein kinases 
and phosphatases that act on it. 


Phosphorylation can take place in a continuous cycle, in which a phos- 
phate group is rapidly added to—and rapidly removed from—a particular 
side chain. Such phosphorylation cycles allow proteins to switch quickly 
from one state to another. The more swiftly the cycle is “turning,” the 
faster the concentration of a phosphorylated protein can change in 
response to a sudden stimulus. Although keeping the cycle turning costs 
energy—because ATP is hydrolyzed with each phosphorylation—many 
enzymes in the cell undergo this speedy, cyclic form of regulation. 


Covalent Modifications Also Control the Location and 
Interaction of Proteins 


Phosphorylation can do more than control a protein’s activity; it can 
create docking sites where other proteins can bind, thus promoting the 
assembly of proteins into larger complexes. For example, when extracel- 
lular signals stimulate a class of cell-surface, transmembrane proteins 
called receptor tyrosine kinases, they cause the receptor proteins to phos- 
phorylate themselves on certain tyrosines. The phosphorylated tyrosines 
then serve as docking sites for the binding and activation of a set of 
intracellular signaling proteins, which transmits the message to the cell 
interior and changes the behavior of the cell (see Figure 16-29). 


Phosphorylation is not the only form of covalent modification that can 
affect a protein’s function. Many proteins are modified by the addition of 
an acetyl group to a lysine side chain, including the histones discussed 
in Chapter 5. And the addition of the fatty acid palmitate to a cysteine 
side chain drives a protein to associate with cell membranes. Attachment 
of ubiquitin, a 76-amino-acid polypeptide, can target a protein for deg- 
radation, as we discuss in Chapter 7. More than 100 types of covalent 
modifications can occur in the cell, each playing its own role in regulating 
protein function. Each of these modifying groups is enzymatically added 
or removed depending on the needs of the cell. 


A large number of proteins are modified on more than one amino acid 
side chain. The p53 protein, which plays a central part in controlling how 
a cell responds to DNA damage and other stresses, can be covalently 
modified at 20 sites (Figure 4-47). Because an enormous number of com- 
binations of these 20 modifications is possible, the protein’s behavior can 
in principle be altered in a huge number of ways. 
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Figure 4—47 The modification of 

a protein at multiple sites can 
control the protein’s behavior. This 
diagram shows some of the covalent 
modifications that control the activity 
and degradation of p53, a protein 

of nearly 400 amino acids. p53 is an 
important transcription regulator that 
regulates a cell's response to damage 
(discussed in Chapter 18). Not all of 
these modifications will be present 
at the same time. Colors along the 
body of the protein represent distinct 


of the modifications shown are located 
within relatively unstructured regions 
of the polypeptide chain. 


protein domains, including one that 
binds to DNA (green) and one that 
COOH activates gene transcription (pink). All 
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QUESTION 4-7 


Either protein phosphorylation or 
the binding of a nucleotide (such as 
ATP or GTP) can be used to regulate 
a protein's activity. What do you 
suppose are the advantages of each 
form of regulation? 


Figure 4—48 Many different GTP-binding 
proteins function as molecular switches. 
A GTP-binding protein requires the 
presence of a tightly bound GTP molecule 
to be active. The active protein can shut 
itself off by hydrolyzing its bound GTP 

to GDP and inorganic phosphate (P)), 
which converts the protein to an inactive 
conformation. To reactivate the protein, 
the tightly bound GDP must dissociate. As 
explained in Chapter 16, this dissociation is 
a slow step that can be greatly accelerated 
by important regulatory proteins called 
guanine nucleotide exchange factors 
(GEFs). As indicated, once the GDP 
dissociates, a molecule of GTP quickly 
replaces it, returning the protein to its 
active conformation. 
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The set of covalent modifications that a protein contains at any moment 
constitutes an important form of regulation. The attachment or removal 
of these modifying groups can change a protein’s activity or stability, its 
binding partners, or its location inside the cell. Covalent modifications 
thus enable the cell to make optimal use of the proteins it produces, and 
they allow the cell to respond rapidly to changes in its environment. 


Regulatory GTP-Binding Proteins Are Switched On and 
Off by the Gain and Loss of a Phosphate Group 


Eukaryotic cells have a second way to regulate protein activity by phos- 
phate addition and removal. In this case, however, the phosphate is not 
enzymatically transferred from ATP to the protein. Instead, the phosphate 
is part of a guanine nucleotide—guanosine triphosphate (GTP)—that 
binds tightly various types of GTP-binding proteins. These proteins act 
as molecular switches: they are in their active conformation when GTP is 
bound, but they can hydrolyze this GTP to GDP—which releases a phos- 
phate and flips the protein to an inactive conformation (Movie 4.10). As 
with protein phosphorylation, this process is reversible: the active con- 
formation is regained by dissociation of the GDP, followed by the binding 
of a fresh molecule of GTP (Figure 4—48). 


Hundreds of GTP-binding proteins function as molecular switches in 
cells. The dissociation of GDP and its replacement by GTP, which turns the 
switch on, is often stimulated in response to cell signals. The GTP-binding 
proteins activated in this way in turn bind to other proteins to regulate 
their activities. The crucial role GTP-binding proteins play in intracellular 
signaling pathways is discussed in detail in Chapter 16. 


ATP Hydrolysis Allows Motor Proteins to Produce 
Directed Movements in Cells 


We have seen how conformational changes in proteins play a central 
part in enzyme regulation and cell signaling. But conformational changes 
also play another important role in the operation of the eukaryotic cell: 
they enable certain specialized proteins to drive directed movements of 
cells and their components. These motor proteins generate the forces 
responsible for muscle contraction and most other eukaryotic cell move- 
ments. They also power the intracellular movements of organelles and 
macromolecules. For example, they help move chromosomes to opposite 
ends of the cell during mitosis (discussed in Chapter 18), and they move 
organelles along cytoskeletal tracks (discussed in Chapter 17). 
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Figure 4-49 Changes in conformation can allow a protein to 
“walk” along a cytoskeletal filament. This protein cycles between 
three different conformations (A, B, and C) as it moves along the 
filament. But, without an input of energy to drive its movement in a 
single direction, the protein can only wander randomly back and forth, 
ultimately getting nowhere. 


But how can the changes in shape experienced by proteins be used to 
generate such orderly movements? A protein that is required to walk 
along a cytoskeletal fiber, for example, can move by undergoing a series 
of conformational changes. However, with nothing to drive these changes 
in one direction or the other, the shape changes will be reversible and the 
protein will wander randomly back and forth (Figure 4—49). 


To force the protein to proceed in a single direction, the conformational 
changes must be unidirectional. To achieve such directionality, one of the 
steps must be made irreversible. For most proteins that are able to move 
in a single direction for long distances, this irreversibility is achieved by 
coupling one of the conformational changes to the hydrolysis of an ATP 
molecule that is tightly bound to the protein—which is why motor pro- 
teins are also ATPases. A great deal of free energy is released when ATP is 
hydrolyzed, making it very unlikely that the protein will undergo a reverse 
shape change—as required for moving backward. (Such a reversal would 
require that the ATP hydrolysis be reversed, by adding a phosphate mol- 
ecule to ADP to form ATP.) As a consequence, the protein moves steadily 
forward (Figure 4-50). 


Many different motor proteins generate directional movement by using 
the hydrolysis of a tightly bound ATP molecule to drive an orderly series 
of conformational changes. These movements can be rapid: the muscle 
motor protein myosin walks along actin filaments at about 6 um/sec dur- 
ing muscle contraction (discussed in Chapter 17). 


Proteins Often Form Large Complexes That Function as 
Machines 


As proteins progress from being small, with a single domain, to being 
larger with multiple domains, the functions they can perform become 
more elaborate. The most complex tasks are carried out by large protein 
assemblies formed from many protein molecules. Now that it is possible 
to reconstruct biological processes in cell-free systems in a test tube, it 
is clear that each central process in a cell—including DNA replication, 
gene transcription, protein synthesis, vesicle budding, and transmem- 
brane signaling—is catalyzed by a highly coordinated, linked set of many 
proteins. For most such protein machines, the hydrolysis of bound 
nucleoside triphosphates (ATP or GTP) drives an ordered series of con- 
formational changes in some of the individual protein subunits, enabling 
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Figure 4-50 A schematic model of how a motor protein uses ATP hydrolysis to move in one direction along a cytoskeletal 

filament. An orderly transition among three conformations is driven by the hydrolysis of a bound ATP molecule and the release of 
the products, ADP and inorganic phosphate (Pi). Because these transitions are coupled to the hydrolysis of ATP, the entire cycle is 
essentially irreversible. Through repeated cycles, the protein moves continuously to the right along the filament. 
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Figure 4-51 “Protein machines” can 
carry out complex functions. These 
machines are made of individual proteins 
that collaborate to perform a specific task 
(Movie 4.11). The movement of proteins is 
often coordinated and made unidirectional 
by the hydrolysis of a bound nucleotide 
such as ATP. Conformational changes of 
this type are especially useful to the cell 

if they occur in a large protein assembly 

in which the activities of several different 
protein molecules can be coordinated by 
the movements within the complex, as 
schematically illustrated here. 


QUESTION 4-8 


Explain why the hypothetical 
enzymes in Figure 4-51 have a 
great advantage in opening the 
safe if they work together in a 
protein complex, as opposed to 
working individually in an unlinked, 
sequential manner. 


the ensemble of proteins to move coordinately (Figure 4-51). In these 
machine-like complexes, the appropriate enzymes can be positioned to 
carry out successive reactions in a series—as during the synthesis of pro- 
teins on a ribosome, for example (discussed in Chapter 7). And during cell 
division, a large protein machine moves rapidly along DNA to replicate 
the DNA double helix (discussed in Chapter 6 and shown in Movie 6.3 
and Movie 6.4). 


A large number of different protein machines have evolved to perform 
many critical biological tasks. Cells make wide use of protein machines 
for the same reason that humans have invented mechanical and elec- 
tronic machines: for almost any job, manipulations that are spatially and 
temporally coordinated through linked processes are much more efficient 
than is the sequential use of individual tools. 


Many Interacting Proteins Are Brought Together by 
Scaffolds 


We have seen that proteins rely on interactions with other molecules to 
carry out their biological functions. Enzymes bind substrates and regu- 
latory ligands—many of which are generated by other enzymes in the 
same reaction pathway. Receptor proteins in the plasma membrane, 
when activated by extracellular ligands, can recruit a set of intracellular 
signaling proteins that interact with and activate one another, propagat- 
ing the signal to the cell interior. In addition, the proteins involved in DNA 
replication, gene transcription, DNA repair, and protein synthesis form 
protein machines that carry out these complex and crucial tasks with 
great efficiency. 


But how do proteins find the appropriate partners—and the sites where 
they are needed—within the crowded conditions inside the cell (see 
Figure 3-22)? Many protein complexes are brought together by scaf- 
fold proteins, large molecules that contain binding sites recognized by 
multiple proteins. By binding a specific set of interacting proteins, a scaf- 
fold can greatly enhance the rate of a particular chemical reaction or cell 
process, while also confining this chemistry to a particular area of the 
cell—for example, drawing signaling proteins to the plasma membrane. 


Although some scaffolds are rigid, the most abundant scaffolds in cells 
are very elastic. Because they contain long unstructured regions that 
allow them to bend and sway, these scaffolds serve as flexible tethers 
that greatly enhance the collisions between the proteins that are bound 
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Figure 4—52 Scaffold proteins can 

concentrate interacting proteins in the 
cell. In this hypothetical example, each of 
a set of interacting proteins is bound to a 


unstructured scaffold protein 
region 


rapid 
collisions A} specific structured domain within a long, 
T — i ze otherwise unstructured scaffold protein. The 
unstructured regions of the scaffold act as 
interacting e flexible tethers, and they enhance the rate 
structured proteins of formation of the functional complex by 
domain scaffold ready promoting the rapid, random collision of 


for reuse the proteins bound to the scaffold. 


to them (Figure 4-52). Some other scaffolds are not proteins but long 
molecules of RNA. We encounter these RNA scaffolds when we discuss 
RNA synthesis and processing in Chapter 7. 


Scaffolds allow proteins to be assembled and activated only when and 
where they are needed. Nerve cells, for example, deploy large, flex- 
ible scaffold proteins—some more than 1000 amino acids in length—to 
organize the specialized proteins involved in transmitting and receiving 
the signals that carry information from one nerve cell to the next. These 
proteins cluster beneath the plasma membranes of communicating nerve 
cells (see Figure 4-54), allowing them both to transmit and to respond to 
the appropriate messages when stimulated to do so. 


Weak Interactions Between Macromolecules Can 
Produce Large Biochemical Subcompartments in Cells 


The aggregates formed by sets of proteins, RNAs, and protein machines 
can grow quite large, producing distinct biochemical compartments 
within the cell. The largest of these is the nucleolus—the nuclear com- 
partment in which ribosomal RNAs are transcribed and ribosomal 
subunits are assembled. This cell structure, which is formed when the 
chromosomes that carry the ribosomal genes come together during inter- 
phase (see Figure 5-17), is large enough to be seen in a light microscope. 
Smaller, transient structures assemble as needed in the nucleus to gen- 
erate “factories” that carry out DNA replication, DNA repair, or MRNA 
production (see Figure 7-24). In addition, specific mRNAs are sequestered 
in cytoplasmic granules that help to control their use in protein synthesis. 


The general term used to describe such assemblies, many of which con- 
tain both protein and RNA, is an intracellular condensate. Some of 
these condensates, including the nucleolus, can take the form of spheri- 
cal, liquid droplets that can be seen to break up and fuse (Figure 4-53). 
Although these condensates resemble the sort of phase-separated com- 
partments that form when oil and water mix, their interior makeup is 
complex and structured. Some are based on amyloid structures, revers- 
ible assemblies of stacked B sheets that come together to produce a 
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Figure 4-53 Spherical, liquid-drop-like nucleoli can be seen to fuse in the light microscope. In these experiments, the nucleoli 

are present inside a nucleus that has been dissected from Xenopus oocytes and placed under oil on a microscope slide. Here, three 
nucleoli are seen fusing to form one larger nucleolus (Movie 4.12). A very similar process occurs following each round of division, when 
small nucleoli initially form on multiple chromosomes, but then coalesce to form a single, large nucleolus. (From 

C.P. Brangwynne, T.J. Mitchison, and A.A. Hyman, Proc. Natl. Acad. Sci. USA 108:4334—4339, 2011.) 


158 CHAPTER 4 


protein scaffolds 


amyloid 


product 


(A) RNA scaffolds 


Figure 4—54 Intracellular condensates 

can form biochemical subcompartments 
in cells. These large aggregates form as a 
result of multiple weak binding interactions 
between scaffolds and other macromolecules. 
When these macromolecule-macromolecule 
interactions become sufficiently strong, a 
"phase separation” occurs. This creates two 
distinct aqueous compartments, in one of 
which the interacting molecules are densely 
aggregated. Such intracellular condensates 
concentrate a select set of macromolecules, 
thereby producing regions with a special 
biochemistry without the use of an 
encapsulating membrane. 

(A) Schematic illustration of a phase- 
separated intracellular condensate. These 
condensates can create a factory that 
catalyzes the formation of a specific type of 
product, or they can serve to store important 
entities, such as specific mRNA molecules, 
for later use. As shown, reversible amyloid 
structures often help to create these 
aggregates. These B-sheet structures form 
between regions of unstructured amino acid 
sequence within the larger protein scaffolds. 

(B-D) Three examples that illustrate how 
intracellular condensates (colorized regions) 
are thought to be used by cells. (B) Inside the 
interphase nucleus, the nucleolus is a large 
factory that produces ribosomes. In addition, 
many scattered RNA production factories 
concentrate the protein machines that 
transcribe the genome. (C) In the cytoplasm, 
a matrix forms the centrosome that nucleates 
the assembly of microtubules. (D) In a patch 
underlying the plasma membrane at the 
synapse where communicating nerve cells 
touch, multiple interacting scaffolds produce 
large protein assemblies; these create a local 
biochemistry that makes possible memory 
formation and storage in the nerve cell 
network. (B, courtesy of E.G. Jordan and 
J. McGovern; C, from M. McGill, 

D.P. Highfield, T.M. Monahan, and 

B.R. Brinkley, J. Ultrastruct. Res. 57:43-53, 
1976. With permission from Elsevier; 

D, courtesy of Cedric Raine.) 
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“hydrogel” that pulls other molecules into the condensate (Figure 4—54). 
Amyloid-forming proteins thus have functional roles in cells. But for a 
handful of these amyloid-forming proteins, mutation or perturbation can 
lead to neurological disease, which is how some of them were initially 
discovered. 


HOW PROTEINS ARE STUDIED 


Understanding how a particular protein functions calls for detailed struc- 
tural and biochemical analyses—both of which require large amounts of 
pure protein. But isolating a single type of protein from the thousands 
of other proteins present in a cell is a formidable task. For many years, 
proteins had to be purified directly from the source—the tissues in which 
they are most plentiful. That approach was inconvenient, entailing, for 
example, early-morning trips to the slaughterhouse. More importantly, 
the complexity of intact tissues and organs is a major disadvantage when 
trying to purify particular molecules, because a long series of chromatog- 
raphy steps is generally required. These procedures not only take weeks 
to perform, but they also yield only a few milligrams of pure protein. 


Nowadays, proteins are more often isolated from cells that are grown in 
a laboratory (see, for example, Figure 1-39). Often these cells have been 
“tricked” into making large quantities of a given protein using the genetic 
engineering techniques discussed in Chapter 10. Such engineered cells 
frequently allow large amounts of pure protein to be obtained in only a 
few days. 


In this section, we outline how proteins are extracted and purified from 
cultured cells and other sources. We describe how these proteins are 
analyzed to determine their amino acid sequence and their three-dimen- 
sional structure. Finally, we discuss how technical advances are allowing 
proteins to be analyzed, cataloged, manipulated, and even designed from 
scratch. 


Proteins Can Be Purified from Cells or Tissues 


Whether starting with a piece of liver or a vat of bacteria, yeast, or ani- 
mal cells that have been engineered to produce a protein of interest, the 
first step in any purification procedure involves breaking open the cells 
to release their contents. The resulting slurry is called a cell homogenate 
or extract. This physical disruption is followed by an initial fractionation 
procedure to separate out the class of molecules of interest—for example, 
all the soluble proteins in the cell (Panel 4-3, pp. 164-165). 


With this collection of proteins in hand, the job is then to isolate the 
desired protein. The standard approach involves purifying the protein 
through a series of chromatography steps, which use different ma- 
terials to separate the individual components of a complex mixture into 


portions, or fractions, based on the properties of the protein—such as 
size, shape, or electrical charge. After each separation step, the result- 
ing fractions are examined to determine which ones contain the protein 
of interest. These fractions are then pooled and subjected to additional 
chromatography steps until the desired protein is obtained in pure form. 


The most efficient forms of protein chromatography separate polypeptides 
on the basis of their ability to bind to a particular molecule—a process 
called affinity chromatography (Panel 4-4, p. 166). If large amounts of 
antibodies that recognize the protein are available, for example, they can 
be attached to the matrix of a chromatography column and used to help 
extract the protein from a mixture (see Panel 4-2, pp. 140-141). 


Affinity chromatography can also be used to isolate proteins that interact 
physically with a protein being studied. In this case, the purified protein 
of interest is attached tightly to the column matrix; the proteins that bind 
to it will remain in the column and can then be removed by changing the 
composition of the washing solution (Figure 4-55). 


Proteins can also be separated by electrophoresis. In this technique, a 
mixture of proteins is loaded onto a polymer gel and subjected to an 
electric field; the polypeptides will then migrate through the gel at differ- 
ent speeds depending on their size and net charge (Panel 4-5, p. 167). If 
too many proteins are present in the sample, or if the proteins are very 
similar in their migration rate, they can be resolved further using two- 
dimensional gel electrophoresis (see Panel 4-5). These electrophoretic 
approaches yield a number of bands or spots that can be visualized by 
staining; each band or spot contains a different protein. Chromatography 
and electrophoresis—both developed more than 70 years ago but greatly 
improved since—continue to be instrumental in building an understand- 
ing of what proteins look like and how they behave. These and other 
historical breakthroughs are described in Table 4-2. 


Once a protein has been obtained in pure form, it can be used in bio- 
chemical assays to study the details of its activity. It can also be subjected 
to techniques that reveal its amino acid sequence and, ultimately, its pre- 
cise three-dimensional structure. 


Determining a Protein’s Structure Begins with 
Determining Its Amino Acid Sequence 


The task of determining a protein's primary structure—its amino acid 
sequence—can be accomplished in several ways. For many years, 
sequencing a protein was done by directly analyzing the amino acids 
in the purified protein. First, the protein was broken down into smaller 
pieces using a selective protease; the enzyme trypsin, for example, 
cleaves polypeptide chains on the carboxyl side of a lysine or an arginine. 
Then the identities of the amino acids in each fragment were determined 
chemically. The first protein sequenced in this way was the hormone 
insulin in 1955. 


A much faster way to determine the amino acid sequence of proteins that 
have been isolated from organisms for which the full genome sequence 
is known is a method called mass spectrometry. This technique deter- 
mines the exact mass of every peptide fragment in a purified protein, 
which then allows the protein to be identified from a database that con- 
tains a list of every protein thought to be encoded by the genome of the 
relevant organism. Such lists are computed by taking the organism’s 
genome sequence and applying the genetic code (discussed in Chapter 7). 


To perform mass spectrometry, the peptides derived from digestion with 
trypsin are blasted with a laser. This treatment heats the peptides, caus- 
ing them to become electrically charged (ionized) and ejected in the 
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Figure 4—55 Affinity chromatography can 
be used to isolate the binding partners of 
a protein of interest. The purified protein 
of interest (protein X) is covalently attached 
to the matrix of a chromatography column. 
An extract containing a mixture of proteins 
is then loaded onto the column. Those 
proteins that associate with protein X inside 
the cell will usually bind to it on the column. 
Proteins not bound to the column pass right 
through, and the proteins that are bound 
tightly to protein X can then be released by 
changing the pH or ionic composition of the 
washing solution. 
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TABLE 4-2 HISTORICAL LANDMARKS IN OUR UNDERSTANDING OF PROTEINS 


1838 


819-1904 
864 
894 
897 


926 


1233 


1934 
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951 


955 
956 


960 


1963 


1966 
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976 
984 


1988 


1996-2013 


975-2013 


The name “protein” (from the Greek proteios, “primary”) was suggested by Berzelius for the complex nitrogen-rich 
substance found in the cells of all animals and plants 


Most of the 20 common amino acids found in proteins were discovered 
Hoppe-Seyler crystallized, and named, the protein hemoglobin 
Fischer proposed a lock-and-key analogy for enzyme-substrate interactions 


Buchner and Buchner showed that cell-free extracts of yeast can break down sucrose to form carbon dioxide and 
ethanol, thereby laying the foundations of enzymology 


Sumner crystallized urease in pure form, demonstrating that proteins could possess the catalytic activity of enzymes; 
Svedberg developed the first analytical ultracentrifuge and used it to estimate the correct molecular weight of 
hemoglobin 


Tiselius introduced electrophoresis for separating proteins in solution 


Bernal and Crowfoot presented the first detailed x-ray diffraction patterns of a protein, obtained from crystals of the 
enzyme pepsin 


Martin and Synge developed chromatography, a technique now widely used to separate proteins 


Pauling and Corey proposed the structure of a helical conformation of a chain of amino acids—the a helix—and the 
structure of the B sheet, both of which were later found in many proteins 


Sanger determined the order of amino acids in insulin, the first protein whose amino acid sequence was determined 


Ingram produced the first protein fingerprints, showing that the difference between sickle-cell hemoglobin and 
normal hemoglobin is due to a change in a single amino acid (Movie 4.13) 


Kendrew described the first detailed three-dimensional structure of a protein (sperm whale myoglobin) to a 
resolution of 0.2 nm, and Perutz proposed a lower-resolution structure for hemoglobin 


Monod, Jacob, and Changeux recognized that many enzymes are regulated through allosteric changes in their 
conformation 


Phillips described the three-dimensional structure of lysozyme by x-ray crystallography, the first enzyme to be 
analyzed in atomic detail 


Nomura reconstituted a functional bacterial ribosome from purified components 


Henderson and Unwin determined the first three-dimensional structure of a transmembrane protein 
(bacteriorhodopsin), using a computer-based reconstruction from electron micrographs 


Neher and Sakmann developed patch-clamp recording to measure the activity of single ion-channel proteins 


Withrich used nuclear magnetic resonance (NMR) spectroscopy to solve the three-dimensional structure of a soluble 
sperm protein 


Tanaka and Fenn separately developed methods for using mass spectrometry to analyze proteins and other 
biological macromolecules 


Mann, Aebersold, Yates, and others refine methods for using mass spectrometry to identify proteins in complex 
mixtures, exploiting the availability of complete genome sequences 


Frank, Dubochet, Henderson and others develop computer-based methods for single-particle cryoelectron 
microscopy (cryo-EM), enabling determination of the structures of large protein complexes at atomic resolution 


form of a gas. Accelerated by a powerful electric field, the peptide ions 
then fly toward a detector; the time it takes them to arrive is related to 
their mass and their charge. (The larger the peptide is, the more slowly it 
moves; the more highly charged it is, the faster it moves.) The set of very 
exact masses of the protein fragments produced by trypsin cleavage then 
serves as a “fingerprint” that can be used to identify the protein—and its 
corresponding gene—from publicly accessible databases (Figure 4-56). 


This approach can even be applied to complex mixtures of proteins; 
for example, starting with an extract containing all the proteins made 
by yeast cells grown under a particular set of conditions. To obtain the 
increased resolution required to distinguish individual proteins, such 


Figure 4-56 Mass spectrometry can be used to identify proteins 
by determining the precise masses of peptides derived from them. 
As indicated, this in turn allows proteins of interest to be produced in 
the large amounts needed for determining their three-dimensional 
structure. In this example, a protein of interest is excised from a 
polyacrylamide gel after two-dimensional electrophoresis (see Panel 
4—5, p. 167) and then digested with trypsin. The peptide fragments 
are loaded into the mass spectrometer, and their exact masses are 
measured. Genome sequence databases are then searched to find the 
protein encoded by the organism in question whose profile matches 
this peptide fingerprint. Mixtures of proteins can also be analyzed in 
this way. (Image courtesy of Patrick O'Farrell.) 


mixtures are frequently analyzed using tandem mass spectrometry. In this 
case, after the peptides pass through the first mass spectrometer, they 
are broken into even smaller fragments and analyzed by a second mass 
spectrometer. 


Although all the information required for a polypeptide chain to fold is 
contained in its amino acid sequence, only in special cases can we reli- 
ably predict a protein’s detailed three-dimensional conformation—the 
spatial arrangement of its atoms—from its sequence alone. Today, the 
predominant way to discover the precise folding pattern of any pro- 
tein is by experiment, using x-ray crystallography, nuclear magnetic 
resonance (NMR) spectroscopy, or most recently cryoelectron 
microscopy (cryo-EM), as described in Panel 4-6 (pp. 168-169). 


Genetic Engineering Techniques Permit the Large-Scale 
Production, Design, and Analysis of Almost Any Protein 


Advances in genetic engineering techniques now permit the production 
of large quantities of almost any desired protein. In addition to making 
life much easier for biochemists interested in purifying specific proteins, 
this ability to churn out huge quantities of a protein has given rise to an 
entire biotechnology industry (Figure 4-57). Bacteria, yeast, and cultured 
mammalian cells are now used to mass-produce a variety of therapeutic 
proteins, such as insulin, human growth hormone, and even the fertility- 
enhancing drugs used to boost egg production in women undergoing in 
vitro fertilization treatment. Preparing these proteins previously required 
the collection and processing of vast amounts of tissue and other bio- 
logical products—including, in the case of the fertility drugs, the urine of 
postmenopausal nuns. 


The same sorts of genetic engineering techniques can also be employed 
to produce new proteins and enzymes that contain novel structures or 
perform unusual tasks: metabolizing toxic wastes or synthesizing life- 
saving drugs, for example. Most synthetic catalysts are nowhere near as 
effective as naturally occurring enzymes in terms of their ability to speed 
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Figure 4-57 Biotechnology companies 
produce mass quantities of useful 
proteins. Shown in this photograph are the 
large, turnkey microbial fermenters used 


to produce a whooping cough vaccine. 


(Courtesy of Pierre Guerin Technologies.) 
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Figure 4-58 Most proteins belong to 
structurally related families. (A) More 
than two-thirds of all well-studied proteins 
contain a single structural domain. The 
members of these single-domain families 
can have different amino acid sequences 
but fold into a protein with a similar shape. 
(B) During evolution, structural domains 
have been combined in different ways to 
produce families of multidomain proteins. 
Almost all novelty in protein structure 
comes from the way these single domains 
are arranged. Unlike the number of novel 
single domains, the number of multidomain 
families being added to the public 
databases is still rapidly increasing. 
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the rate of selected chemical reactions. But, as we continue to learn more 
about how proteins and enzymes exploit their unique conformations to 
carry out their biological functions, our ability to make novel proteins 
with useful functions can only improve. 


The Relatedness of Proteins Aids the Prediction of 
Protein Structure and Function 


Biochemists have made enormous progress over the past 150 years in 
understanding the structure and function of proteins (see Table 4-2, 
p. 160). These advances are the fruits of decades of painstaking research 
on isolated proteins, performed by individual scientists working tirelessly 
on single proteins or protein families, one by one, sometimes for their 
entire careers. In the future, however, more and more of these investi- 
gations of protein conformation and activity will likely take place on a 
larger scale. 


Improvements in our ability to rapidly sequence whole genomes, and 
the development of methods such as mass spectrometry, have fueled 
our ability to determine the amino acid sequences of enormous num- 
bers of proteins. Millions of unique protein sequences from thousands 
of different species have thereby been deposited into publicly avail- 
able databases, and the collection is expected to double in size every 
two years. Comparing the amino acid sequences of all of these proteins 
reveals that the majority belong to protein families that share specific 
“sequence patterns’—stretches of amino acids that fold into distinct 
structural domains. In some of these families, the proteins contain only a 
single structural domain. In others, the proteins include multiple domains 
arranged in novel combinations (Figure 4—58). 


Although the number of multidomain families is growing rapidly, the 
discovery of novel single domains appears to be leveling off. This pla- 
teau suggests that the vast majority of proteins may fold up into a limited 
number of structural domains—perhaps as few as 10,000 to 20,000. For 
many single-domain families, the structure of at least one family member 
is known. And knowing the structure of one family member allows us 
to say something about the structure of its relatives. By this account, we 
have some structural information for almost three-quarters of the pro- 
teins archived in databases (Movie 4.14). 


A future goal is to acquire the ability to look at a protein’s amino acid 
sequence and be able to deduce its structure and gain insight into its 
function. We are coming closer to being able to predict protein structure 
based on sequence information alone, but we still have a considerable 
way to go. To date, computational methods that take an amino acid 
sequence and search for the protein conformations with the lowest 
energy have been successful for proteins less than 100 amino acids long, 
or for longer proteins for which additional information is available (such 
as homology with proteins whose structure is known). 


Looking at an amino acid sequence and predicting how a protein will 
function—alone or as part of a complex in the cell—is more challenging 
still. But the closer we get to accomplishing these goals, the closer we 
will be to understanding the fundamental basis of life. 


ESSENTIAL CONCEPTS 


e Living cells contain an enormously diverse set of protein molecules, 
each made as a linear chain of amino acids linked together by cova- 
lent peptide bonds. 


e Each type of protein has a unique amino acid sequence, which 


determines both its three-dimensional shape and its biological 
activity. 


The folded structure of a protein is stabilized by multiple noncovalent 
interactions between different parts of the polypeptide chain. 


Hydrogen bonds between neighboring regions of the polypeptide 
backbone often give rise to regular folding patterns, known as a heli- 
ces and f sheets. 


The structure of many proteins can be subdivided into smaller globu- 
lar regions of compact three-dimensional structure, known as protein 
domains. 


The biological function of a protein depends on the detailed chemical 
properties of its surface and how it binds to other molecules called 
ligands. 


When a protein catalyzes the formation or breakage of a specific 
covalent bond in a ligand, the protein is called an enzyme and the 
ligand is called a substrate. 


At the active site of an enzyme, the amino acid side chains of the 
folded protein are precisely positioned so that they favor the for- 
mation of the high-energy transition states that the substrates must 
pass through to be converted to product. 


The three-dimensional structure of many proteins has evolved so 
that the binding of a small ligand outside of the active site can induce 
a significant change in protein shape. 


Most enzymes are allosteric proteins that can exist in two conforma- 
tions that differ in catalytic activity, and the enzyme can be turned 
on or off by ligands that bind to a distinct regulatory site to stabilize 
either the active or the inactive conformation. 


The activities of most enzymes within the cell are strictly regulated. 
One of the most common forms of regulation is feedback inhibition, 
in which an enzyme early in a metabolic pathway is inhibited by the 
binding of one of the pathway’s end products. 


Many thousands of proteins in a typical eukaryotic cell are regulated 
by cycles of phosphorylation and dephosphorylation. 


GTP-binding proteins also regulate protein function in eukaryotes; 
they act as molecular switches that are active when GTP is bound 
and inactive when GDP is bound, turning themselves off by hydrolyz- 
ing their bound GTP to GDP. 


Motor proteins produce directed movement in eukaryotic cells 
through conformational changes linked to the hydrolysis of a tightly 
bound molecule of ATP to ADP. 


Highly efficient protein machines are formed by assemblies of allos- 
teric proteins in which the various conformational changes are 
coordinated to perform complex functions. 


Covalent modifications added to a protein’s amino acid side chains 
can control the location and function of the protein and can serve as 
docking sites for other proteins. 


Biochemical subcompartments often form as phase-separated intra- 
cellular condensates, speeding important reactions and confining 
them to specific regions of the cell. 


Starting from crude cell or tissue homogenates, individual proteins 
can be obtained in pure form by using a series of chromatography 
steps. 


The function of a purified protein can be discovered by biochemical 
analyses, and its exact three-dimensional structure can be deter- 
mined by x-ray crystallography, NMR spectroscopy, or cryoelectron 
microscopy. 


Essential Concepts 
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BREAKING OPEN CELLS AND TISSUES 


The first step in the Using gentle mechanical procedures, called homogenization, The resulting thick soup (called 
purification of most the plasma membranes of cells can be ruptured so that the cell a homogenate or an extract) 
proteins is to disrupt contents are released. Four commonly used procedures are contains large and small molecules 
tissues and cells in a shown here. from the cytosol, such as enzymes, 
controlled fashion. ribosomes, and metabolites, as well 


as all of the membrane-enclosed 


: e. R Eo oó: organelles. 
uO PIRE- ey’ ja : 


Ca N ~ s; Ly e Pwa . ES PB. 
OP 1 Break apart cells with 2 Usea mild detergent 4 2 J 


© 


high-frequency to make holes in the 
sound (ultrasound). plasma membrane. 


cell 
suspension 


When carefully conducted, 
homogenization leaves most 
of the membrane-enclosed 
organelles largely intact. 


3 Force cells through 4 Shear cells between 
a small hole using a close-fitting rotating 
high pressure. plunger and the thick 


walls of a glass vessel. 


swinging-arm rotor centrifugal force 
THE CENTRIFUGE — 
Co > tube 
armored chamber sedimenting material ‘6 € 1 Dl G 


al p 
CENTRIFUGATION ¢ a metal bucket 
E: 
Le 


Le 
Many cell fractionations are done The metal buckets that hold the tubes are 
in a second type of rotor, a free to swing outward as the rotor turns. 


swinging-arm rotor. 


HOMOGENATE SUPERNATANT 
before smaller and less 
centrifugation CENTRIFUGATION dense components 
— 
PELLET 
larger and more 
dense components 


| 


refrigeration vacuum 
motor 
Centrifugation is the most widely used procedure to separate a Such speeds require centrifuge chambers to be refrigerated and 
homogenate into different parts, or fractions. The homogenate is have the air evacuated so that friction does not heat up the 
placed in test tubes and rotated at high speed in a centrifuge or homogenate. The centrifuge is surrounded by thick armor plating, 
ultracentrifuge. Present-day ultracentrifuges rotate at speeds up because an unbalanced rotor can shatter with an explosive release 
to 100,000 revolutions per minute and produce enormous forces, of energy. A fixed-angle rotor can hold larger volumes than a 


as high as 600,000 times gravity. swinging-arm rotor, but the pellet forms less evenly, as shown. 


DIFFERENTIAL CENTRIFUGATION 


Repeated centrifugation at progressively 
higher speeds will fractionate cell 
homogenates into their components. 
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Centrifugation separates cell components on the basis of size and density. The larger 
and denser components experience the greatest centrifugal force and move most 
rapidly. They sediment to form a pellet at the bottom of the tube, while smaller, less 


dense components remain in suspension above, a portion called the supernatant. 


LOW-SPEED 
CENTRIFUGATION 


cell 
homogenate 


PELLET 1 


whole cells 
nuclei 
cytoskeletons 


VELOCITY SEDIMENTATION 


sample CENTRIFUGATION 


stabilizing 
sucrose 
gradient 
(e.g., 520%) 


MEDIUM-SPEED 
CENTRIFUGATION OF 
SUPERNATANT 1 


HIGH-SPEED 
CENTRIFUGATION OF 
SUPERNATANT 2 


peroxisomes 


reticulum 


VERY HIGH-SPEED 
CENTRIFUGATION OF 
SUPERNATANT 3 


———_———— 
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PELLET 2 PELLET 3 PELLET 4 
mitochondria closed fragments ribosomes 
lysosomes of endoplasmic viruses 


large macromolecules 


other small vesicles 


FRACTIONATION 
ETAT A AE 


| _ slowly sedimenting 
component 


#— fast-sedimenting 
component 


centrifuge tube pierced 
at its base 


automated rack of small collecting 
tubes allows fractions to be collected 
as the rack moves from left to right 


Subcellular components sediment at different rates according to their 
size after being carefully layered over a dilute salt solution and then 
centrifuged through it. In order to stabilize the sedimenting 
components against convective mixing in the tube, the solution contains 
a continuous shallow gradient of sucrose that increases in concentration 
toward the bottom of the tube. The gradient is typically 5—20% 
sucrose. When sedimented through such a dilute sucrose gradient, using 
a swinging-arm rotor, different cell components separate into distinct 
bands that can be collected individually. 


EQUILIBRIUM SEDIMENTATION 


The sample is distributed 
throughout the sucrose 


The ultracentrifuge can also be used to density gradient. 


separate cell components on the basis of their 
buoyant density, independently of their 

size or shape. The sample is usually either 
layered on top of, or dispersed within, a 
steep density gradient that contains a 

very high concentration of sucrose or cesium 


CENTRIFUGATION 


c 


> sample 


U Ww w 


rack movement 


eS we Y V 


After an appropriate centrifugation time, the 
bands may be collected, most simply by 
puncturing the plastic centrifuge tube and 
collecting drops from the bottom, as shown here. 


At equilibrium, components 
have migrated to a region in 
the gradient that matches 
their own density. 


CENTRIFUGATION 


S 


low-buoyant 
|_— density 
component 


m high-buoyant 


: . ;— stee ji 
chloride. Each subcellular component will Si pees density 
h on component 
move up or down when centrifuged until it gradient 
reaches a position where its density matches (e.g., 20-70%) 
its surroundings and then will move no further. START BEFORE EQUILIBRIUM EQUILIBRIUM 


A series of distinct bands will eventually be 
produced, with those nearest the bottom of the 
tube containing the components of highest 
buoyant density. The method is also called 
density gradient centrifugation. 


A sucrose gradient is shown here, 
but denser gradients can be formed with 
cesium chloride that are particularly useful 


for separating nucleic acids (DNA and RNA). 


The final bands can be 
collected from the base of 
the tube, as shown above for 
velocity sedimentation. 
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PROTEIN SEPARATION 


Proteins are very diverse. They differ in 
size, shape, charge, hydrophobicity, and 
their affinity for other molecules. All of 
these properties can be exploited to 
separate them from one another so 
that they can be studied individually. 


THREE KINDS OF 
CHROMATOGRAPHY 


Although the material used to form 
the matrix for column chromatography 
varies, it is usually packed in the 
column in the form of small beads. 

A typical protein purification strategy 
might employ in turn each of the 
three kinds of matrix described 
below, with a final protein 
purification of up to 10,000-fold. 
Purity can easily be assessed by gel 
electrophoresis (Panel 4-5). 
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(A) ION-EXCHANGE CHROMATOGRAPHY 


lon-exchange columns are packed with 
small beads carrying either positive or 
negative charges that retard proteins of 
the opposite charge. The association 
between a protein and the matrix 
depends on the pH and ionic strength of 
the solution passing down the column. 
These can be varied in a controlled way 
to achieve an effective separation. 


COLUMN CHROMATOGRAPHY 


Proteins are often fractionated by column chromatography. A mixture of proteins in 
solution is applied to the top of a cylindrical column filled with a permeable solid 
matrix immersed in solvent. A large amount of solvent is then pumped through the 
column. Because different proteins are retarded to different extents by their 
interaction with the matrix, they can be collected separately as they flow out from 
the bottom. According to the choice of matrix, proteins can be separated according 
to their charge, hydrophobicity, size, or ability to bind to particular chemical 

groups (see below). 


solvent continuously 

applied to the top of 
sample column from a large 
applied reservoir of solvent 
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(B) GEL-FILTRATION CHROMATOGRAPHY (C) AFFINITY CHROMATOGRAPHY 
Gel-filtration columns separate proteins Affinity columns contain a matrix covalently 
according to their size. The matrix consists coupled to a molecule that interacts 
of tiny porous beads. Protein molecules specifically with the protein of interest 
that are small enough to enter the holes (e.g., an antibody or an enzyme substrate). 
in the beads are delayed and travel more Proteins that bind specifically to such a 
slowly through the column. Proteins that column can subsequently be released by a 
cannot enter the beads are washed out pH change or by concentrated salt 
of the column first. Such columns also solutions, and they emerge highly purified 


allow an estimate of protein size. (see Figure 4-55). 
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GEL ELECTROPHORESIS The detergent CH3 protein with two 
. g subunits, A and B, 
sodium dodecyl ioined by a disulfid ; i 
sample loaded onto gel sulfate (SDS) CH2 joine z H i ide single-subunit 
by pipette is used to (S-S) bon protein 
solubilize CH2 A B € 
astic casin proteins for SDS 
9 polyacrylamide- CH2 Lss 
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gel electrophoresis. 


negatively 
charged SDS E 
molecules 


s O=$=O POLYACRYLAMIDE-GEL ELECTROPHORESIS 
buffer oO Na® A 
When an electric field is applied to a solution “SDS” 


containing protein molecules, the proteins 
will migrate in a direction and at a speed that 
reflects their size and net charge. This forms 
the basis of the technique called 
electrophoresis. 


SDS polyacrylamide-gel electrophoresis (SDS-PAGE) 

Individual polypeptide chains form a complex with 

negatively charged molecules of sodium dodecyl 

sulfate (SDS) and therefore migrate as negatively 

charged SDS-protein complexes through a slab of 

porous polyacrylamide gel. The apparatus used for 

this electrophoresis technique is shown above (left). A 

A reducing agent (mercaptoethanol) is usually 

added to break any S-S linkages within or between 

proteins. Under these conditions, unfolded \ © 
polypeptide chains migrate at a rate that reflects 

their molecular weight, with the smallest proteins slab of polyacrylamide gel 
migrating most quickly. 


ISOELECTRIC FOCUSING 


For any protein there is a characteristic 
pH, called the isoelectric point, at which 
the protein has no net charge and 
therefore will not move in an electric 
field. In isoelectric focusing, proteins 
are electrophoresed in a narrow tube of 
polyacrylamide gel in which a pH 
gradient is established by a mixture of 
special buffers. Each protein moves to a 
point in the pH gradient that corresponds 
to its isoelectric point and stays there. 


TWO-DIMENSIONAL POLYACRYLAMIDE-GEL ELECTROPHORESIS 


Complex mixtures of proteins cannot be resolved well on one-dimensional gels, but 
two-dimensional gel electrophoresis, combining two different separation methods, can 
be used to resolve more than 1000 proteins in a two-dimensional protein map. In the 
first step, native proteins are separated in a narrow gel on the basis of their intrinsic 
charge using isoelectric focusing (see left). In the second step, this gel is placed on top of 
a gel slab, and the proteins are subjected to SDS-PAGE (see above) in a direction 
perpendicular to that used in the first step. Each protein migrates to form a discrete spot. 


stable pH gradient 


© 


All the proteins in 

an E. coli bacterial basic ~<«— stable pH gradient ——»> acidic 
cell are separated 
in this two- 
dimensional gel, in 
which each spot 
corresponds to a 
different 
polypeptide chain. 
They are separated 
according to their 
isoelectric point 
from left to right 
and to their 
molecular weight 
from top to 
bottom. (Courtesy 
of Patrick O'Farrell.) 


1 
At high pH, At low pH, 
the protein the protein 
is negatively is positively 
charged. charged. 


© © 


~& SDS migration (mol. wt. x 1073) 


The protein shown here has an isoelectric pH of 6.5. 
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X-RAY CRYSTALLOGRAPHY 


To determine a protein's three-dimensional structure—and assess how this conformation changes as the protein 
functions—one must be able to “see” the relative positions of the protein’s individual atoms. Since the 1930s, x-ray 
crystallography has been the gold standard for the determination of protein structure. This method uses x rays—which have a 
wavelength approximately equal to the diameter of a hydrogen atom—to probe the structure of proteins at an atomic level. 


To begin, the purified protein is first coaxed into forming crystals: large, highly ordered arrays in which every protein 
molecule has the same conformation and is perfectly aligned with its neighbors. The process can take years of trial and error 
to find the right conditions to produce high-quality protein crystals. When a narrow beam of x-rays is directed at this crystal, 
the atoms in the protein molecules scatter the incoming x-rays. These scattered waves either reinforce or cancel one another, 
producing a complex diffraction pattern that is collected by electronic detectors. The position and intensity of each spot in 
the x-ray diffraction pattern contain information about the position of the atoms in the protein crystal. 


x-ray diffraction pattern 
obtained from the protein crystal 


x-ray source of x-rays structure from 
(A) diffraction pattern 


op 
É — 
/ beam eI calculation of 


(D) 


Computers then transform these patterns into maps of the relative spatial positions of the atoms. By combining this information 
with the amino acid sequence of the protein, an atomic model of the protein's structure can be generated. The protein shown 
here is ribulose bisphosphate carboxylase (Rubisco), an enzyme that plays a central role in CO, fixation during photosynthesis 
(discussed in Chapter 14). The protein illustrated is approximately 450 amino acids in length. Nitrogen atoms are shown in blue, 
oxygen in red, phosphorus in yellow; and carbon in gray. (B, courtesy of C. Branden; C, courtesy of J. Hajdu and I. Andersson.) 


NMR SPECTROSCOPY 


If a protein is small—50,000 daltons or less—its 
structure in solution can be determined by nuclear 
magnetic resonance (NMR) spectroscopy. This 
method takes advantage of the fact that for many 
atoms—hydrogen in particular—the nucleus is 
intrinsically magnetic. 


When a solution of pure protein is exposed to a 
powerful magnet, its nuclei will act like tiny bar 
magnets and align themselves with the magnetic 
field. If the protein solution is then bombarded with 
a blast of radio waves, the excited nuclei will wobble 
around their magnetic axes, and, as they relax back 
into the aligned position, they give off a signal that 
can be used to reveal their relative positions. 


(A) (Courtesy of P. Kraulis, Uppsala) (B) 


Again, combined with an amino acid sequence, an NMR spectrum can allow the computation of a protein’s three-dimensional 
structure. Proteins larger than 50,000 daltons can be broken up into their constituent functional domains before analysis by NMR 
spectroscopy. In (A), a two-dimensional NMR spectrum derived from the C-terminal binding domain of the enzyme cellulase is 
shown. The spots represent interactions between neighboring H atoms. The structures that satisfy the distance constraints 
presented by the NMR spectrum are shown superimposed in (B). This domain, which binds to cellulose, is 36 amino acids in length. 


CRYOELECTRON MICROSCOPY 


X-ray crystallography remains the first port of call 
when determining proteins’ structures. However, large 
macromolecular machines are often hard to crystallize, 
as are many integral membrane proteins, and for 
dynamic proteins and assemblies it is hard to access 
different conformations through crystallography 
alone. To get around these problems, investigators are 
increasingly turning to cryoelectron microscopy 
(cryo-EM) to solve macromolecular structures. 


beam of electrons 


È a 3> AN gates 


electron detector captures projected image of molecules 


Algorithms then sort 
the particles into sets 
that each contains 
particles that are all 
oriented in the same 
direction. The 
thousands of images 
in each set are all 
then superimposed 
and averaged to 
improve the signal to 
noise ratio. 


CRYO-EM STRUCTURE OF 
THE RIBOSOME 


Courtesy of Joachim Frank. 


60S ribosomal subunits randomly 
oriented in a thin film of ice 100 nm 


Although by no means routine, big improvements in 
image processing algorithms, modeling tools and sheer 
computing power all mean that structures of 
macromolecular complexes are now becoming attainable 
with resolutions in the 0.2 to 0.3 nm range. 
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molecules immobilized in thin film of ice 


carbon film on EM grid 


In this technique, a droplet of the pure protein in water is placed 
on a small EM grid that is plunged into a vat of liquid ethane at 
-180°C. This freezes the proteins in a thin film of ice and the rapid 
freezing ensures that the surrounding water molecules have no 
time to form ice crystals, which would damage the protein’s shape. 


The sample is examined, still frozen, by transmission electron 
microscopy (see Panel 1-1, p. 13). To avoid damage, it is 
important that only a few electrons pass through each part of the 
specimen, sensitive detectors are therefore deployed to capture 
every electron that passes through the specimen. Much EM 
specimen preparation and data collection is now fully automated 
and many thousands of micrographs are typically captured, each 
of which will contain hundreds or thousands of individual 
molecules all arranged in random orientations within the ice. 


This crisper two-dimensional 
image set, which represents 
different views of the particle, 
are then combined and 
converted via a series of 
complex iterative steps into a 
high resolution 
three-dimensional structure. 


Model of GroEL 
(Courtesy of Gabriel Lander.) 


60S large ribosomal subunit at path of a rRNA loop fitted 
0.25 nm resolution into the electron density map 


This resolving power now approaches that of x-ray 
crystallography, and the two techniques thrive together, each 
bootstrapping the other to obtain ever more useful and dynamic 
structural information. A good example is the structure of the 
ribosome shown here at a resolution of 0.25 nm. 
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CHAPTER 4 


Protein Structure and Function 


KEY TERMS 


active site 

allosteric 

a helix 

amino acid sequence 
antibody 

antigen 

B sheet 

binding site 
C-terminus 
chromatography 
coenzyme 

coiled-coil 
conformation 
cryoelectron microscopy (cryo-EM) 
disulfide bond 
electrophoresis 
enzyme 


feedback inhibition 


fibrous protein 

globular protein 

GTP-binding protein 

helix 

intracellular condensate 

intrinsically disordered sequence 

ligand 

lysozyme 

mass spectrometry 

Michaelis constant (Km) 

motor protein 

N-terminus 

nuclear magnetic resonance 
(NMR) spectroscopy 

peptide bond 

polypeptide, polypeptide chain 

polypeptide backbone 

primary structure 


protein 

protein domain 
protein family 
protein kinase 
protein machine 
protein phosphatase 
protein phosphorylation 
quaternary structure 
scaffold protein 
secondary structure 
side chain 

substrate 

subunit 

tertiary structure 
transition state 
turnover number 
Vmax 


x-ray crystallography 


QUESTIONS 


QUESTION 4-9 


Look at the models of the protein in Figure 4—11. Is the 
red a helix right- or left-handed? Are the three strands that 
form the large B sheet parallel or antiparallel? Starting at 
the N-terminus (the purple end), trace your finger along the 
peptide backbone. Are there any knots? Why, or why not? 


QUESTION 4-10 


Which of the following statements are correct? Explain your 
answers. 


A. The active site of an enzyme usually occupies only a 
small fraction of the enzyme surface. 


B. Catalysis by some enzymes involves the formation of 
a covalent bond between an amino acid side chain and a 
substrate molecule. 


C. AB sheet can contain up to five strands, but no more. 


D. The specificity of an antibody molecule is contained 
exclusively in loops on the surface of the folded light-chain 
domain. 


E. The possible linear arrangements of amino acids are so 
vast that new proteins almost never evolve by alteration of 
old ones. 


F. Allosteric enzymes have two or more binding sites. 


G. Noncovalent bonds are too weak to influence the three- 
dimensional structure of macromolecules. 


H. Affinity chromatography separates molecules according 
to their intrinsic charge. 


|. Upon centrifugation of a cell homogenate, smaller 
organelles experience less friction and thereby sediment 
faster than larger ones. 


QUESTION 4-11 

What common feature of a helices and B sheets makes them 
universal building blocks for proteins? 

QUESTION 4-12 


Protein structure is determined solely by a protein’s amino 
acid sequence. Should a genetically engineered protein in 
which the original order of all amino acids is reversed have 
the same structure as the original protein? 


QUESTION 4-13 


Consider the following protein sequence as an a helix: 

Leu-Lys-Arg-lle-Val-Asp-lle-Leu-Ser-Arg-Leu-Phe-Lys-Val. 

How many turns does this helix make? Do you find anything 

remarkable about the arrangement of the amino acids in 

this sequence when folded into an a helix? (Hint: consult the 

properties of the amino acids in Figure 4-3.) 

QUESTION 4-14 

Simple enzyme reactions often conform to the equation: 
E+SoES—>EP~E+P 


where E, S, and P are enzyme, substrate, and product, 
respectively. 


A. What does ES represent in this equation? 


B. Why is the first step shown with bidirectional arrows and 
the second step as a unidirectional arrow? 


C. Why does E appear at both ends of the equation? 


D. One often finds that high concentrations of P inhibit the 
enzyme. Suggest why this might occur. 


E. If compound X resembles S and binds to the active site 


of the enzyme but cannot undergo the reaction catalyzed 
by it, what effects would you expect the addition of X to 
the reaction to have? Compare the effects of X and of the 
accumulation of P. 


QUESTION 4-15 


Which of the following amino acids would you expect 

to find more often near the center of a folded globular 
protein? Which ones would you expect to find more often 
exposed to the outside? Explain your answers. Ser, Ser-P (a 
Ser residue that is phosphorylated), Leu, Lys, Gln, His, Phe, 
Val, Ile, Met, Cys-S—S—Cys (two cysteines that are disulfide- 
bonded), and Glu. Where would you expect to find the most 
N-terminal amino acid and the most C-terminal amino acid? 


QUESTION 4-16 


Assume you want to make and study fragments of a protein. 
Would you expect that any fragment of the polypeptide 
chain would fold the same way as it would in the intact 
protein? Consider the protein shown in Figure 4—20. Which 
fragments do you suppose are most likely to fold correctly? 


QUESTION 4-17 


Neurofilament proteins assemble into long, intermediate 
filaments (discussed in Chapter 17), found in abundance 
running along the length of nerve cell axons. The C-terminal 
region of these proteins is an unstructured polypeptide, 
hundreds of amino acids long and heavily modified by the 
addition of phosphate groups. The term “polymer brush” 
has been applied to this part of the neurofilament. Can you 
suggest why? 


QUESTION 4-18 


An enzyme isolated from a mutant bacterium grown at 

20°C works in a test tube at 20°C but not at 37°C (37°C is 
the temperature of the gut, where this bacterium normally 
lives). Furthermore, once the enzyme has been exposed 

to the higher temperature, it no longer works at the lower 
one. The same enzyme isolated from the normal bacterium 
works at both temperatures. Can you suggest what happens 
(at the molecular level) to the mutant enzyme as the 
temperature increases? 


QUESTION 4-19 


A motor protein moves along protein filaments in the cell. 
Why are the elements shown in the illustration not sufficient 
to mediate directed movement (Figure Q4-19)? With 
reference to Figure 4-50, modify the illustration shown 
here to include other elements that are required to create a 
unidirectional motor, and justify each modification you make 
to the illustration. 


Figure Q4-19 


QUESTION 4-20 


Gel-filtration chromatography separates molecules 
according to their size (see Panel 4—4, p. 166). Smaller 
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molecules diffuse faster in solution than larger ones, yet 
smaller molecules migrate more slowly through a gel- 
filtration column than larger ones. Explain this paradox. 
What should happen at very rapid flow rates? 


QUESTION 4-21 


As shown in Figure 4—16, both a helices and the coiled-coil 
structures that can form from them are helical structures, 
but do they have the same handedness in the figure? 
Explain why? 


QUESTION 4-22 


How is it possible that a change in a single amino acid in a 
protein of 1000 amino acids can destroy protein function, 
even when that amino acid is far away from any ligand- 
binding site? 


QUESTION 4-23 


The curve shown in Figure 4-35 is described by the 
Michaelis-Menten equation: 


rate (v) = Vmax [S]/(Km + [S]) 


Can you convince yourself that the features qualitatively 
described in the text are accurately represented by this 
equation? In particular, how can the equation be simplified 
when the substrate concentration [S] is in one of the 
following ranges: (A) [S] is much smaller than the Km, 

(B) [S] equals the Km, and (C) [S] is much larger than the Km? 


QUESTION 4-24 


The rate of a simple enzyme reaction is given by the 
standard Michaelis-Menten equation: 


rate = Vmax [S]/(Km + [S]) 


If the Vmax of an enzyme is 100 umole/sec and the Km is 

1 mM, at what substrate concentration is the rate 

50 umole/sec? Plot a graph of rate versus substrate (S) 
concentration for [S] = 0 to 10 mM. Convert this to a plot of 
1/rate versus 1/[S]. Why is the latter plot a straight line? 


QUESTION 4-25 


Select the correct options in the following and explain your 
choices. If [S] is very much smaller than Km, the active site 
of the enzyme is mostly occupied/unoccupied. If [S] is very 
much greater than Ky, the reaction rate is limited by the 
enzyme/substrate concentration. 


QUESTION 4-26 


A. The reaction rates of the reaction S — P, catalyzed by 
enzyme E, were determined under conditions in which only 
very little product was formed. The data in the table below 
were measured, plot the data as a graph. Use this graph to 
estimate the Ky and the Vmax for this enzyme. 


B. To determine the Km and Vmax values more precisely, 
a trick is generally used in which the Michaelis-Menten 
equation is transformed so that it is possible to plot the 
data as a straight line. A simple rearrangement yields 


1/rate = (Km/ Vmax) (1/[S]) + 1/Vmax 


which is an equation of the form y = ax + b. Calculate 
1/rate and 1/[S] for the data given in part (A) and then plot 
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Substrate Concentration Reaction Rate (umole/min) 
(uM) 

0.08 0.15 

0.12 0.21 

0.54 0.7 

123 tell 

1.82 1.3) 

DMD LS 

4.94 id 

10.00 1.8 


1/rate versus 1/[S] as a new graph. Determine Ky and Vmax 
from the intercept of the line with the axis, where 1/[S] = 0, 
combined with the slope of the line. Do your results agree 
with the estimates made from the first graph of the raw 
data? 


C. It is stated in part (A) that only very little product 
was formed under the reaction conditions. Why is this 
important? 


D. Assume the enzyme is regulated such that upon 
phosphorylation its Km increases by a factor of 3 without 
changing its Vmax. Is this an activation or inhibition? Plot the 
data you would expect for the phosphorylated enzyme in 
both the graph for (A) and the graph for (B). 


DNA and Chromosomes 


Life depends on the ability of cells to store, retrieve, and translate the 
genetic instructions required to make and maintain a living organism. 
These instructions are stored within every living cell in its genes—the 
information-bearing elements that determine the characteristics of a spe- 
cies as a whole and of the individuals within it. 


At the beginning of the twentieth century, when genetics emerged as 
a science, scientists became intrigued by the chemical nature of genes. 
The information in genes is copied and transmitted from a cell to its 
daughter cells millions of times during the life of a multicellular organ- 
ism, and passed from generation to generation through the reproductive 
cells—eggs and sperm. Genes survive this process of replication and 
transmission essentially unchanged. What kind of molecule could be 
capable of such accurate and almost unlimited replication, and also be 
able to direct the development of an organism and the daily life of a cell? 
What kind of instructions does the genetic information contain? How are 
these instructions physically organized so that the enormous amount of 
information required for the development and maintenance of even the 
simplest organism can be contained within the tiny space of a cell? 


The answers to some of these questions began to emerge in the 1940s, 
when it was discovered from studies in simple fungi that genetic informa- 
tion consists primarily of instructions for making proteins. As described 
in the previous chapter, proteins perform most of the cell's functions: they 
serve as building blocks for cell structures; they form the enzymes that 
catalyze the cell’s chemical reactions; they regulate the activity of genes; 
and they enable cells to move and to communicate with one another. 
With hindsight, it is hard to imagine what other type of instructions the 
genetic information could have contained. 


THE STRUCTURE OF DNA 


THE STRUCTURE OF 
EUKARYOTIC CHROMOSOMES 


THE REGULATION OF 
CHROMOSOME STRUCTURE 


174 CHAPTER 5 


single chromosome 


(A) 
dividing cell 


nondividing cell 


@ 


10 um 


Figure 5-1 Chromosomes become visible 
as eukaryotic cells prepare to divide. 

(A) Two adjacent plant cells photographed 
using a fluorescence microscope. The 
DNA, which is labeled with a fluorescent 
dye (DAPI), is packaged into multiple 
chromosomes; these become visible as 
distinct structures only when they condense 
in preparation for cell division, as can be 
seen in the cell on the left. For clarity, a 
single chromosome has been shaded 
(brown) in the dividing cell. The cell on the 
right, which is not dividing, contains the 
identical chromosomes, but they cannot 
be distinguished as individual entities 
because the DNA is in a much more 
extended conformation at this phase in the 
cell's division cycle. (B) Schematic diagram 
of the outlines of the two cells and their 
chromosomes. (A, courtesy of Peter Shaw.) 


DNA and Chromosomes 


The other crucial advance made in the 1940s was the recognition that 
deoxyribonucleic acid (DNA) is the carrier of the cell's genetic informa- 
tion. But the mechanism whereby the information could be copied for 
transmission from one generation of cells to the next, and how proteins 
might be specified by instructions in DNA, remained completely myste- 
rious until 1953, when the structure of DNA was determined by James 
Watson and Francis Crick. The structure immediately revealed how DNA 
might be copied, or replicated, and it provided the first clues about how 
a molecule of DNA might encode the instructions for making proteins. 
Today, the fact that DNA is the genetic material is so fundamental to our 
understanding of life that it can be difficult to appreciate what an enor- 
mous intellectual gap this discovery filled. 


In this chapter, we begin by describing the structure of DNA. We see how, 
despite its chemical simplicity, the structure and chemical properties of 
DNA make it ideally suited for carrying genetic information. We then con- 
sider how genes and other important segments of DNA are arranged in 
the single, long DNA molecule that forms each chromosome in the cell. 
Finally, we discuss how eukaryotic cells fold these long DNA molecules 
into compact chromosomes inside the nucleus. This packing has to be 
done in an orderly fashion so that the chromosomes can be apportioned 
correctly between the two daughter cells at each cell division. At the 
same time, chromosomal packaging must allow DNA to be accessed by 
the large number of proteins that replicate and repair it, and that deter- 
mine the activity of the cell’s many genes. 


This is the first of five chapters that deal with basic genetic mechanisms— 
the ways in which the cell maintains and makes use of the genetic 
information carried in its DNA. In Chapter 6, we discuss the mechanisms 
by which the cell accurately replicates and repairs its DNA. In Chapter 7, 
we consider gene expression—how genes are used to produce RNA and 
protein molecules. In Chapter 8, we describe how a cell controls gene 
expression to ensure that each of the many thousands of proteins encoded 
in its DNA is manufactured at the proper time and place. In Chapter 9, we 
discuss how present-day genes evolved, and, in Chapter 10, we consider 
some of the ways that DNA can be experimentally manipulated to study 
fundamental cell processes. 


An enormous amount has been learned about these subjects in the past 
60 years. Much less obvious, but equally important, is the fact that our 
knowledge is very incomplete; thus a great deal still remains to be dis- 
covered about how DNA provides the instructions to build living things. 


THE STRUCTURE OF DNA 


Long before biologists understood the structure of DNA, they had rec- 
ognized that inherited traits and the genes that determine them were 
associated with chromosomes. Chromosomes (named from the Greek 
chroma, “color,” because of their staining properties) were discovered in 
the nineteenth century as threadlike structures in the nucleus of eukary- 
otic cells that become visible as the cells begin to divide (Figure 5-1). As 
biochemical analyses became possible, researchers learned that chro- 
mosomes contain both DNA and protein. But which of these components 
encoded the organism's genetic information was not immediately clear. 


We now know that the DNA carries the genetic information of the cell and 
that the protein components of chromosomes function largely to pack- 
age and control the enormously long DNA molecules. But biologists in 
the 1940s had difficulty accepting DNA as the genetic material because of 
the apparent simplicity of its chemistry (see How We Know, pp. 193-195). 


DNA, after all, is simply a long polymer composed of only four types of 
nucleotide subunits, which are chemically very similar to one another. 


Then, early in the 1950s, Maurice Wilkins and Rosalind Franklin exam- 
ined DNA using x-ray diffraction analysis, a technique for determining 
the three-dimensional atomic structure of a molecule (see Panel 4-6, 
pp. 168-169). Their results provided one of the crucial pieces of evidence 
that led, in 1953, to Watson and Crick’s model of the double-helical struc- 
ture of DNA. This structure—in which two strands of DNA are wound 
around each other to form a helix—immediately suggested how DNA 
could encode the instructions necessary for life, and how these instruc- 
tions could be copied and passed along when cells divide. In this section, 
we examine the structure of DNA and explain in general terms how it is 
able to store hereditary information. 


A DNA Molecule Consists of Two Complementary 
Chains of Nucleotides 


A molecule of deoxyribonucleic acid (DNA) consists of two long poly- 
nucleotide chains. Each chain, or strand, is composed of four types of 
nucleotide subunits, and the two strands are held together by hydrogen 
bonds between the base portions of the nucleotides (Figure 5-2). 


AS we saw in Chapter 2 (Panel 2-7, pp. 78-79), nucleotides are com- 
posed of a nitrogen-containing base and a five-carbon sugar, to which 
a phosphate group is attached. For the nucleotides in DNA, the sugar 
is deoxyribose (hence the name deoxyribonucleic acid) and the base 
can be either adenine (A), cytosine (C), guanine (G), or thymine (T). The 
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The Structure of DNA 


Figure 5-2 DNA is made of four 
nucleotide building blocks. (A) Each 
nucleotide is composed of a sugar 
phosphate covalently linked to a 
base—quanine (G) in this figure. 

(B) The nucleotides are covalently linked 
together into polynucleotide chains, with 

a sugar—phosphate backbone from which 
the bases—adenine, cytosine, guanine, 
and thymine (A, C, G, and T)—extend. 

(C) ADNA molecule is composed of two 
polynucleotide chains (DNA strands) held 
together by hydrogen bonds between 

the paired bases. The arrows on the DNA 
strands indicate the polarities of the two 
strands, which run antiparallel to each other 
(with opposite chemical polarities) in the 
DNA molecule. (D) Although the DNA is 
shown straightened out in (C), in reality, it is 
wound into a double helix, as shown here. 
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QUESTION 5-1 


Which of the following statements 
are correct? Explain your answers. 
A. ADNA strand has a polarity 
because its two ends contain 
different bases. 

B. G-C base pairs are more stable 
than A-T base pairs. 


Figure 5-3 The nucleotide subunits within a DNA strand are held 
together by phosphodiester bonds. These bonds connect one sugar 
to the next. The chemical differences in the ester linkages—between 
the 5’ carbon of one sugar and the 3’ carbon of the other—give rise 

to the polarity of the resulting DNA strand. For simplicity, only two 
nucleotides are shown here. 


nucleotides are covalently linked together in a chain through the sugars 
and phosphates, which form a backbone of alternating sugar-phosphate- 
sugar-phosphate (see Figure 5-2B). Because only the base differs in 
each of the four types of subunits, each polynucleotide chain resembles 
a necklace: a sugar-phosphate backbone strung with four types of tiny 
beads (the four bases A, C, G, and T). These same symbols (A, C, G, and 
T) are also commonly used to denote the four different nucleotides—that 
is, the bases with their attached sugar phosphates. 


The nucleotide subunits within a DNA strand are held together by phos- 
phodiester bonds that link the 5’ end of one sugar with the 3’ end of the 
next (Figure 5-3). Because the ester linkages to the sugar molecules on 
either side of the bond are different, each DNA strand has a chemical 
polarity. If we imagine that each nucleotide has a phosphate “knob” and 
a hydroxyl “hole” (see Figure 5-2A), each strand, formed by interlocking 
knobs with holes, will have all of its subunits lined up in the same orienta- 
tion. Moreover, the two ends of the strand can be easily distinguished, as 
one will have a hole (the 3’ hydroxyl) and the other a knob (the 5’ phos- 
phate). This polarity in a DNA strand is indicated by referring to one end as 
the 3’ end and the other as the 5’ end (see Figure 5-3). 


The two polynucleotide chains in the DNA double helix are held together 
by hydrogen-bonding between the bases on the different strands. All the 
bases are therefore on the inside of the double helix, with the sugar- 
phosphate backbones on the outside (see Figure 5-2D). The bases do 
not pair at random, however; A always pairs with T, and G always pairs 
with C (Figure 5-4). In each case, a bulkier two-ring base (a purine, see 
Panel 2-7, pp. 78-79) is paired with a single-ring base (a pyrimidine). 
Each purine-pyrimidine pair is called a base pair, and this complemen- 
tary base-pairing enables the base pairs to be packed in the energetically 
most favorable arrangement along the interior of the double helix. In this 
arrangement, each base pair has the same width, thus holding the sugar- 
phosphate backbones an equal distance apart along the DNA molecule. 
For the members of each base pair to fit together within the double helix, 
the two strands of the helix must run antiparallel to each other—that is, be 
oriented with opposite polarities (see Figure 5-2C and D). The antiparallel 
sugar-phosphate strands then twist around each other to form a double 
helix containing 10 base pairs per helical turn (Figure 5-5). This twisting 
also contributes to the energetically favorable conformation of the DNA 
double helix. 


As a consequence of the base-pairing arrangement shown in Figure 5-4, 
each strand of a DNA double helix contains a sequence of nucleotides 
that is exactly complementary to the nucleotide sequence of its part- 
ner strand—an A always matches a T on the opposite strand, and a C 
always matches a G. This complementarity is of crucial importance when 
it comes to both copying and maintaining the DNA structure, as we dis- 
cuss in Chapter 6. An animated version of the DNA double helix can be 
seen in Movie 5.1. 


The Structure of DNA Provides a Mechanism for Heredity 


The fact that genes encode information that must be copied and trans- 
mitted accurately when a cell divides raised two fundamental issues: how 
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Figure 5-4 The two strands of the DNA double helix are held together by hydrogen bonds between 
complementary base pairs. (A) Schematic illustration showing how the shapes and chemical structures of the 

bases allow hydrogen bonds to form efficiently only between A and T and between G and C. The atoms that form 
the hydrogen bonds between these nucleotides (see Panel 2-3, pp. 70-71) can be brought close together without 
perturbing the double helix. As shown, two hydrogen bonds form between A and T, whereas three form between 

G and C. The bases can pair in this way only if the two polynucleotide chains that contain them are antiparallel—that 
is, oriented in opposite directions. (B) A short section of the double helix viewed from its side. Four base pairs are 
illustrated; note that they lie perpendicular to the axis of the helix, unlike the schematic shown in (A). As shown in 
Figure 5-3, the nucleotides are linked together covalently by phosphodiester bonds that connect the 3'-hydroxyl 
(-OH) group of one sugar and the 5’ phosphate (-PO3) attached to the next (see Panel 2-7, pp. 78-79, to review how 
the carbon atoms in the sugar ring are numbered). This linkage gives each polynucleotide strand a chemical polarity; 
that is, its two ends are chemically different. The 3' end carries an unlinked -OH group attached to the 3' position on 
the sugar ring; the 5' end carries a free phosphate group attached to the 5' position on the sugar ring. 


can the information for specifying an organism be carried in chemical 
form, and how can the information be accurately copied? The structure 
of DNA provides the answer to both questions. 


Information is encoded in the order, or sequence, of the nucleotides along 
each DNA strand. Each base—A, C, T, or G—can be considered a letter in 
a four-letter alphabet that is used to spell out biological messages (Figure 
5-6). Organisms differ from one another because their respective DNA 
molecules have different nucleotide sequences and, consequently, carry 
different biological messages. But how is the nucleotide alphabet used to 


major 


groove 
make up messages, and what do they spell out? 
Before the structure of DNA was determined, investigators had estab- J 
lished that genes contain the instructions for producing proteins. Thus, it J]. 
was clear that DNA messages must somehow be able to encode proteins. SOONE 


Consideration of the chemical character of proteins makes the problem 


Figure 5-5 A space-filling model shows the conformation of the 
DNA double helix. The two DNA strands wind around each other to 
form a right-handed helix (see Figure 4-14) with 10 bases per turn. 
Shown here are 1.5 turns of the DNA double helix. The coiling of the 
two strands around each other creates two grooves in the double 
helix. The wider groove is called the major groove and the smaller one 
the minor groove. The colors of the atoms are: N, blue; O, red; 

P, yellow; H, white; and C, black. (See Movie 5.1.) 
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Figure 5-6 Linear messages come in 
many forms. The languages shown are 
(A) English, (B) a musical score, (C) Morse 
code, (D) Japanese, and (E) DNA. 


Figure 5-7 Most genes contain 
information to make proteins. As we 
discuss in Chapter 7, protein-coding genes 
each produce a set of RNA molecules, 
which then direct the production of a 
specific protein molecule. Note that for a 
minority of genes, the final product is the 
RNA molecule itself, as shown here for 
gene C. In these cases, gene expression is 
complete once the nucleotide sequence 
of the DNA has been transcribed into the 
nucleotide sequence of its RNA. 


easier to define. As discussed in Chapter 4, the function of a protein is 
determined by its three-dimensional structure, which in turn is deter- 
mined by the sequence of the amino acids in its polypeptide chain. The 
linear sequence of nucleotides in a gene, therefore, must somehow spell 
out the linear sequence of amino acids in a protein. 


The exact correspondence between the 4-letter nucleotide alphabet of 
DNA and the 20-letter amino acid alphabet of proteins—the genetic 
code—is not at all obvious from the structure of the DNA molecule. It 
took more than a decade of clever experiments after the discovery of 
the double helix to work this code out. In Chapter 7, we describe the 
genetic code in detail when we discuss gene expression—the process by 
which the nucleotide sequence of a gene is transcribed into the nucleotide 
sequence of an RNA molecule—and then, in most cases, translated into 
the amino acid sequence of a protein (Figure 5-7). 


The amount of information in an organism's DNA is staggering: writ- 
ten out in the four-letter nucleotide alphabet, the nucleotide sequence of 
a very small protein-coding gene from humans occupies a quarter of a 
page of text, while the complete human DNA sequence would fill more 
than 1000 books the size of this one. Herein lies a problem that affects the 
architecture of all eukaryotic chromosomes: How can all this information 
be packed neatly into the cell nucleus? In the remainder of this chapter, 
we discuss the answer to this question. 


THE STRUCTURE OF EUKARYOTIC 
CHROMOSOMES 


Large amounts of DNA are required to encode all the information needed 
to make a single-celled bacterium, and far more DNA is needed to encode 
the information to make a multicellular organism like you. Each human 
cell contains about 2 meters (m) of DNA; yet the cell nucleus is only 5-8 
um in diameter. Tucking all this material into such a small space is the 
equivalent of trying to fold 40 km (24 miles) of extremely fine thread into 
a tennis ball. 


In eukaryotic cells, very long, double-stranded DNA molecules are pack- 
aged into chromosomes. These chromosomes not only fit handily inside 
the nucleus, but, after they are duplicated, they can be accurately appor- 
tioned between the two daughter cells at each cell division. The complex 
task of packaging DNA is accomplished by specialized proteins that bind 
to and fold the DNA, generating a series of coils and loops that provide 
increasingly higher levels of organization and prevent the DNA from 
becoming a tangled, unmanageable mess. Amazingly, this DNA is folded 
in a way that allows it to remain accessible to all of the enzymes and 
other proteins that replicate and repair it, and that cause the expression 
of its genes. 
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The Structure of Eukaryotic Chromosomes 


Bacteria typically carry their genes on a single, circular DNA molecule. 
This molecule is also associated with proteins that condense the DNA, 
but these bacterial proteins differ from the ones that package eukaryotic 
DNA. Although this prokaryotic DNA is called a bacterial “chromosome,” 
it does not have the same structure as eukaryotic chromosomes, and 
less is known about how it is packaged. Our discussion of chromo- 
some structure in this chapter will therefore focus entirely on eukaryotic 
chromosomes. 


Eukaryotic DNA Is Packaged into Multiple 
Chromosomes 


In eukaryotes, such as ourselves, nuclear DNA is distributed among a set 
of different chromosomes. The DNA in a human nucleus, for example, is 
parceled out into 23 or 24 different types of chromosome, depending on 
an individual's sex (males, with their Y chromosome, have an extra type 
of chromosome that females do not). Each of these chromosomes con- 
sists of a single, enormously long, linear DNA molecule associated with 
proteins that fold and pack the fine thread of DNA into a more compact 
structure. This complex of DNA and protein is called chromatin. In addi- 
tion to the proteins involved in packaging the DNA, chromosomes also 
associate with many other proteins involved in DNA replication, DNA 
repair, and gene expression. 


With the exception of the gametes (sperm and eggs) and highly special- 
ized cells that lack DNA entirely (such as mature red blood cells), human 
cells each contain two copies of every chromosome, one inherited from 
the mother and one from the father. The maternal and paternal versions 
of each chromosome are called homologous chromosomes (homologs). 
The only nonhomologous chromosome pairs in humans are the sex chro- 
mosomes in males, where a Y chromosome is inherited from the father 
and an X chromosome from the mother. (Females inherit one X chromo- 
some from each parent and have no Y chromosome.) Each full set of 
human chromosomes contains a total of approximately 3.2 x 10? nucleo- 
tide pairs of DNA—which together comprise the human genome. 


In addition to being different sizes, the different human chromosomes 
can be distinguished from one another by a variety of techniques. Each 
chromosome can be “painted” a different color using sets of chromo- 
some-specific DNA molecules coupled to different fluorescent dyes 
(Figure 5-8A). An earlier and more traditional way of distinguishing one 
chromosome from another involves staining the chromosomes with dyes 
that bind to certain types of DNA sequences. These dyes mainly distin- 
guish between DNA that is rich in A-T nucleotide pairs and DNA that is 
G-C rich, and they produce a predictable pattern of bands along each type 
of chromosome. The resulting patterns allow each chromosome to be 
identified and numbered. 


(A) 


Figure 5-8 Each human chromosome 

can be “painted” a different color to 
allow its unambiguous identification. The 
chromosomes shown here were isolated 
from a cell undergoing nuclear division 
(mitosis) and are therefore in a highly 
compact (condensed) state. Chromosome 
painting is carried out by exposing the 
chromosomes to a collection of single- 
stranded DNA molecules that have been 
coupled to a combination of fluorescent 
dyes. For example, single-stranded 

DNA molecules that match sequences 

in chromosome 1 are labeled with one 
specific dye combination, those that match 
sequences in chromosome 2 with another, 
and so on. Because the labeled DNA can 
form base pairs (hybridize) only with its 
specific chromosome (discussed in Chapter 
10), each chromosome is differently colored. 
For such experiments, the chromosomes 
are treated so that the individual strands 
of its double-helical DNA partly separate 
to enable base-pairing with the labeled, 
single-stranded DNA. 

(A) Micrograph showing the array of 
chromosomes as they originally spilled from 
the lysed cell. (B) The same chromosomes 
artificially lined up in their numerical order. 
This arrangement of the full chromosome 
set is called a karyotype. (Adapted from 
N. McNeil and T. Ried, Expert Rev. Mol. 
Med. 2:1-14, 2000. With permission from 
Cambridge University Press.) 
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Figure 5-9 Abnormal chromosomes are associated with some 
inherited genetic disorders. (A) Two normal human chromosomes, 
chromosome 6 and chromosome 4, have been subjected to 
chromosome painting as described in Figure 5-8. (B) In an individual 
with a reciprocal chromosomal translocation, a segment of one 
chromosome has been swapped with a segment from the other. 
Such chromosomal translocations are a frequent event in cancer 
cells. (Courtesy of Zhenya Tang and the NIGMS Human Genetic Cell 
Repository at the Coriell Institute for Medical Research.) 


An ordered display of the full set of 46 human chromosomes is called 
the human karyotype (Figure 5-8B). If parts of a chromosome are lost, 
or switched between chromosomes, these changes can be detected. 
Cytogeneticists analyze karyotypes to detect chromosomal abnormalities 
that are associated with some inherited disorders (Figure 5-9) and with 
certain types of cancer (as we see in Chapter 20). 


Chromosomes Organize and Carry Genetic Information 


The most important function of chromosomes is to carry genes—the 
functional units of heredity. A gene is often defined as a segment of DNA 
that contains the instructions for making a particular protein or RNA mol- 
ecule. Most of the RNA molecules encoded by genes are subsequently 
used to produce a protein. In some cases, however, the RNA molecule 
is the final product (see Figure 5-7). Like proteins, these RNA molecules 
have diverse functions in the cell, including structural, catalytic, and gene 
regulatory roles, as we discuss in later chapters. 


Together, the total genetic information carried by a complete set of the 
chromosomes present in a cell or organism constitutes its genome. 
Complete genome sequences have been determined for thousands of 
organisms, from E. coli to humans. As might be expected, some correla- 
tion exists between the complexity of an organism and the number of 
genes in its genome. For example, the total number of genes is about 
500 for the simplest bacterium and about 24,000 for humans. Bacteria 
and some single-celled eukaryotes, including the budding yeast S. cerevi- 
siae, have especially compact genomes: the DNA molecules that make up 
their chromosomes are little more than strings of closely packed genes 
(Figure 5-10). However, chromosomes from many eukaryotes—includ- 
ing humans—contain, in addition to genes and the specific nucleotide 
sequences required for normal gene expression, a large excess of inter- 
spersed DNA (Figure 5-11). This extra DNA is sometimes erroneously 
called “junk DNA,” because its usefulness to the cell has not yet been dem- 
onstrated. Although this spare DNA does not code for protein, much of it 
may serve some other biological function. Comparisons of the genome 
sequences from many different species reveal that small portions of this 
extra DNA are highly conserved among related species, suggesting their 
importance for these organisms. 
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Figure 5-10 In yeast, genes are closely packed along chromosomes. This figure shows a small region of the DNA double helix in 
one chromosome from the budding yeast S. cerevisiae. The S. cerevisiae genome contains about 12.5 million nucleotide pairs and 6600 
genes—spread across 16 chromosomes. Note that, for each gene, only one of the two DNA strands actually encodes the information 
to make an RNA molecule. This coding region can fall on either strand, as indicated by the light red bars. However, each “gene” is 
considered to include both the “coding strand” and its complement. The high density of genes is characteristic of S. cerevisiae. 
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Figure 5-11 In many eukaryotes, genes include an excess of 
interspersed, noncoding DNA. Presented here is the nucleotide 
sequence of the human B-globin gene. This gene carries the 
information that specifies the amino acid sequence of one of the two 
types of subunits found in hemoglobin, a protein that carries oxygen 
in the blood. Only the sequence of the coding strand is shown here; 
the noncoding strand of the double helix carries the complementary 
sequence. Starting from its 5’ end, such a sequence is read from left 
to right, like any piece of English text. The segments of the DNA 
sequence that encode the amino acid sequence of B-globin are 
highlighted in yellow. We will see in Chapter 7 how this information is 
transcribed and translated to produce a full-length B-globin protein. 


In general, the more complex an organism, the larger is its genome. 
But this relationship does not always hold true. The human genome, for 
example, is 200 times larger than that of the yeast S. cerevisiae, but 30 
times smaller than that of some plants and at least 60 times smaller than 
some species of amoeba (see Figure 1-41). Furthermore, how the DNA is 
apportioned over chromosomes also differs from one species to another. 
Humans have a total of 46 chromosomes (including both maternal and 
paternal sets), but a species of small deer has only 7, while some carp 
species have more than 100. Even closely related species with similar 
genome sizes can have very different chromosome numbers and sizes 
(Figure 5-12). Thus, although gene number is roughly correlated with 
species complexity, there is no simple relationship between gene num- 
ber, chromosome number, and total genome size. The genomes and 
chromosomes of modern species have each been shaped by a unique 
history of seemingly random genetic events, acted on by specific selec- 
tion pressures, as we discuss in Chapter 9. 


Specialized DNA Sequences Are Required for DNA 
Replication and Chromosome Segregation 


To form a functional chromosome, a DNA molecule must do more than 
simply carry genes: it must be able to be replicated, and the replicated 
copies must be separated and partitioned equally and reliably into the 
two daughter cells at each cell division. These processes occur through 
an ordered series of events, known collectively as the cell cycle. This 
cycle of cell growth and division is summarized—very briefly—in Figure 
5-13 and will be discussed in detail in Chapter 18. Only two broad stages 
of the cell cycle need concern us in this chapter: interphase, when chro- 
mosomes are duplicated, and mitosis, the much more brief stage when 
the duplicated chromosomes are distributed, or segregated, to the two 
daughter nuclei. 


During interphase, chromosomes are extended as long, thin, tangled 
threads of DNA in the nucleus and cannot be easily distinguished in 
the light microscope (see Figure 5-1). We refer to chromosomes in this 
extended state as interphase chromosomes. It is during interphase that 
DNA replication takes place. As we discuss in Chapter 6, two specialized 
DNA sequences, found in all eukaryotes, ensure that this process occurs 
efficiently. One type of nucleotide sequence, called a replication origin, 
is where DNA replication begins; eukaryotic chromosomes contain many 
replication origins to allow the long DNA molecules to be replicated rap- 
idly (Figure 5-14). Another DNA sequence forms the telomeres that mark 
the ends of each chromosome. Telomeres contain repeated nucleotide 
sequences that are required for the ends of chromosomes to be fully rep- 
licated. They also serve as a protective cap that keeps the chromosome 
tips from being mistaken by the cell as broken DNA in need of repair. 


CCCTGTGGAGCCACACCCTAGGGTTGGCCA 
ATCTACTCCCAGGAGCAGGGAGGGCAGGAG 
CCAGGGCTGGGCATAAAAGTCAGGGCAGAG 
CCATCTATTGCTTACATTTGCTTCTGACAC 
AACTGTGTTCACTAGCAACTCAAACAGACA 


T TGGTATCAAGGTTACAAGACAGGT 
TTAAGGAGACCAATAGAAACTGGGCATGTG 
GAGACAGAGAAGACTCTTGGGTTTCTGATA 
GGCACTGACTCTCTCTGCCTATTGGTCTAT 
TTTCCCACCCTTAG! 


AGTCTATGGGACCCTTGATGTTTTCTTTCC 
CCTTCTTTTCTATGGTTAAGTTCATGTCAT 
AGGAAGGGGAGAAGTAACAGGGTACAGTTT 
AGAATGGGAAACAGACGAATGATTGCATCA 
GTGTGGAAGTCTCAGGATCGTTTTAGTTTC 
TTTTATTTGCTGTITCATAACAATTGTTTTC 
TITTGTTTAATTCTIGCTTTCTTTTTTTTT 
CTTCTCCGCAATTTTTACTATTATACTTAA 
TGCCTTAACATTGTGTATAACAAAAGGAAA 
TATCTCTGAGATACATTAAGTAACTTAAAA 
AAAAACTTTACACAGTCTGCCTAGTACATT 
ACTATTTGGAATATATGTGTGCTTATTTGC 
ATATTCATAATCTCCCTACTTTATTTTCTT 
TTATTTTTAATTGATACATAATCATTATAC 
ATATTTATGGGTTAAAGTGTAATGTTTTAA 
TATGTGTACACATATTGACCAAATCAGGGT 
AATTTTGCATTTGTAATTTTAAAAAATGCT 
TICTTCTTTTAATATACTTTTTTGTTTATC 
TTATTTCTAATACTTTCCCTAATCTCTTTC 
TTTCAGGGCAATAATGATACAATGTATCAT 
GCCTCTTTGCACCATTCTAAAGAATAACAG 
TGATAATTTCTGGGTTAAGGCAATAGCAAT 
ATTTCTGCATATAAATATTTCTGCATATAA 
ATTGTAACTGATGTAAGAGGTTTCATATTG 
CTAATAGCAGCTACAATCCAGCTACCATTC 
TGCTTTTATTTTATGGTTGGGATAAGGCTG 
GATTATTCTGAGTCCAAGCTAGGCCCTTTT 
GCTAATCATGTTCATACCTCTTATCTTCCT 


GCTCGCTTTCTTGC 
TGTCCAATTTCTATTAAAGGT TCCTTTIGTT 
CCCTAAGTCCAACTACTAAACTGGGGGATA 
TTATGAAGGGCCTTGAGCATCTGGATTCTG 
CCTAATAAAAAACATTTATTTTCATTGCAA 
TGATGTATTTAAATTATTICTGAATATTTT 
ACTAAAAAGGGAATGTGGGAGGTCAGTGCA 
TTTAAAACATAAAGAAATGATGAGCTGTTC 
AAACCTTGGGAAAATACACTATATCTTAAA 
CTCCATGAAAGAAGGT GAGGCTGCAACCAG 
CTAATGCACATTGGCAACAGCCCCTGATGC 
CTATGCCTTATTCATCCCTCAGAAAAGGAT 
TCTTGTAGAGGCTTGATTTGCAGGTTAAAG 
TTTTGCTATGCTGTATTTTACATTACTTAT 
TGTTTTAGCTGTCCTCATGAATGTCTTTTC 
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Figure 5-12 Two closely related species can have similar genome sizes but very different chromosome 
numbers. In the evolution of the Indian muntjac deer, chromosomes that were initially separate, and that remain 
separate in the Chinese species, fused without having a major effect on the number of genes—or the animal. 
(Image left, courtesy of Deborah Carreno, Natural Wonders Photography; image right, courtesy of Beatrice Bourgery.) 
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Eukaryotic chromosomes also contain a third type of specialized DNA 
sequence, called the centromere, that allows duplicated chromosomes 
to be separated during M phase (see Figure 5-14). During this stage of 
the cell cycle, the DNA coils up, adopting a more and more compact 
structure, ultimately forming highly compacted, or condensed, mitotic 
chromosomes (Figure 5-15). This is the state in which the duplicated 
chromosomes can be most easily visualized (see Figure 5-1). Once the 
chromosomes have condensed, the centromere allows the mitotic spin- 
dle to attach to each duplicated chromosome in a way that directs one 
copy of each chromosome to be segregated to each of the two daughter 
cells (see Figure 5-13). We describe the central role that centromeres play 
in cell division in Chapter 18. 


Interphase Chromosomes Are Not Randomly 
Distributed Within the Nucleus 


Interphase chromosomes are much longer and finer than mitotic chro- 
mosomes. They are nevertheless organized within the nucleus in several 
ways. First, although interphase chromosomes are constantly undergo- 
ing dynamic rearrangements, each tends to occupy a particular region, 
or territory, of the interphase nucleus (Figure 5-16). This loose organi- 
zation prevents interphase chromosomes from becoming extensively 
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Figure 5-13 The duplication and segregation of chromosomes occurs through an ordered cell cycle in proliferating cells. During 
interphase, the cell expresses many of its genes, and—during part of this phase—it duplicates its chromosomes. Once chromosome 
duplication is complete, the cell can enter M phase, during which nuclear division, or mitosis, occurs. In mitosis, the duplicated 
chromosomes condense, gene expression largely ceases, the nuclear envelope breaks down, and the mitotic spindle forms from 
microtubules and other proteins. The condensed chromosomes are then captured by the mitotic spindle, one complete set is pulled 
to each end of the cell, and a nuclear envelope forms around each chromosome set. In the final step of M phase, the cell divides to 
produce two daughter cells. Only two different chromosomes are shown here for simplicity. 
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Figure 5-14 Three DNA sequence elements are needed to produce a eukaryotic 
chromosome that can be duplicated and then segregated at mitosis. Each 
chromosome has multiple origins of replication, one centromere, and two telomeres. 
The sequence of events that a typical chromosome follows during the cell cycle is 
shown schematically. The DNA replicates in interphase, beginning at the origins of 
replication and proceeding bidirectionally from each origin along the chromosome. 
In M phase, the centromere attaches the compact, duplicated chromosomes to the 
mitotic spindle so that one copy will be distributed to each daughter cell when the 
cell divides. Prior to cell division, the centromere also helps to hold the duplicated 
chromosomes together until they are ready to be pulled apart. Telomeres contain 
DNA sequences that allow for the complete replication of chromosome ends. 


entangled, like spaghetti in a bowl. In addition, some chromosomal 
regions are physically attached to particular sites on the nuclear enve- 
lope—the pair of concentric membranes that surround the nucleus—or 
to the underlying nuclear lamina, the protein meshwork that supports the 
envelope (discussed in Chapter 17). These attachments also help inter- 
phase chromosomes remain within their distinct territories. 


The most obvious example of chromosomal organization in the inter- 
phase nucleus is the nucleolus—a structure large enough to be seen in 
the light microscope (Figure 5-17A). During interphase, the parts of dif- 
ferent chromosomes that carry genes encoding ribosomal RNAs come 
together to form the nucleolus. In human cells, several hundred copies 
of these genes are distributed in 10 clusters, located near the tips of five 
different chromosome pairs (Figure 5-17B). In the nucleolus, ribosomal 
RNAs are synthesized and combine with proteins to form ribosomes, the 
cell's protein-synthesizing machines. As we discuss in Chapter 7, riboso- 
mal RNAs play both structural and catalytic roles in the ribosome. 


The DNA in Chromosomes Is Always Highly Condensed 


As we have seen, all eukaryotic cells, whether in interphase or mitosis, 
package their DNA tightly into chromosomes. Human chromosome 22, 
for example, contains about 48 million nucleotide pairs; stretched out 
end-to-end, its DNA would extend about 1.5 cm. Yet, during mitosis, chro- 
mosome 22 measures only about 2 um in length—that is, nearly 10,000 
times more compact than the DNA would be if it were extended to its 
full length. This remarkable feat of compression is performed by proteins 
that coil and fold the DNA into higher and higher levels of organization. 
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Figure 5-15 A typical duplicated mitotic 
chromosome is highly compact. Because 
DNA is replicated during interphase, 

each mitotic chromosome contains two 
identical duplicated DNA molecules 

(see Figure 5-14). Each of these very 

long DNA molecules, with its associated 
proteins, is called a chromatid; as soon as 
the two sister chromatids separate, they 
are considered individual chromosomes. 
(A) A scanning electron micrograph of a 
mitotic chromosome. The two chromatids 
are tightly joined together. The constricted 
region reveals the position of the 
centromere. (B) A cartoon representation 
of a mitotic chromosome. (A, courtesy of 
Terry D. Allen.) 
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Figure 5-16 Interphase chromosomes 
occupy their own distinct territories 
within the nucleus. DNA probes coupled 
with different fluorescent markers are used 
to paint individual interphase chromosomes 
in a human cell. (A) Viewed in a fluorescence 
microscope, the nucleus is seen to be 

filled with a patchwork of discrete colors. 

(B) To highlight their distinct locations, 
three sets of chromosomes are singled 

out: chromosomes 3, 5, and 11. Note that 
pairs of homologous chromosomes, such 

as the two copies of chromosome 3, are 

not generally located in the same position. 
(Adapted from M.R. Hubner and 

D.L. Spector, Annu. Rev. Biophys. 
39:471-489, 2010.) 


Figure 5-17 The nucleolus is the most 
prominent structure in the interphase 
nucleus. (A) Electron micrograph of a thin 
section through the nucleus of a human 
fibroblast. The nucleus is surrounded by the 
nuclear envelope. Inside the nucleus, the 
chromatin appears as a diffuse speckled 
mass; regions that are especially dense 

are called heterochromatin (dark staining). 
Heterochromatin contains few genes and 

is located mainly around the periphery of 
the nucleus, immediately under the nuclear 
envelope. The large, dark region within the 
nucleus is the nucleolus, which contains the 
genes for ribosomal RNAs. (B) Schematic 
illustration showing how ribosomal RNA 
genes, which are clustered near the tips 

of five different human chromosomes 

(13, 14, 15, 21, and 22), come together 

to form the nucleolus, which is 

a biochemical subcompartment 

produced by the aggregation of a set of 
macromolecules—DNA, RNAs, and proteins 
(see Figure 4-54). (A, courtesy of 

E.G. Jordan and J. McGovern.) 
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Although the DNA of interphase chromosomes is packed tightly into the 
nucleus, it is about 20 times less condensed than that of mitotic chromo- 
somes (Figure 5-18). 


In the next sections, we introduce the specialized proteins that make this 
compression possible. Bear in mind, though, that chromosome structure 
is dynamic. Not only do chromosomes condense and decondense dur- 
ing the cell cycle, but chromosome packaging must be flexible enough 
to allow rapid, on-demand access to different regions of the interphase 
chromosome, unpacking enough to allow protein complexes access to 
specific, localized nucleotide sequences for DNA replication, DNA repair, 
or gene expression. 


Nucleosomes Are the Basic Units of Eukaryotic 
Chromosome Structure 


The proteins that bind to DNA to form eukaryotic chromosomes are tradi- 
tionally divided into two general classes: the histones and the nonhistone 
chromosomal proteins. Histones are present in enormous quantities 
(more than 60 million molecules of several different types in each human 
cell), and their total mass in chromosomes is about equal to that of the 
DNA itself. Nonhistone chromosomal proteins are also present in large 
numbers; they include hundreds of different chromatin-associated pro- 
teins. In contrast, only a handful of different histone proteins are present 
in eukaryotic cells. The complex of both classes of protein with nuclear 
DNA is called chromatin. 


Histones are responsible for the first and most fundamental level of chro- 
matin packing: the formation of the nucleosome. Nucleosomes convert 
the DNA molecules in an interphase nucleus into a chromatin fiber that 
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Figure 5-18 DNA in interphase chromosomes is less compact 
than in mitotic chromosomes. (A) An electron micrograph showing 
an enormous tangle of chromatin (DNA with its associated proteins) 
spilling out of a lysed interphase nucleus. (B) For comparison, a 
compact, human mitotic chromosome is shown at the same scale. 
(A, courtesy of Victoria Foe; B, courtesy of Terry D. Allen.) 


is approximately one-third the length of the initial DNA. These chroma- 
tin fibers, when examined with an electron microscope, contain clusters 
of closely packed nucleosomes (Figure 5-19A). If this chromatin is then 
subjected to treatments that cause it to unfold partially, it can then be 
seen in the electron microscope as a series of “beads on a string” (Figure 
5-19B). The string is DNA, and each bead is a nucleosome core particle, 
which consists of DNA wound around a core of histone proteins. 


To determine the structure of the nucleosome core particle, investigators 


treated chromatin in its unfolded, “beads-on-a-string” form with enzymes (A) interphase 
called nucleases, which cut the DNA by breaking the phosphodiester chromatin 
bonds between nucleotides. When this nuclease digestion is carried out 


for a short time, only the exposed DNA between the core particles—the 
linker DNA—will be cleaved, allowing the core particles to be isolated. 
An individual nucleosome core particle consists of a complex of eight 
histone proteins—two molecules each of histones H2A, H2B, H3, and 
H4—along with a segment of double-stranded DNA, 147 nucleotide pairs 
long, that winds around this histone octamer (Figure 5-20). The high- (8) 
resolution structure of the nucleosome core particle was solved in 1997, 
revealing in atomic detail the disc-shaped histone octamer around which 
the DNA is tightly wrapped, making 1.7 turns in a left-handed coil (Figure 
5-21). The linker DNA between each nucleosome core particle can vary 
in length from a few nucleotide pairs up to about 80. Technically speak- 
ing, a “nucleosome” consists of a nucleosome core particle plus one of 
its adjacent DNA linkers, as shown in Figure 5-20; however, the term is 
often used to refer to the nucleosome core particle itself. 
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Figure 5-19 Nucleosomes can be seen in the electron microscope. (A) Chromatin 
isolated directly from an interphase nucleus can appear in the electron microscope 
as a chromatin fiber, composed of packed nucleosomes. (B) Another electron 
micrograph shows a length of a chromatin fiber that has been experimentally 
unpacked, or decondensed, after isolation to show the “beads-on-a-string” 
appearance of the nucleosomes. (A, courtesy of Barbara Hamkalo; B, courtesy 

of Victoria Foe.) 
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core histones Figure 5-20 Nucleosomes contain DNA wrapped around a protein 
ie of nucleosome core of eight histone molecules. In a test tube, the nucleosome core 
particle can be released from chromatin by digestion of the linker 
š $ DNA with a nuclease, which cleaves the exposed linker DNA but not 
wo o o i the DNA wound tightly around the nucleosome core. When the DNA 
“beads-on-a-string” a aN ora around each isolated nucleosome core particle is released, its length is 
form of chromatin ~ paire of DRA $ found to be 147 nucleotide pairs; this DNA wraps around the histone 
octamer that forms the nucleosome core nearly twice. 
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All four of the histones that make up the octamer are relatively small 
ca 2 "aa 2 "ans 2 s proteins with a high proportion of positively charged amino acids (lysine 
and arginine). The positive charges help the histones bind tightly to the 

| negatively charged sugar—-phosphate backbone of DNA. These numer- 

_ ous electrostatic interactions explain in part why DNA of virtually any 

released sequence can bind to a histone octamer. Each of the histones in the 
Lesa j ie octamer also has a long, unstructured N-terminal amino acid “tail” that 
7 extends out from the nucleosome core particle (see the H3 tail in Figure 


DISSOCIATION 5-21). These histone tails are subject to several types of reversible, cova- 


WITH HIGH lent chemical modifications that control many aspects of chromatin 
eee structure. 

The histones that form the nucleosome core are among the most highly 
ofS conserved of all known eukaryotic proteins: there are only two differ- 
= ences between the amino acid sequences of histone H4 from peas and 
aro Le Wien nal cows, for example. This extreme evolutionary conservation reflects the 

n vital role of histones in controlling eukaryotic chromosome structure. 

yN DISSOCIATION Chromosome Packing Occurs on Multiple Levels 

Although long strings of nucleosomes form on most chromosomal DNA, 
chromatin in the living cell rarely adopts the extended beads-on-a-string 
D 2 FA 2 Ce A e form seen in Figure 5-19B. Instead, the nucleosomes are further packed 
H2A H2B H3 H4 on top of one another to generate a more compact structure, such as 


the chromatin fiber shown in Figure 5-19A and Movie 5.2. This addi- 
tional packing of nucleosomes into a chromatin fiber depends on a fifth 


an H3 histone tail 


viewed 
face-on 


viewed 
from the 


edge 


DNA double helix 


(=) histone H2A Q histone H2B (=) histone H3 (>) histone H4 


Figure 5-21 The structure of the nucleosome core particle, as determined by x-ray diffraction analysis, reveals how DNA is 
tightly wrapped around a disc-shaped histone octamer. Two views of a nucleosome core particle are shown here. The two strands 
of the DNA double helix are shown in gray. A portion of an H3 histone tail (green) can be seen extending from the nucleosome core 
particle, but the tails of the other histones have been truncated. (From K. Luger et al., Nature 389:251-260, 1997.) 
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histone called histone H1, which is thought to pull adjacent nucleosomes 
together into a regular repeating array. This “linker” histone changes the 
path the DNA takes as it exits the nucleosome core, allowing it to form a 
more condensed chromatin fiber. 


We saw earlier that, during mitosis, chromatin becomes so highly con- 
densed that individual chromosomes can be seen in the light microscope. 
How is a chromatin fiber folded to produce mitotic chromosomes? 
Although the answer is not yet known in detail, it is known that special- 
ized nonhistone chromosomal proteins fold the chromatin into a series 
of loops (Figure 5-22). These loops are further condensed to produce the 
interphase chromosome. Finally, this compact string of loops is thought 
to undergo at least one more level of packing to form the mitotic chromo- 
some (Figure 5-23). 
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NET RESULT: EACH DNA MOLECULE HAS BEEN 
PACKAGED INTO A MITOTIC CHROMOSOME THAT 
IS 10,000-FOLD SHORTER THAN ITS FULLY 
EXTENDED LENGTH 


Figure 5-22 The chromatin in human 
chromosomes is folded into looped 
domains. These loops are established by 
special nonhistone chromosomal proteins 
that bind to specific DNA sequences, 
creating a clamp at the base of each loop. 


QUESTION 5-2 


Assuming that the histone 

octamer (shown in Figure 5-20) 
forms a cylinder 9 nm in diameter 
and 5 nm in height and that the 
human genome forms 32 million 
nucleosomes, what volume of 

the nucleus (6 um in diameter) is 
occupied by histone octamers? 
(Volume of a cylinder is nr?h; volume 
of a sphere is 4/3 ar3.) What fraction 
of the total volume of the nucleus 
do the histone octamers occupy? 
How does this compare with the 
volume of the nucleus occupied by 
human DNA? 


Figure 5-23 DNA packing occurs on 
several levels in chromosomes. This 
schematic drawing shows some of the levels 
thought to give rise to the highly condensed 
mitotic chromosome. Both histone H1 anda 
set of specialized nonhistone chromosomal 
proteins are known to help drive these 
condensations, including the chromosome 
loop-forming clamp proteins and the 
abundant non-histone protein condensin 
(see Figure 18-18). However, the actual 
structures are still uncertain. 
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So far, we have discussed how DNA is packed tightly into chromatin. We 
now turn to the question of how this packaging can be adjusted to allow 
rapid access to the underlying DNA. The DNA in cells carries enormous 
amounts of coded information, and cells must be able to retrieve this 
information as needed. 


In this section, we discuss how a cell can alter its chromatin structure to 
expose localized regions of DNA and allow access to specific proteins and 
protein complexes, particularly those involved in gene expression and in 
DNA replication and repair. We then discuss how chromatin structure is 
established and maintained—and how a cell can pass on some forms of 
this structure to its descendants, helping different cell types to sustain 
their identity. Although many of the details remain to be deciphered, the 
regulation and inheritance of chromatin structure play crucial roles in the 
development of eukaryotic organisms. 


Changes in Nucleosome Structure Allow Access to DNA 


Eukaryotic cells have several ways to adjust rapidly the local structure 
of their chromatin. One way takes advantage of a set of ATP-dependent 
chromatin-remodeling complexes. These protein machines use the 
energy of ATP hydrolysis to change the position of the DNA wrapped 
around nucleosomes (Figure 5-24). By interacting with both the histone 
octamer and the DNA wrapped around it, chromatin-remodeling com- 
plexes can locally alter the arrangement of the nucleosomes, rendering 
the DNA more accessible (or less accessible) to other proteins in the cell. 
During mitosis, many of these complexes are inactivated, which may 
help mitotic chromosomes maintain their tightly packed structure. 


Another way of altering chromatin structure relies on the reversible 
chemical modification of histones, catalyzed by a large number of dif- 
ferent histone-modifying enzymes. The tails of all four of the core 
histones are particularly subject to these covalent modifications, which 
include the addition (and removal) of acetyl, phosphate, or methyl groups 


10 nm 


(B) 


Figure 5-24 Chromatin-remodeling complexes locally reposition the DNA wrapped around nucleosomes. (A) The complexes use 
energy derived from ATP hydrolysis to loosen the nucleosomal DNA and push it along the histone octamer. In this way, the enzyme 
can expose or hide a sequence of DNA, controlling its availability to other DNA-binding proteins. The blue stripes have been added 
to show how the DNA shifts its position. Many cycles of ATP hydrolysis are required to produce such a shift. (B) The structure of a 
chromatin-remodeling complex, showing how the enzyme cradles a nucleosome core particle, including a histone octamer (orange) 
and the DNA wrapped around it (light green). This large complex, purified from yeast, contains 15 subunits, including one that 
hydrolyzes ATP and four that recognize specific covalently modified histones. (B, adapted from A.E. Leschziner et al., Proc. Natl. Acad. 
Sci. USA 104:4913-4918, 2007.) 
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(Figure 5—25A). These and other modifications can have important con- 
sequences for the packing of the chromatin fiber. Acetylation of lysines, 
for instance, can reduce the affinity of the tails for adjacent nucleosomes, 
thereby loosening chromatin structure and allowing access to particular 
nuclear proteins. 


Most importantly, however, these modifications generally serve as dock- 
ing sites on the histone tails for a variety of regulatory proteins. Different 
patterns of modifications attract specific sets of non-histone chro- 
mosomal proteins to a particular stretch of chromatin. Some of these 
proteins promote chromatin condensation, whereas others promote 
chromatin expansion and thus facilitate access to the DNA. Specific 
combinations of tail modifications, and the proteins that bind to them, 
have different functional outcomes for the cell: one pattern, for example, 
might mark a particular stretch of chromatin as newly replicated; another 
might indicate that the genes in that stretch of chromatin are being 
actively expressed; still others are associated with genes that are silenced 
(Figure 5—25B). 


Both ATP-dependent chromatin-remodeling complexes and histone-mod- 
ifying enzymes are tightly regulated. These enzymes are often brought to 
particular chromatin regions by interactions with proteins that bind to a 
specific nucleotide sequence in the DNA—or in an RNA transcribed from 
this DNA (a topic we return to in Chapter 8). Histone-modifying enzymes 
work in concert with the chromatin-remodeling complexes to condense 
and relax stretches of chromatin, allowing local chromatin structure to 
change rapidly according to the needs of the cell. 


Interphase Chromosomes Contain both Highly 
Condensed and More Extended Forms of Chromatin 


The localized alteration of chromatin packing by remodeling complexes 
and histone modification has important effects on the large-scale struc- 
ture of interphase chromosomes. Interphase chromatin is not uniformly 
packed. Instead, regions of the chromosome containing genes that are 
being actively expressed are generally more extended, whereas those 
that contain silent genes are more condensed. Thus, the detailed struc- 
ture of an interphase chromosome can differ from one cell type to the 
next, helping to determine which genes are switched on and which are 
shut down. Most cell types express only about half of the genes they con- 
tain, and many of these are active only at very low levels. 


The most highly condensed form of interphase chromatin is called 
heterochromatin (from the Greek heteros, “different,” chromatin). This 
highly compact form of chromatin was first observed in the light micro- 
scope in the 1930s as discrete, strongly staining regions within the total 


Figure 5-25 The pattern of modification 

of histone tails can determine how a stretch 
of chromatin is handled by the cell. 

(A) Schematic drawing showing the positions 
of the histone tails that extend from each 
nucleosome core particle. Each histone can 
be modified by the covalent attachment of a 
number of different chemical groups, mainly 
to the tails. The tail of histone H3, for example, 
can receive acetyl groups (Ac), methyl groups 
(M), or phosphate groups (P). The numbers 
denote the positions of the modified amino 
acids in the histone tail, with each amino acid 
designated by its one-letter code. Note that 
some amino acids, such as the lysine (K) at 
positions 9, 14, 23, and 27, can be modified 
by acetylation or methylation (but not by 
both at once). Lysines, in addition, can be 
modified with either one, two, or three methyl 
groups; trimethylation, for example, is shown 
in (B). Note that histone H3 contains 135 
amino acids, most of which are in its globular 
portion (represented by the wedge); most 
modifications occur on the N-terminal tail, for 
which 36 amino acids are shown. (B) Different 
combinations of histone tail modifications can 
confer a specific meaning on the stretch of 
chromatin on which they occur, as indicated. 
Only a few of these functional outcomes are 
known. 


QUESTION 5-3 


Histone proteins are among the 
most highly conserved proteins in 
eukaryotes. Histone H4 proteins 
from a pea and a cow, for example, 
differ in only 2 of 102 amino acids. 
Comparison of the gene sequences 
shows many more differences, but 
only two change the amino acid 
sequence. These observations 
indicate that mutations that change 
amino acids must have been 
selected against during evolution. 
Why do you suppose that amino- 
acid-altering mutations in histone 
genes are deleterious? 
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Figure 5—26 The structure of chromatin 
varies along a single interphase 
chromosome. As schematically indicated by 
the path of the DNA molecule (represented 
by the central black line) and the different 
arbitrarily assigned colors, heterochromatin 
and euchromatin each represent a set 

of different chromatin structures with 
different degrees of condensation. Overall, 
heterochromatin is more condensed than 
euchromatin. 


Figure 5-27 Heterochromatin- 

specific histone modifications allow 
heterochromatin to form and to 
spread. These modifications attract 
heterochromatin-specific proteins that 
reproduce the same histone modifications 
on neighboring nucleosomes. In this 
manner, heterochromatin can spread until 
it encounters a barrier DNA sequence that 
blocks further propagation into regions of 
euchromatin. 


centromere 


telomere 


chromatin mass. Heterochromatin typically makes up about 10% of an 
interphase chromosome, and in mammalian chromosomes, it is concen- 
trated around the centromere region and in the telomeric DNA at the 
chromosome ends (see Figure 5-14). 


The rest of the interphase chromatin is called euchromatin (from the 
Greek eu, “true” or “normal,” chromatin). Although we use the term 
euchromatin to refer to chromatin that exists in a less condensed state 
than heterochromatin, it is now clear that both euchromatin and hetero- 
chromatin are composed of mixtures of different chromatin structures 
(Figure 5-26). 


Each type of chromatin structure is established and maintained by dif- 
ferent sets of histone tail modifications, which attract distinct sets of 
nonhistone chromosomal proteins. The modifications that direct the 
formation of the most common type of heterochromatin, for example, 
include the methylation of lysine 9 in the tail of histone H3 (see Figure 
5-25B). Once heterochromatin has been established, it can spread to 
neighboring regions of DNA, because its histone tail modifications attract 
a set of heterochromatin-specific proteins, including histone-modifying 
enzymes, which then add the same histone tail modifications on adjacent 
nucleosomes. These modifications in turn recruit more of the hetero- 
chromatin-specific proteins, causing a wave of condensed chromatin to 
propagate along the chromosome. This extended region of heterochro- 
matin will continue to spread until it encounters a barrier DNA sequence 
that stops the propagation (Figure 5-27). As an example, some barrier 
sequences contain binding sites for histone-modifying enzymes that add 
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an acetyl group to lysine 9 of the histone H3 tail; this modification blocks 
the methylation of that lysine, preventing any further spread of hetero- 
chromatin (see Figure 5—25B). 


Much of the DNA that is folded into heterochromatin does not contain 
genes. Because heterochromatin is so compact, genes that accidentally 
become packaged into heterochromatin usually fail to be expressed. Such 
inappropriate packaging of genes in heterochromatin can cause disease: 
in humans, the gene that encodes B-globin—a protein that forms part of 
the oxygen-carrying hemoglobin molecule—is situated near a region of 
heterochromatin. In an individual with an inherited deletion of its barrier 
DNA, that heterochromatin spreads and deactivates the B-globin gene, 
causing severe anemia. 


Perhaps the most striking example of the use of heterochromatin to 
keep genes shut down, or silenced, is found in the interphase X chro- 
mosomes of female mammals. In mammals, female cells contain two X 
chromosomes, whereas male cells contain one X and one Y. A double 
dose of X-chromosome products could be lethal, and female mammals 
have evolved a mechanism for permanently inactivating one of the two X 
chromosomes in each cell. At random, one or other of the two X chromo- 
somes in each nucleus becomes highly condensed into heterochromatin 
early in embryonic development. Thereafter, the condensed and inactive 
state of that X chromosome is inherited in all of the many descendants of 
those cells (Figure 5-28). This process of X-inactivation is responsible for 
the patchwork coloration of calico cats (Figure 5-29). 


X-inactivation is an extreme example of a process that takes place in all 
eukaryotic cells—one that operates on a much finer scale to help control 
gene expression. When a cell divides, it can pass along its histone modi- 
fications, chromatin structure, and gene expression patterns to the two 
daughter cells. Such “cell memory” transmits information about which 
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Figure 5-28 One of the two X chromosomes 
is inactivated in the cells of mammalian 
females by heterochromatin formation. 

(A) Each female cell contains two X 
chromosomes, one from the mother (Xm) and 
one from the father (X,). At an early stage in 
embryonic development, one of these two 
chromosomes becomes condensed into 
heterochromatin in each cell, apparently 

at random. At each cell division, the same 

X chromosome becomes condensed (and 
inactivated) in all the descendants of that 
original cell. Thus, all mammalian females end 
up as mixtures (mosaics) of cells bearing either 
inactivated maternal or inactivated paternal 

X chromosomes. In most of their tissues and 
organs, about half the cells will be of one 
type, and the rest will be of the other. (B) In 
the nucleus of a female cell, the inactivated X 
chromosome can be seen as a small, discrete 
mass of chromatin called a Barr body, named 
after the physician who first observed it. In 
these micrographs of the nuclei of human 
fibroblasts, the inactivated X chromosome in 
the female nucleus (bottom micrograph) has 
been visualized by use of an antibody that 
recognizes proteins associated with the Barr 
body. The male nucleus (top) contains only a 
single X chromosome, which is not inactivated 
and thus not recognized by this antibody. 
Below the micrographs, a cartoon shows the 
locations of both the active and the inactive 
X chromosomes in the female nucleus. 

(B, adapted from B. Hong et al. Proc. Natl 
Acad. Sci. USA 98:8703-8708, 2001.) 
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Figure 5-29 The coat color of a calico 

cat is dictated in large part by patterns 

of X-inactivation. In cats, one of the genes 
specifying coat color is located on the 

X chromosome. In female calicos, one 

X chromosome carries the form of the gene that 
specifies black fur, the other carries the form of 
the gene that specifies orange fur. Skin cells in 
which the X chromosome carrying the gene for 
black fur is inactivated will produce orange fur; 
those in which the X chromosome carrying the 
gene for orange fur is inactivated will produce 
black fur. The size of each patch will depend on 
the number of skin cells that have descended 
from an embryonic cell in which one or the 
other of the X chromosomes was randomly 
inactivated during development (see Figure 
5-28). (bluecaterpillar/Depositphotos.) 


QUESTION 5-4 


Mutations in a particular gene on 
the X chromosome result in color 
blindness in men. By contrast, most 
women carrying the mutation have 
proper color vision but see colored 
objects with reduced resolution, as 
though functional cone cells (the 
photoreceptor cells responsible for 
color vision) are spaced farther apart 
than normal in the retina. Can you 
give a plausible explanation for this 
observation? If a woman is color- 
blind, what could you say about her 
father? About her mother? Explain 
your answers. 


genes are active and which are not—a process critical for the establish- 
ment and maintenance of different cell types during the development of 
a complex multicellular organism. We discuss some of the mechanisms 
involved in cell memory in Chapter 8, when we consider how cells con- 
trol gene expression. 


ESSENTIAL CONCEPTS 


Life depends on the stable storage, maintenance, and inheritance of 
genetic information. 


Genetic information is carried by very long DNA molecules and is 
encoded in the linear sequence of four nucleotides: A, T, G, and C. 


Each molecule of DNA is a double helix composed of a pair of 
antiparallel, complementary DNA strands, which are held together 
by hydrogen bonds between G-C and A-T base pairs. 


The genetic material of a eukaryotic cell—its genome—is contained 
in a set of chromosomes, each formed from a single, enormously long 
DNA molecule that contains many genes. 


When a gene is expressed, part of its nucleotide sequence is tran- 
scribed into RNA molecules, most of which are translated to produce 
a protein. 


The DNA that forms each eukaryotic chromosome contains, in addi- 
tion to genes, many replication origins, one centromere, and two 
telomeres. These special DNA sequences ensure that, before cell 
division, each chromosome can be duplicated efficiently, and that 
the resulting daughter chromosomes can be parceled out equally to 
the two daughter cells. 


In eukaryotic chromosomes, the DNA is tightly folded by binding to a 
set of histone and nonhistone chromosomal proteins. This complex 
of DNA and protein is called chromatin. 


Histones pack the DNA into a repeating array of DNA-protein par- 
ticles called nucleosomes, which further fold up into even more 
compact chromatin structures. 


A cell can regulate its chromatin structure—temporarily decondens- 
ing or condensing particular regions of its chromosomes—using 
chromatin-remodeling complexes and enzymes that covalently mod- 
ify histone tails in various ways. 


The loosening of chromatin to a more decondensed state allows pro- 
teins involved in gene expression, DNA replication, and DNA repair to 
gain access to the necessary DNA sequences. 


Some forms of chromatin have a pattern of histone tail modification 
that causes the DNA to become so highly condensed that its genes 
cannot be expressed to produce RNA; a high degree of condensation 
occurs on all chromosomes during mitosis and in the heterochroma- 
tin of interphase chromosomes. 
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HOW WE KNOW 
GENES ARE MADE OF DNA 


By the 1920s, scientists generally agreed that genes 
reside on chromosomes. And studies in the late nine- 
teenth century had demonstrated that chromosomes are 
composed of both DNA and proteins. But because DNA 
is so chemically simple, biologists naturally assumed 
that genes had to be made of proteins, which are much 
more chemically diverse than DNA molecules. Even 
when the experimental evidence suggested otherwise, 
this assumption proved hard to shake. 
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Messages from the dead 


The case for DNA began to emerge in the late 1920s, 
when a British medical officer named Fred Griffith made 
an astonishing discovery. He was studying Streptococcus 
pneumoniae (pneumococcus), a bacterium that causes 
pneumonia. As antibiotics had not yet been discovered, 
infection with this organism was usually fatal. When 
grown in the laboratory, pneumococci come in two 
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Figure 5-30 Griffith showed that 
heat-killed infectious bacteria can 
transform harmless live bacteria 

into pathogens. The bacterium 
Streptococcus pneumoniae comes in 
two forms that differ in their microscopic 
appearance and in their ability to cause 
disease. Cells of the pathogenic strain, 
which are lethal when injected into 
mice, are encased in a slimy, glistening 
polysaccharide capsule. When grown 

on a plate of nutrients in the laboratory, 
this disease-causing bacterium forms 
colonies that look dome-shaped and 
smooth; hence it is designated the 

S form. The harmless strain of the 
pneumococcus, on the other hand, lacks 
this protective coat; it forms colonies 
that appear flat and rough—hence, it is 
referred to as the R form. As illustrated 
in this diagram, Griffith found that a 
substance present in the pathogenic 

S strain could permanently change, or 
transform, the nonlethal R strain into the 
deadly S strain. 
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living, pathogenic 
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forms: a pathogenic form that causes a lethal infection 
when injected into animals, and a harmless form that is 
easily conquered by the animal’s immune system and 
does not produce an infection. 


In the course of his investigations, Griffith injected vari- 
ous preparations of these bacteria into mice. He showed 
that pathogenic pneumococci that had been killed by 
heating were no longer able to cause infection. The 
surprise came when Griffith injected both heat-killed 
pathogenic bacteria and live harmless bacteria into 
the same mouse. This combination proved unexpect- 
edly lethal: not only did the animals die of pneumonia, 
but Griffith found that their blood was teeming with 
live bacteria of the pathogenic form (Figure 5-30). The 
heat-killed pneumococci had somehow converted the 
harmless bacteria into the lethal form. What’s more, 
Griffith found that the change was permanent: he could 
grow these “transformed” bacteria in culture, and they 
remained pathogenic. But what was this mysterious 
material that turned harmless bacteria into killers? And 
how was this change passed on to progeny bacteria? 


Transformation 


Griffith’s remarkable finding set the stage for the experi- 
ments that would provide the first strong evidence 
that genes are made of DNA. The American bacteri- 
ologist Oswald Avery, following up on Griffith’s work, 
discovered that the harmless pneumococcus could be 
transformed into a pathogenic strain in a test tube by 
exposing it to an extract prepared from the pathogenic 
strain. It would take another 15 years, however, for 
Avery and his colleagues Colin MacLeod and Maclyn 
McCarty to successfully purify the “transforming prin- 
ciple” from this soluble extract and to demonstrate that 
the active ingredient was DNA. Because the transform- 
ing principle caused a heritable change in the bacteria 
that received it, DNA must be the very stuff of which 
genes are made. 


The 15-year delay was in part a reflection of the aca- 
demic climate—and the widespread supposition that 
the genetic material was likely to be made of protein. 
Because of the potential ramifications of their work, the 
researchers wanted to be absolutely certain that the 
transforming principle was DNA before they announced 
their findings. As Avery noted in a letter to his brother, 
also a bacteriologist, “It’s lots of fun to blow bubbles, 
but it’s wiser to prick them yourself before someone else 
tries to.” So the researchers subjected the transforming 
material to a battery of chemical tests (Figure 5-31). 
They found that it exhibited all the chemical proper- 
ties characteristic of DNA; furthermore, they showed 
that enzymes that destroy proteins and RNA did not 
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Figure 5-31 Avery, MacLeod, and McCarty demonstrated 
that DNA is the genetic material. The researchers prepared 

an extract from the disease-causing S strain of pneumococci 
and showed that the “transforming principle” that would 
permanently change the harmless R-strain pneumococci into the 
pathogenic S strain is DNA. This was the first evidence that DNA 
could serve as the genetic material. 


affect the ability of the extract to transform bacteria, 
while enzymes that destroy DNA inactivated it. And like 
Griffith before them, the investigators found that their 
purified preparation changed the bacteria permanently: 
DNA from the pathogenic species was taken up by the 
harmless species, and this change was faithfully passed 
on to subsequent generations of bacteria. 


This landmark study offered rigorous proof that purified 
DNA can act as genetic material. But the resulting paper, 
published in 1944, drew strangely little attention. Despite 
the meticulous care with which these experiments were 
performed, geneticists were not immediately convinced 
that DNA is the hereditary material. Many argued that 
the transformation might have been caused by some 
trace protein contaminant in the preparations. Or that 
the extract might contain a mutagen that alters the 
genetic material of the harmless bacteria—converting 
them to the pathogenic form—rather than containing 
the genetic material itself. 


Virus cocktails 


The debate was not settled definitively until 1952, when 
Alfred Hershey and Martha Chase fired up their labora- 
tory blender and demonstrated, once and for all, that 
genes are made of DNA. The researchers were study- 
ing T2—a virus that infects and eventually destroys the 
bacterium E. coli. These bacteria-killing viruses behave 
like tiny molecular syringes: they inject their genetic 
material into the bacterial host cell, while the empty 
virus heads remain attached outside (Figure 5-32A). 
Once inside the bacterial cell, the viral genes direct the 
formation of new virus particles. In less than an hour, 
the infected cells explode, spewing thousands of new 
viruses into the medium. These then infect neighboring 
bacteria, and the process begins again. 


The beauty of T2 is that these viruses contain only two 
kinds of molecules: DNA and protein. So the genetic 
material had to be one or the other. But which? The 
experiment was fairly straightforward. Because the 
viral genes enter the bacterial cell, while the rest of the 
virus particle remains outside, the researchers decided 
to radioactively label the protein in one batch of virus 
and the DNA in another. Then, all they had to do was 
follow the radioactivity to see whether viral DNA or 
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viral protein wound up inside the bacteria. To do this, 
Hershey and Chase incubated their radiolabeled viruses 
with E. coli; after allowing a few minutes for infection to 
take place, they poured the mix into a Waring blender 
and hit “puree.” The blender’s spinning blades sheared 
the empty virus heads from the surfaces of the bacte- 
rial cells. The researchers then centrifuged the sample 
to separate the heavier, infected bacteria, which formed 
a pellet at the bottom of the centrifuge tube, from the 
empty viral coats, which remained in suspension (Figure 
5-32B). 

As you have probably guessed, Hershey and Chase 
found that the radioactive DNA entered the bacterial 
cells, while the radioactive proteins remained outside 
with the empty virus heads. They found that the radioac- 
tive DNA was also incorporated into the next generation 
of virus particles. 


This experiment demonstrated conclusively that viral 
DNA enters bacterial host cells, whereas viral protein 
does not. Thus, the genetic material in this virus had 
to be made of DNA. Together with the studies done by 
Avery, MacLeod, and McCarty, this evidence clinched 
the case for DNA as the agent of heredity. 
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Figure 5-32 Hershey and Chase showed definitively that genes are made of DNA. (A) The researchers worked with T2 viruses, 
which are made entirely of protein and DNA. Each virus acts as a molecular syringe, injecting its genetic material into a bacterium; 
the empty viral capsule remains attached to the outside of the cell. (B) To determine whether the genetic material of the virus is made 
of protein or DNA, the researchers labeled the DNA in one batch of viruses with radioactive phosphorous (32P) and the proteins in a 
second batch of viruses with radioactive sulfur (S). Because DNA lacks sulfur and the proteins lack phosphorus, these radioactive 


isotopes allowed the researchers to distinguish these two types of molecules. The radioactively labeled viruses were allowed to infect 
E. coli, and the mixture was then disrupted by brief pulsing in a Waring blender and centrifuged to separate the infected bacteria from 
the empty viral heads. When the researchers measured the radioactivity, they found that much of the °#P-labeled DNA had entered 
the bacterial cells, while the vast majority of the S-labeled proteins remained in solution with the spent viral particles. Furthermore, 
the radioactively labeled DNA also made its way into subsequent generations of virus particles, confirming that DNA is the heritable, 
genetic material. 
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QUESTION 5-5 


A. The nucleotide sequence of one DNA strand of a DNA 
double helix is 5’-GGATTTTTGTCCACAATCA-3’. 


What is the sequence of the complementary strand? 


B. Inthe DNA of certain bacterial cells, 13% of the 
nucleotides contain adenine. What are the percentages of 
the other nucleotides? 


C. How many possible nucleotide sequences are there for a 
stretch of single-stranded DNA that is N nucleotides long? 


D. Suppose you had a method of cutting DNA at specific 
sequences of nucleotides. How many nucleotides long 
(on average) would such a sequence have to be in order 
to make just one cut in a bacterial genome of 3 x 10° 
nucleotide pairs? How would the answer differ for the 
genome of an animal cell that contains 3 x 10? nucleotide 
pairs? 


QUESTION 5-6 


An A-T base pair is stabilized by only two hydrogen bonds. 
Hydrogen-bonding schemes of very similar strengths can 
also be drawn between other base combinations that 
normally do not occur in DNA molecules, such as the A-C 
and the A-G pairs shown in Figure Q5—6. What would 
happen if these pairs formed during DNA replication and 
the inappropriate bases were incorporated? Discuss why 
this does not often happen. (Hint: see Figure 5-4.) 
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QUESTION 5-7 


A. A macromolecule isolated from an extraterrestrial source 
superficially resembles DNA, but closer analysis reveals that 


the bases have quite different structures (Figure Q5-7). 
Bases V, W, X, and Y have replaced bases A, T, G, and C. 
Look at these structures closely. Could these DNA-like 
molecules have been derived from a living organism that 
uses principles of genetic inheritance similar to those used 
by organisms on Earth? 


B. Simply judged by their potential for hydrogen-bonding, 
could any of these extraterrestrial bases replace terrestrial 
A, T, G, or C in terrestrial DNA? Explain your answer. 
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QUESTION 5-8 


The two strands of a DNA double helix can be separated 
by heating. If you raised the temperature of a solution 
containing the following three DNA molecules, in what 
order do you suppose they would “melt”? Explain your 
answer. 


A. 5'-GCGGGCCAGCCCGAGTGGGTAGCCCAGG-3' 
3'-CGCCCGGTCGGGCTCACCCATCGGGTCC-5' 


B. 5’-ATTATAAAATATTTAGATACTATATTTACAA-3' 


3’-TAATATTTTATAAATCTATGATATAAATGTT-5' 


C. 5'-AGAGCTAGATCGAT-3" 


3’-TCTCGATCTAGCTA-5' 


QUESTION 5-9 


The total length of DNA in one copy of the human genome 
is about 1 m, and the diameter of the double helix is about 
2 nm. Nucleotides in a DNA double helix are stacked (see 
Figure 5-4B) at an interval of 0.34 nm. If the DNA were 
enlarged so that its diameter equaled that of an electrical 
extension cord (5 mm), how long would the extension cord 
be from one end to the other (assuming that it is completely 
stretched out)? How close would the bases be to each 
other? How long would a gene of 1000 nucleotide pairs be? 


Questions 197 


QUESTION 5-10 


A compact disc (CD) stores about 4.8 x 10? bits of 
information in a 96 cm? area. This information is stored as a 
binary code—that is, every bit is either a O or a1. 


QUESTION 5-14 


The two electron micrographs in Figure Q5-14 show nuclei 
of two different cell types. Can you tell from these pictures 
which of the two cells is transcribing more of its genes? 
Explain how you arrived at your answer. (Micrographs 


A. How many bits would it take to specify each nucleotide 
courtesy of Don W. Fawcett.) 


pair in a DNA sequence? 

B. How many CDs would it take to store the information 
contained in the human genome? 

QUESTION 5-11 


Which of the following statements are correct? Explain your 
answers. 


A. Each eukaryotic chromosome must contain the following 
DNA sequence elements: multiple origins of replication, two 


telomeres, and one centromere. 


B. Nucleosome core particles are 30 nm in diameter. 


QUESTION 5-12 


Define the following terms and their relationships to one 
another: 


Interphase chromosome 
Mitotic chromosome 
Chromatin 
Heterochromatin 


Histones 
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Nucleosome 


QUESTION 5-13 


Carefully consider the result shown in Figure O5-13. 

Each of the two colonies shown on the left is a clump of 
approximately 100,000 yeast cells that has grown up from 
a single cell, which is now somewhere in the middle of the 
colony. The two yeast colonies are genetically different, 

as shown by the chromosomal maps on the right. The 
yeast Ade2 gene encodes one of the enzymes required for 
adenine biosynthesis, and the absence of the Ade2 gene 
product leads to the accumulation of a red pigment. At its 
normal chromosome location, Ade2 is expressed in all cells. 
When it is positioned near the telomere, which is highly 
condensed, Ade2 is no longer expressed. How do you think 
the white sectors arise? What can you conclude about the 
propagation of the transcriptional state of the Ade2 gene 
from mother to daughter cells? 
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QUESTION 5-15 


DNA forms a right-handed helix. Pick out the right-handed 
helix from those shown in Figure Q5-15. 
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QUESTION 5-16 


A single nucleosome core particle is 11 nm in diameter 

and contains 147 base pairs (bp) of DNA (the DNA double 
helix measures 0.34 nm/bp). What packing ratio (ratio of 
DNA length to nucleosome diameter) has been achieved by 
wrapping DNA around the histone octamer? Assuming that 
there are an additional 54 bp of extended DNA in the linker 
between nucleosomes, how condensed is “beads-on-a- 
string” DNA relative to fully extended DNA? What fraction 
of the 10,000-fold condensation that occurs at mitosis does 
this first level of packing represent? 
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DNA Replication and Repair 


For a cell to survive and proliferate in a chaotic environment, it must be 
able to accurately copy the vast quantity of genetic information carried in 
its DNA. This fundamental process, called DNA replication, must occur 
before a cell can divide to produce two genetically identical daughter 
cells. In addition to carrying out this painstaking task with stunning accu- 
racy and efficiency, a cell must also continuously monitor and repair its 
genetic material, as DNA is subjected to unavoidable damage by chemi- 
cals and radiation in the environment and by reactive molecules that are 
generated inside the cell. 


Yet despite the molecular safeguards that have evolved to protect a cell’s 
DNA from copying errors and accidental damage, permanent changes— 
or mutations—sometimes do occur. Although most mutations do not 
affect the organism in any noticeable way, some have profound con- 
sequences. Occasionally, these changes can benefit the organism: for 
example, mutations can make bacteria resistant to antibiotics that are 
used to kill them. What is more, changes in DNA sequence can produce 
small variations that underlie the differences between individuals of the 
same species ( ); such changes, when they accumulate over 
hundreds of millions of years, provide the variety in genetic material that 
makes one species distinct from another, as we discuss in Chapter 9. 


Unfortunately, as mutations occur randomly, they are more likely to be 
detrimental than beneficial: they are responsible for thousands of human 
diseases, including cancer. The survival of a cell or organism, therefore, 
depends on keeping the changes in its DNA to a minimum. Without the 
systems that are continually inspecting and repairing damage to DNA, it 
is questionable whether life could exist at all. In this chapter, we describe 
the protein machines that replicate and repair the cell’s DNA. These 
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DNA Replication and Repair 


Figure 6-1 Differences in DNA can produce the variations 
that underlie the differences between individuals of the same 
species—even within the same family. Over evolutionary time, 
these genetic changes give rise to the differences that distinguish 
one species from another. 


machines catalyze some of the most rapid and elegant processes that take 
place within cells, and uncovering the strategies they employ to achieve 
these marvelous feats represents a triumph of scientific investigation. 


DNA REPLICATION 


At each cell division, a cell must copy its genome with extraordinary 
accuracy. In this section, we explore how the cell achieves this feat, while 
replicating its DNA at rates as high as 1000 nucleotides per second. 


Base-Pairing Enables DNA Replication 


In the preceding chapter, we saw that each strand of a DNA double helix 
contains a sequence of nucleotides that is exactly complementary to 
the nucleotide sequence of its partner strand. Each strand can therefore 
serve as a template, or mold, for the synthesis of a new complementary 
strand. In other words, if we designate the two DNA strands as S and S’, 
strand S can serve as a template for making a new strand S’, while strand 
S' can serve as a template for making a new strand S (Figure 6-2). Thus, 
the genetic information in DNA can be accurately copied by the beauti- 
fully simple process in which strand S separates from strand S', and each 
separated strand then serves as a template for the production of a new 
complementary partner strand that is identical to its former partner. 


The ability of each strand of a DNA molecule to act as a template for 
producing a complementary strand enables a cell to copy, or replicate, 
its genes before passing them on to its descendants. Although simple in 
principle, the process is awe-inspiring, as it can involve the copying of 
billions of nucleotide pairs with incredible speed and accuracy: a human 
cell undergoing division will copy the equivalent of 1000 books like this 
one in about 8 hours and, on average, get no more than a few letters 
wrong. This impressive feat is performed by a cluster of proteins that 
together form a replication machine. 


template S strand 


parent DNA double helix 


template S’ strand 


Figure 6-2 DNA acts as a template for its own replication. Because the nucleotide A 
will successfully pair only with T, and G with C, each strand of a DNA double helix—labeled 
here as the S strand and its complementary S’ strand—can serve as a template to specify 
the sequence of nucleotides in its complementary strand. In this way, both strands of a 
DNA double helix can be copied with precision. 


Figure 6-3 In each round of DNA replication, each of the two 
strands of DNA is used as a template for the formation of a new, 
complementary strand. DNA replication is “semiconservative” 
because each daughter DNA double helix is composed of one 
conserved (old) strand and one newly synthesized strand. 


DNA replication produces two complete double helices from the original 
DNA molecule, with each new DNA helix being identical in nucleotide 
sequence (except for rare copying errors) to the original DNA double 
helix (see Figure 6-2). Because each parental strand serves as the tem- 
plate for one new strand, each of the daughter DNA double helices ends 
up with one of the original (old) strands plus one strand that is completely 
new; this style of replication is said to be semiconservative (Figure 6-3). 
We describe the inventive experiments that revealed this feature of DNA 
replication in How We Know, pp. 202-204. 


DNA Synthesis Begins at Replication Origins 


The DNA double helix is normally very stable: the two DNA strands are 
locked together firmly by the large numbers of hydrogen bonds between 
the bases on both strands (see Figure 5-2). As a result, only temperatures 
approaching those of boiling water provide enough thermal energy to 
separate the two strands. To be used as a template, however, the double 
helix must first be opened up and the two strands separated to expose 
the nucleotide bases. How does this separation occur at the temperatures 
found in living cells? 


The process of DNA synthesis is begun by initiator proteins that bind to 
specific DNA sequences called replication origins. Here, the initiator 
proteins pry the two DNA strands apart, breaking the hydrogen bonds 
between the bases (Figure 6-4). Although the hydrogen bonds collec- 
tively make the DNA helix very stable, individually each hydrogen bond is 
weak (as discussed in Chapter 2). Separating a short length of DNA a few 
base pairs at a time therefore does not require a large energy input, and 
the initiator proteins can readily unzip short regions of the double helix 
at normal temperatures. 


In simple cells such as bacteria or yeast, replication origins span approxi- 
mately 100 nucleotide pairs. They are composed of DNA sequences that 
attract the initiator proteins and are especially easy to open. We saw in 
Chapter 5 that an A-T base pair is held together by fewer hydrogen bonds 
than is a G-C base pair. Therefore, DNA rich in A-T base pairs is easier to 
pull apart, and A-T-rich stretches of DNA are typically found at replica- 
tion origins. 


A bacterial genome, which is typically contained in a circular DNA mol- 
ecule of several million nucleotide pairs, has a single replication origin. 
The human genome, which is very much larger, has approximately 10,000 
such origins—an average of 220 origins per chromosome. Beginning 
DNA replication at many places at once greatly shortens the time a cell 
needs to copy its entire genome. 


Once an initiator protein binds to DNA at a replication origin and locally 
opens up the double helix, it attracts a group of proteins that carry out 
DNA replication. These proteins form a replication machine, in which 
each protein carries out a specific function. 


Two Replication Forks Form at Each Replication Origin 


DNA molecules in the process of being replicated contain Y-shaped junc- 
tions called replication forks. Two replication forks are formed at each 
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Figure 6-4 A DNA double helix is opened 


at replication origins. DNA sequences 
at replication origins are recognized by 


initiator proteins (not shown), which locally 


pull apart the two strands of the double 


helix. The exposed single strands can then 


serve as templates for copying the DNA. 


™ HOW WE KNOW 
THE NATURE OF REPLICATION 


In 1953, James Watson and Francis Crick published unwound without making a big tangled mess? Watson 
their famous two-page paper describing a model for and Crick’s conception of the DNA helix opening up like 
the structure of DNA. In this report, they proposed that a zipper seemed, to Delbrück, physically unlikely and 
complementary bases—adenine and thymine, guanine simply “too inelegant to be efficient.” 


and cytosine—pair with one another along the center 
of the double helix, holding together the two strands 
of DNA (see Figure 5-2). At the very end of this suc- 
cinct scientific blockbuster, they comment, almost as 


Instead, Delbrtick proposed that DNA replication pro- 
ceeds through a series of breaks and reunions, in which 
the DNA backbone is broken and the strands are cop- 
ied in short segments—perhaps only 10 nucleotides at 


an aside, “It has not escaped our notice that the spe- à time—before being rejoined. In this model, which was 
cific pairing we have postulated immediately suggests later dubbed dispersive, the resulting copies would be 
possible copying mechanism for the genetic material.” patchwork collections of old and new DNA, each strand 
Indeed, one month after the classic paper appeared in containing a mixture of both (Figure 6-5B). No unwind- 
print in the journal Nature, Watson and Crick published ing was necessary. 

a second article, suggesting how DNA might be repli- Yet a third camp promoted the idea that DNA replica- 
cated. In this paper, they proposed that the two strands tion might be conservative: that the parent helix would 
of the double helix unwind, and that each strand serves somehow remain entirely intact after copying, and the 
as a template for the synthesis of a complementary daughter molecule would contain two entirely new 
daughter strand. In their model, dubbed semiconserva- DNA strands (Figure 6-5C). To determine which of 
tive replication, each new DNA molecule consists of these models was correct, an experiment was needed— 
one strand derived from the original parent molecule one that would reveal the composition of the newly 
and one newly synthesized strand (Figure 6—5A). synthesized DNA strands. That’s where Matt Meselson 


We now know that Watson and Crick’s model for DNA and Frank Stahl came in. 


replication was correct—but it was not universally 
accepted at first. Respected physicist-turned-geneticist Heavy DNA 


Max Delbrück, for one, got hung up on what he termed As a graduate student working with Linus Pauling, 
“the untwiddling problem”; that is: How could the two Meselson was toying with a method for telling the dif- 
strands of a double helix, twisted around each other ference between old and new proteins. After chatting 
so many times all along their great length, possibly be with Delbrück about Watson and Crick’s replication 
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Figure 6-5 Three models for DNA replication make different predictions. (A) In the semiconservative model, each parent strand 
serves as a template for the synthesis of a new daughter strand. The first round of replication would produce two hybrid molecules, 
each containing one strand from the original parent and one newly synthesized strand. A subsequent round of replication would yield 
two hybrid molecules and two molecules that contain none of the original parent DNA (see Figure 6-3). (B) In the dispersive model, 
each generation of replicated DNA molecules will be a mosaic of DNA from the parent strands and the newly synthesized DNA. (C) In 
the conservative model, the parent molecule remains intact after being copied. In this case, the first round of replication would yield 
the original parent double helix and an entirely new double helix. For each model, parent DNA molecules are shown in orange; newly 
replicated DNA is red. Note that only a very small segment of DNA is shown for each model. 


model, it occurred to Meselson that the approach 
he'd envisaged for exploring protein synthesis might 
also work for studying DNA. In the summer of 1954, 
Meselson met Stahl, who was then a graduate student 
in Rochester, NY, and they agreed to collaborate. It 
took a few years to get everything working, but the two 
eventually performed what has come to be known as 
“the most beautiful experiment in biology.” 


Their approach, in retrospect, was stunningly straight- 
forward. They started by growing two batches of 
Escherichia coli bacteria, one in a medium containing a 
heavy isotope of nitrogen, !®N, the other in a medium 
containing the normal, lighter '4N. The nitrogen in the 
nutrient medium gets incorporated into the nucleotide 
bases and, from there, makes its way into the DNA 
of the organism. After growing bacterial cultures for 
many generations in either the !5N- or !4N-containing 
medium, the researchers had two flasks of bacteria, one 
with heavy DNA (containing E. coli that had incorpo- 
rated the heavy isotope), the other with DNA that was 
light. Meselson and Stahl then broke open the bacterial 
cells and loaded the DNA into tubes containing a high 
concentration of the salt cesium chloride. When these 
tubes are centrifuged at high speed, the cesium chloride 
forms a density gradient, and the DNA molecules float 
or sink within the solution until they reach the point at 
which their density equals that of the salt solution that 
surrounds them (see Panel 4-3, pp. 164-165). Using 
this method, called equilibrium density centrifugation, 
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Meselson and Stahl found that they could distin- 
guish between heavy (!°N-containing) DNA and light 
('4N-containing) DNA by observing the positions of the 
DNA within the cesium chloride gradient. Because the 
heavy DNA was denser than the light DNA, it collected 
at a position nearer to the bottom of the centrifuge tube 
(Figure 6-6). 


And the winner is... 


Once they had established this method for differentiat- 
ing between light and heavy DNA, Meselson and Stahl 
set out to test the various hypotheses proposed for DNA 
replication. To do this, they took a flask of bacteria that 
had been grown in heavy nitrogen and transferred the 
bacteria into a medium containing the light isotope. 
At the start of the experiment, all the DNA would be 
heavy. But, as the bacteria divided, the newly synthe- 
sized DNA would be light. They could then monitor the 
accumulation of light DNA and see which model, if any, 
best fit their data. After one generation of growth, the 
researchers found that the parental, heavy DNA mol- 
ecules—those made of two strands containing '*N—had 
disappeared and were replaced by a new species of 
DNA that banded at a density halfway between those 
of !5N-DNA and !4N-DNA (Figure 6-7). These newly 
synthesized daughter helices, Meselson and Stahl rea- 
soned, must be hybrids—containing both heavy and 
light isotopes. 


Right away, this observation ruled out the conserva- 
tive model of DNA replication, which predicted that the 


Figure 6-6 Centrifugation in a cesium 
chloride gradient allows the separation 
of heavy and light DNA. Bacteria 

are grown for several generations in 

a medium containing either 'N (the 
heavy isotope) or 14N (the light isotope) 
bacteria grown in to label their DNA. The cells are then 
\4N-containing medium broken open, and the DNA is loaded 
into an ultracentrifuge tube containing 

a cesium chloride salt solution (yellow). 
These tubes are centrifuged at high 
speed for two days to allow the cesium 
chloride to form a gradient with low 
density at the top of the tube and high 
density at the bottom. As the gradient 
forms, the DNA will migrate to the region 
where its density matches that of the 

salt surrounding it. The heavy and light 
DNA molecules thus collect in different 
positions in the tube. 
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Figure 6-7 The first part of the Meselson-Stahl experiment ruled out the 
conservative model of DNA replication. (A) Bacteria grown in light medium 
(containing 14N) yield DNA that forms a band near the top of the centrifuge tube, 
whereas bacteria grown in 1°N-containing heavy medium (B) produce DNA that 
reaches a position further down the tube. (C) When bacteria grown in a heavy 
medium are transferred to a light medium and allowed to divide for one hour (the 
time needed for one generation), they produce a band that is positioned about 
midway between the heavy and light DNA. These results rule out the conservative 
model of replication but do not distinguish between the semiconservative and 
dispersive models, both of which predict the formation of daughter DNA molecules 
with intermediate densities. 

The fact that the results came out looking so clean—with discrete bands forming 
at the expected positions for newly replicated hybrid DNA molecules—was a happy 
accident of the experimental protocol. The researchers used a hypodermic syringe 
to load their DNA samples into the ultracentrifuge tubes (see Figure 6—6). In the 
process, they unwittingly sheared the large bacterial chromosome into smaller 
fragments. Had the chromosomes remained whole, the researchers might have 
isolated DNA molecules that were only partially replicated, because many cells 
would have been caught in the middle of copying their DNA. Molecules in such an 
intermediate stage of replication would not have separated into such beautifully 
discrete bands. But because the researchers were instead working with smaller 
pieces of DNA, the likelihood that any given fragment had been fully replicated— 
and contained a complete parent and daughter strand—was high, thus yielding 
clean, easy-to-interpret results. 


parental DNA would remain entirely 
heavy, while the daughter DNA 
would be entirely light (see Figure 
6-5C). The data supported the semi- 
conservative model, which predicted 
the formation of hybrid molecules 
containing one strand of heavy DNA 
and one strand of light (see Figure 
6-5A). The results, however, were 
also consistent with the dispersive 
model, in which hybrid DNA strands 
would contain a mixture of heavy 
and light DNA (see Figure 6-5B). 


To distinguish between the remain- 
ing two models, Meselson and Stahl 
turned up the heat. When DNA is 
subjected to high temperature, the 
hydrogen bonds holding the two 
strands together break and the 
helix comes apart, leaving a collec- 
tion of single-stranded DNAs. When 
the researchers heated the hybrid 
molecules before centrifuging, they 
discovered that one strand of the 
DNA was heavy, whereas the other 
was light. This observation ruled out 
the dispersive model; if this model 
were correct, the resulting strands, 
each containing a mottled assembly 
of heavy and light DNA, would have 
all banded together at an intermedi- 
ate density. 


According to historian Frederic 
Lawrence Holmes, the experiment 
was so elegant and the results 
so clean that Stahl—when being 
interviewed for a position at Yale 
University—was unable to fill the 50 
minutes allotted for his talk. “I was 
finished in 25 minutes,” said Stahl, 
“because that is all it takes to tell 
that experiment. It’s so totally sim- 
ple and contained.” Stahl did not get 
the job at Yale, but the experiment 
convinced biologists that Watson 
and Crick had been correct. In fact, 
the results were accepted so widely 
and rapidly that the experiment 
was described in a textbook before 
Meselson and Stahl had even pub- 
lished the data. 


replication origin (Figure 6-8). At each fork, a replication machine moves 
along the DNA, opening up the two strands of the double helix and using 
each strand as a template to make a new daughter strand. The two forks 
move away from the origin in opposite directions, unzipping the DNA 
double helix and copying the DNA as they go (Figure 6-9). DNA rep- 
lication—in both bacterial and eukaryotic chromosomes—is therefore 
termed bidirectional. The forks move very rapidly: at about 1000 nucleo- 
tide pairs per second in bacteria and 100 nucleotide pairs per second in 
humans. The slower rate of fork movement in humans (indeed, in all 
eukaryotes) may be due to the difficulties in replicating DNA through 
the more complex chromatin structure of eukaryotic chromosomes (dis- 
cussed in Chapter 5). 


DNA Polymerase Synthesizes DNA Using a Parental 
Strand as a Template 


The movement of a replication fork is driven by the action of the replica- 
tion machine, at the heart of which is an enzyme called DNA polymerase. 
This enzyme catalyzes the addition of nucleotides to the 3’ end of a grow- 
ing DNA strand, using one of the original, parental DNA strands as a 
template. Base-pairing between an incoming nucleotide and the tem- 
plate strand determines which of the four nucleotides (A, G, T, or C) will 
be selected. The final product is a new strand of DNA that is complemen- 
tary in nucleotide sequence to the template (Figure 6-10). 


The polymerization reaction involves the formation of a phosphodiester 
bond between the 3’ end of the growing DNA chain and the 5'-phosphate 
group of the incoming nucleotide, which enters the reaction as a deoxy- 
ribonucleoside triphosphate. The energy for polymerization is provided 
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Figure 6-9 The two replication forks formed at a replication origin move away 
in opposite directions. (A) These drawings represent the same portion of a DNA 
molecule as it might appear at different times during replication. The orange 

lines represent the two parental DNA strands; the red lines represent the newly 
synthesized DNA strands. (B) An electron micrograph showing DNA replicating in 

an early fly embryo. The particles visible along the DNA are nucleosomes, structures 
made of DNA and the histone protein complexes around which the DNA is wrapped 
(discussed in Chapter 5). The chromosome in this micrograph is the same one that 
was redrawn in sketch (2) of (A). (B, courtesy of Victoria Foe.) 
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Figure 6-8 DNA synthesis occurs at 
Y-shaped junctions called replication 
forks. Two replication forks form at each 
replication origin and subsequently move 
away from each other as replication 
proceeds. 


QUESTION 6-1 


Look carefully at the micrograph 
and corresponding sketch (2) in 
Figure 6-9. 

A. Using the scale bar, estimate 
the lengths of the DNA double 
helices between the replication 
forks. Numbering the replication 
forks sequentially from the left, how 
long will it take until forks 4 and 

5, and forks 7 and 8, respectively, 
collide with each other? (Recall that 
the distance between the bases 

in DNA is 0.34 nm, and eukaryotic 
replication forks move at about 100 
nucleotides per second.) For this 
question, disregard the nucleosomes 
seen in the micrograph and assume 
that the DNA is fully extended. 

B. The fly genome is about 

1.8 x 108 nucleotide pairs in size. 
What fraction of the genome is 
shown in the micrograph? 
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Figure 6-10 A new DNA strand is 
synthesized in the 5’-to-3' direction. 

At each step, the appropriate incoming 
nucleoside triphosphate is selected by 
forming base pairs with the next nucleotide 
in the template strand: A with T, T with A, 

C with G, and G with C. Each is added to 
the 3’ end of the growing new strand, as 
indicated. 
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by the incoming deoxyribonucleoside triphosphate itself: hydrolysis 
of one of its high-energy phosphate bonds fuels the reaction that links 
the nucleotide monomer to the chain, releasing pyrophosphate (Figure 
6-11). Pyrophosphate is further hydrolyzed to inorganic phosphate (Pi), 
which makes the polymerization reaction effectively irreversible (see 
Figure 3-42). 


DNA polymerase does not dissociate from the DNA each time it adds a 
new nucleotide to the growing strand; rather, it stays associated with the 
DNA and moves along the template strand stepwise for many cycles of 
the polymerization reaction (Movie 6.1). We will see later that a special 
protein keeps the polymerase attached to DNA as it repeatedly adds new 
nucleotides to the growing strand. 


The Replication Fork Is Asymmetrical 


The 5'-to-3' direction of the DNA polymerization reaction poses a problem 
at the replication fork. As illustrated in Figure 5-2, the sugar-phosphate 
backbone of each strand of a DNA double helix has a unique chemical 
direction, or polarity, determined by the way each sugar residue is linked 
to the next, and the two strands in the double helix are antiparallel; that 
is, they run in opposite directions. As a consequence, at each replication 
fork, one new DNA strand is being made on a template that runs in one 
direction (3’ to 5’), whereas the other new strand is being made on a 
template that runs in the opposite direction (5' to 3’). The replication fork 
is therefore asymmetrical (Figure 6-12). Figure 6-9A, however, makes it 
look like both of the new DNA strands are growing in the same direction; 
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Figure 6-11 DNA polymerase adds a deoxyribonucleotide to the 3' end of a growing DNA strand. (A) Nucleotides enter the 
reaction as deoxyribonucleoside triphosphates. An incoming nucleoside triphosphate forms a base pair with its partner in the template 
strand. It is then covalently attached to the free 3’ hydroxyl on the growing DNA strand. The new DNA strand is therefore synthesized 
in the 5'-to-3’ direction. The energy for the polymerization reaction comes from the hydrolysis of a high-energy phosphate bond in 

the incoming nucleoside triphosphate and the release of pyrophosphate, which is subsequently hydrolyzed to yield two molecules of 
inorganic phosphate (not shown). (B) The reaction is catalyzed by the enzyme DNA polymerase (light green). The polymerase guides 
the incoming nucleoside triphosphate to the template strand and positions it such that its 5' triphosphate will be able to react with the 
3'-hydroxyl group on the newly synthesized strand. The gray arrow indicates the direction of polymerase movement. (C) Structure of 
DNA polymerase, as determined by x-ray crystallography, also showing the replicating DNA. The template strand is the longer, orange 
strand, and the newly synthesized DNA strand is colored red (Movie 6.1). 


that is, the direction in which the replication fork is moving. For that to 
be true, one strand would have to be synthesized in the 5’-to-3' direction 
and the other in the 3'-to-5' direction. 


Does the cell have two types of DNA polymerase, one for each direction? 
The answer is no: all DNA polymerases add new subunits only to the 3’ 
end of a DNA strand (see Figure 6-11A). As a result, a new DNA chain 
can be synthesized only in a 5'-to-3' direction. This can easily account 
for the synthesis of one of the two strands of DNA at the replication fork, 
but what happens on the other? This conundrum is solved by the use of 
a “backstitching” maneuver. The DNA strand that appears to grow in the 
incorrect 3'-to-5’ direction is actually made discontinuously, in succes- 
sive, separate, small pieces—with the DNA polymerase moving backward 
with respect to the direction of replication-fork movement so that each 
new DNA fragment can be polymerized in the 5’-to-3’ direction. 


The resulting small DNA pieces—called Okazaki fragments after the 
pair of biochemists who discovered them—are later joined together to 
form a continuous new strand. The DNA strand that is made discontinu- 
ously in this way is called the lagging strand, because the cumbersome 
backstitching mechanism imparts a slight delay to its synthesis; the other 
strand, which is synthesized continuously, is called the leading strand 
(Figure 6-13). 


Although they differ in subtle details, the replication forks of all cells, 
prokaryotic and eukaryotic, have leading and lagging strands. This com- 
mon feature arises from the fact that all DNA polymerases work only in 
the 5'-to-3' direction—a restriction that allows DNA polymerase to “check 
its work,” as we discuss next. 


DNA Polymerase Is Self-correcting 


DNA polymerase is so accurate that it makes only about one error in 
every 107 nucleotide pairs it copies. This error rate is much lower than 
can be explained simply by the accuracy of complementary base-pair- 
ing. Although A-T and C-G are by far the most stable base pairs, other, 
less stable base pairs—for example, G-T and C-A—can also be formed. 
Such incorrect base pairs are formed much less frequently than correct 
ones, but, if allowed to remain, they would result in an accumulation of 
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Figure 6-12 At a replication fork, the 
two newly synthesized DNA strands are 
of opposite polarities. This is because 
the two template strands are oriented in 
opposite directions. 


Figure 6-13 At each replication fork, the 
lagging DNA strand is synthesized in 
pieces. Because both of the new strands 
at a replication fork are synthesized in the 
5'-to-3' direction, the lagging strand of 
DNA must be made initially as a series of 
short DNA strands, which are later joined 
together. The upper diagram shows two 
replication forks moving in opposite 
directions; the lower diagram shows the 
same forks a short time later. To replicate 
the lagging strand, DNA polymerase uses 
a backstitching mechanism: it synthesizes 
short pieces of DNA (called Okazaki 
fragments) in the 5'-to-3' direction and then 
moves back along the template strand 
(toward the fork) before synthesizing the 
next fragment. 
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Figure 6-14 During DNA synthesis, 
DNA polymerase proofreads its own 
work. If an incorrect nucleotide is 
accidentally added to a growing strand, 
the DNA polymerase cleaves it from the 
strand and replaces it with the correct 
nucleotide before continuing. 


Figure 6-15 DNA polymerase contains 
separate sites for DNA synthesis 

and proofreading. The diagrams are 
based on the structure of an E. coli DNA 
polymerase molecule, as determined by 
x-ray crystallography. The DNA polymerase, 
which cradles the DNA molecule being 
replicated, is shown in the polymerizing 
mode (left) and in the proofreading, or 
editing, mode (right). The catalytic sites 
for the polymerization activity (P) and 
editing activity (E) are indicated. When the 
polymerase adds an incorrect nucleotide, 
the newly synthesized DNA strand (red) 
transiently unpairs from the template 
strand (orange), and its 3’ end moves into 
the editing site (E) to allow the incorrect 
nucleotide to be removed. 
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mutations. This disaster is avoided because DNA polymerase has two 
special qualities that greatly increase the accuracy of DNA replication. 
First, the enzyme carefully monitors the base-pairing between each 
incoming nucleoside triphosphate and the template strand. Only when 
the match is correct does DNA polymerase undergo a small structural 
rearrangement that allows it to catalyze the nucleotide-addition reac- 
tion. Second, when DNA polymerase does make a rare mistake and adds 
the wrong nucleotide, it can correct the error through an activity called 
proofreading. 


Proofreading takes place at the same time as DNA synthesis. Before the 
enzyme adds the next nucleotide to a growing DNA strand, it checks 
whether the previously added nucleotide is correctly base-paired to 
the template strand. If so, the polymerase adds the next nucleotide; if 
not, the polymerase clips off the mispaired nucleotide and tries again 
(Figure 6-14). Polymerization and proofreading are tightly coordinated, 
and the two reactions are carried out by different catalytic domains in the 
same polymerase molecule (Figure 6-15). 


This proofreading mechanism is possible only for DNA polymerases that 
synthesize DNA exclusively in the 5’-to-3’ direction. If a DNA polymerase 
were able to synthesize in the 3’-to-5’ direction (circumventing the need 
for backstitching on the lagging strand), it would be unable to proofread. 
That’s because if this “backward” polymerase were to remove an incor- 
rectly paired nucleotide from the 5’ end, it would create a chemical dead 
end—a strand that could no longer be elongated (Figure 6—16). Thus, for 
a DNA polymerase to function as a self-correcting enzyme that removes 
its own polymerization errors as it moves along the DNA, it must proceed 
only in the 5'-to-3' direction. The cumbersome backstitching mechanism 
on the lagging strand can be seen as a necessary consequence of main- 
taining this crucial proofreading activity. 


Short Lengths of RNA Act as Primers for DNA Synthesis 


We have seen that the accuracy of DNA replication depends on the 
requirement of the DNA polymerase for a correctly base-paired 3’ end 
before it can add more nucleotides to a growing DNA strand. How then 
can the polymerase begin a completely new DNA strand? To get the pro- 
cess started, a different enzyme is needed—one that can begin a new 
polynucleotide strand simply by joining two nucleotides together without 
the need for a base-paired end. This enzyme does not, however, syn- 
thesize DNA. It makes a short length of a closely related type of nucleic 
acid—RNA (ribonucleic acid)—using the DNA strand as a template. This 
short length of RNA, about 10 nucleotides long, is base-paired to the tem- 
plate strand and provides a base-paired 3' end as a starting point for 
DNA polymerase (Figure 6-17). An RNA fragment thus serves as a primer 
for DNA synthesis, and the enzyme that synthesizes the RNA primer is 
known as primase. 
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Figure 6—16 For proofreading to take place, DNA polymerization must proceed in the 5’-to-3' direction. 

(A) Polymerization in the normal 5’-to-3' direction allows the DNA strand to continue to be elongated after an 
incorrectly added nucleotide (gray) has been removed by proofreading (see Figure 6—14). (B) If DNA synthesis 
instead proceeded in the backward 3’-to-5’ direction, the energy for polymerization would come from the hydrolysis 
of the phosphate groups at the 5’ end of the growing chain (orange), rather than the 5’ end of the incoming 
nucleoside triphosphate. Removal of an incorrect nucleotide would block the addition of the correct nucleotide 
(red), as there are no high-energy phosphodiester bonds remaining at the 5’ end of the growing strand. 


Primase is an example of an RNA polymerase, an enzyme that synthesizes 
RNA using DNA as a template. A strand of RNA is very similar chemi- 
cally to a single strand of DNA except that it is made of ribonucleotide 
subunits, in which the sugar is ribose, not deoxyribose; RNA also differs 
from DNA in that it contains the base uracil (U) instead of thymine (T) 
(see Panel 2-7, pp. 78-79). However, because U can form a base pair with 
A, the RNA primer is synthesized on the DNA strand by complementary 
base-pairing in exactly the same way as is DNA. 


For the leading strand, an RNA primer is needed only to start replication 
at a replication origin; at that point, the DNA polymerase simply takes 
over, extending this primer with DNA synthesized in the 5’-to-3' direc- 
tion. But on the lagging strand, where DNA synthesis is discontinuous, 
new primers are continuously needed to keep polymerization going (see 
Figure 6-13). The movement of the replication fork continually exposes 
unpaired bases on the lagging-strand template, and new RNA primers 
must be laid down at intervals along the newly exposed, single-stranded 
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Figure 6-17 RNA primers are synthesized by an RNA polymerase 
called primase, which uses a DNA strand as a template. Like DNA 
polymerase, primase synthesizes in the 5'-to-3' direction. Unlike DNA 
polymerase, however, primase can start a new polynucleotide chain by 
joining together two nucleoside triphosphates without the need for 

a base-paired 3’ end as a starting point. Primase uses ribonucleoside 
triphosphate rather than deoxyribonucleoside triphosphate. 


stretch. DNA polymerase then adds a deoxyribonucleotide to the 3’ end 
of each new primer to produce another Okazaki fragment, and it will 
continue to elongate this fragment until it runs into the previously syn- 
thesized RNA primer (Figure 6-18). 


To produce a continuous new DNA strand from the many separate pieces 
of nucleic acid made on the lagging strand, three additional enzymes are 
needed. These act quickly to remove the RNA primer, replace it with DNA, 
and join the remaining DNA fragments together. A nuclease degrades 
the RNA primer, a DNA polymerase called a repair polymerase replaces 
the RNA primers with DNA (using the end of the adjacent Okazaki frag- 
ment as its primer), and the enzyme DNA ligase joins the 5'-phosphate 
end of one DNA fragment to the adjacent 3’-hydroxyl end of the next 
(Figure 6-19). Because it was discovered first, the repair polymerase 
involved in this process is often called DNA polymerase I; the polymerase 
that carries out the bulk of DNA replication at the forks is known as DNA 
polymerase III. 


Unlike DNA polymerases I and III, primase does not proofread its work. 
As a result, primers frequently contain mistakes. But because primers 
are made of RNA instead of DNA, they stand out as “suspect copy” to be 
automatically removed and replaced by DNA. The repair polymerase that 
makes this DNA, like the replicative polymerase, proofreads as it synthe- 
sizes. In this way, the cell’s replication machinery is able to begin new 
DNA strands and, at the same time, ensure that all of the DNA is copied 
faithfully. 


Proteins at a Replication Fork Cooperate to Form 
a Replication Machine 


DNA replication requires the cooperation of a large number of proteins 
that act in concert to synthesize new DNA. These proteins form part of 
a remarkably complex replication machine. The first problem faced by 
the replication machine is accessing the nucleotides that lie ahead of 
the replication fork and are thus buried within the double helix. For DNA 
replication to occur, the double helix must be continuously pried apart 
so that the incoming nucleoside triphosphates can form base pairs with 


Figure 6-18 Multiple enzymes are required to synthesize the 
lagging DNA strand. In eukaryotes, RNA primers are made at intervals 
of about 200 nucleotides on the lagging strand, and each RNA primer 
is approximately 10 nucleotides long. These primers are extended 

by a replicative DNA polymerase to produce Okazaki fragments. The 
primers are subsequently removed by nucleases that recognize the 
RNA strand in an RNA-DNA hybrid helix and degrade it; this leaves 
gaps that are filled in by a repair DNA polymerase that can proofread 
as it fills in the gaps. The completed DNA fragments are finally 

joined together by an enzyme called DNA ligase, which catalyzes the 
formation of a phosphodiester bond between the 3’-hydroxyl end of 
one fragment and the 5’-phosphate end of the next, thus linking up 
the sugar-phosphate backbones. This nick-sealing reaction requires an 
input of energy in the form of ATP (see Figure 6-19). 
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Figure 6-19 DNA ligase joins together Okazaki fragments on the lagging strand during DNA synthesis. The 
ligase enzyme uses a molecule of ATP to activate the 5’ phosphate of one fragment (step 1) before forming a new 
bond with the 3’ hydroxyl of the other fragment (step 2). 


each template strand. Two types of replication proteins—DNA helicases 
and single-strand DNA-binding proteins—cooperate to carry out this task. 
A helicase sits at the very front of the replication machine, where it uses 
the energy of ATP hydrolysis to propel itself forward, prying apart the 
double helix as it speeds along the DNA (Figure 6-20 and Movie 6.2). 


Single-strand DNA-binding proteins then latch onto the single-stranded Figure 6-20 DNA synthesis is 
DNA exposed by the helicase, preventing the strands from re-forming carried out by a group of proteins 
base pairs and keeping them in an elongated form so that they can serve that act together as a replication 
as efficient templates. machine. (A) DNA polymerases are 
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QUESTION 6-2 


Discuss the following statement: 
“Primase is a sloppy enzyme that 
makes many mistakes. Eventually, 
the RNA primers it makes are 
removed and replaced with DNA 
synthesized by a polymerase with 
higher fidelity. This is wasteful. It 
would be more energy-efficient 

if a DNA polymerase were used 
to make an accurate primer in 
the first place.” 


Figure 6-21 DNA topoisomerases 
relieve the tension that builds up in 
front of a replication fork. (A) As a 
DNA helicase moves forward, unwinding 
the DNA double helix, it generates a 
section of overwound DNA ahead of it. 
Tension builds up because the rest of 
the chromosome (shown in brown) is too 
large to rotate fast enough to relieve the 
buildup of torsional stress. The broken 
bars represent approximately 20 turns 
of DNA. (B) Some of this torsional stress 
is relieved by additional coiling of the 
DNA double helix to form supercoils. 

(C) DNA topoisomerases relieve this stress 
by generating temporary nicks in the 
DNA, which allow rapid rotation around 
the single strands opposite the nicks. 


This localized unwinding of the DNA double helix itself presents a prob- 
lem. As the helicase moves forward, prying open the double helix, the 
DNA ahead of the fork gets wound more tightly. This excess twisting in 
front of the replication fork creates tension in the DNA that—if allowed 
to build—makes unwinding the double helix increasingly difficult and 
ultimately impedes the forward movement of the replication machinery 
(Figure 6-21A). Enzymes called DNA topoisomerases relieve this ten- 
sion. A DNA topoisomerase produces a transient, single-strand nick in 
the DNA backbone, which temporarily releases the built-up tension; the 
enzyme then reseals the nick before falling off the DNA (Figure 6-21B). 


Back at the replication fork, an additional protein, called a sliding clamp, 
keeps DNA polymerase firmly attached to the template while it is syn- 
thesizing new strands of DNA. Left on their own, most DNA polymerase 
molecules will synthesize only a short string of nucleotides before falling 
off the DNA template strand. The sliding clamp forms a ring around the 
newly formed DNA double helix and, by tightly gripping the polymerase, 
allows the enzyme to move along the template strand without falling off 
as it synthesizes new DNA (see Figure 6-20A and Movie 6.5). 


Assembly of the clamp around DNA requires the activity of another repli- 
cation protein, the clamp loader, which hydrolyzes ATP each time it locks 
a sliding clamp around a newly formed DNA double helix. This loading 
needs to occur only once per replication cycle on the leading strand; on 
the lagging strand, however, the clamp is removed and then reattached 
each time a new Okazaki fragment is made. In bacteria, this happens 
approximately once per second. 


Most of the proteins involved in DNA replication are held together in 
a large multienzyme complex that moves as a unit along the parental 
DNA double helix, enabling DNA to be synthesized on both strands in a 
coordinated manner. This complex can be likened to a miniature sewing 
machine composed of protein parts and powered by nucleoside triphos- 
phate hydrolysis (Figure 6-20B). The proteins involved in DNA replication 
are listed in Table 6-1. 
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TABLE 6-1 PROTEINS INVOLVED IN DNA REPLICATION 
a 


DNA polymerase catalyzes the addition of nucleotides to the 3’ end of a 
growing strand of DNA using a parental DNA strand as 
a template 

DNA helicase uses the energy of ATP hydrolysis to unwind the DNA 


double helix ahead of the replication fork 


Single-strand DNA- binds to single-stranded DNA exposed by DNA 
binding protein helicase, preventing base pairs from re-forming before 
the lagging strand can be replicated 


DNA topoisomerase produces transient nicks in the DNA backbone to relieve 
the tension built up by the unwinding of DNA ahead of 
the DNA helicase 


Sliding clamp keeps DNA polymerase attached to the template, 
allowing the enzyme to move along without falling off as 
it synthesizes new DNA 


Clamp loader uses the energy of ATP hydrolysis to lock the sliding 
clamp onto DNA 

Primase synthesizes RNA primers along the lagging-strand 
template 

DNA ligase uses the energy of ATP hydrolysis to join Okazaki 


fragments made on the lagging-strand template 


Telomerase Replicates the Ends of Eukaryotic 
Chromosomes 


Having discussed how DNA replication begins at origins and continues 
as the replication forks proceed, we now turn to the special problem of 
replicating the very ends of chromosomes. As we discussed previously, 
because DNA replication proceeds only in the 5'-to-3' direction, the lag- 
ging strand of the replication fork must be synthesized in the form of 
discontinuous DNA fragments, each of which is initiated from an RNA 
primer laid down by a primase (see Figure 6-18). A serious problem 
arises, however, as the replication fork approaches the end of a chromo- 
some: although the leading strand can be replicated all the way to the 
chromosome tip, the lagging strand cannot. When the final RNA primer 
on the lagging strand is removed, there is no enzyme that can replace it 
with DNA (Figure 6-22). Without a strategy to deal with this problem, the 
lagging strand would become shorter with each round of DNA replication 
and, after repeated cell divisions, the chromosomes themselves would 
shrink—eventually losing valuable genetic information. 


Bacteria avoid this “end-replication” problem by having circular DNA 
molecules as chromosomes. Eukaryotes get around it by adding long, 
repetitive nucleotide sequences to the ends of every chromosome. 
These sequences, which are incorporated into structures called telo- 
meres, attract an enzyme called telomerase to the chromosome ends. 
Telomerase carries its own RNA template, which it uses to add multi- 
ple copies of the same repetitive DNA sequence to the lagging-strand 
template. In many dividing cells, telomeres are continuously replenished, 
and the resulting extended templates can then be copied by conventional 
DNA replication, ensuring that no peripheral chromosomal sequences 
are lost (Figure 6-23). 


In addition to allowing replication of chromosome ends, telomeres form 
structures that mark the true ends of a chromosome. These structures 
allow the cell to distinguish unambiguously between the natural ends of 
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QUESTION 6-3 


A gene encoding one of the proteins 


involved in DNA replication has 


been inactivated by a mutation in a 
cell. In the absence of this protein, 
the cell attempts to replicate its 
DNA. What would happen during 


the DNA replication process if 


each of the following proteins were 


missing? 
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Figure 6-22 Without a special mechanism to 
replicate the ends of linear chromosomes, 
DNA would be lost during each round 

of cell division. DNA synthesis begins at 
origins of replication and continues until 

the replication machinery reaches the ends 
of the chromosome. The leading strand is 
synthesized in its entirety. But the ends of the 
lagging strand can’t be completed, because 
once the final RNA primer has been removed, 
there is no mechanism for replacing it with 
DNA. Complete replication of the lagging 
strand requires a special mechanism to keep 
the chromosome ends from shrinking with 
each cell division. 


Figure 6-23 Telomeres and telomerase 
prevent linear eukaryotic chromosomes 
from shortening with each cell division. 
To complete the replication of the lagging 
strand at the ends of a chromosome, the 
template strand (orange) is first extended 
beyond the DNA that is to be copied. To 
achieve this, the enzyme telomerase adds to 
the telomere repeat sequences at the 3’ end 
of the template strand, which then allows 
the newly synthesized lagging strand (red) 
to be lengthened by DNA polymerase, as 
shown. The telomerase enzyme itself carries 
a short piece of RNA (blue) with a sequence 
that is complementary to the DNA repeat 
sequence; this RNA acts as the template for 
telomere DNA synthesis. After the lagging- 
strand replication is complete, a short 
stretch of single-stranded DNA remains at 
the ends of the chromosome; however, the 
newly synthesized lagging strand, at this 
point, contains all the information present 
in the original DNA. To see telomerase in 
action, view Movie 6.6. 
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chromosomes and the double-strand DNA breaks that sometimes occur 
accidentally in the middle of chromosomes. These breaks are dangerous 
and must be immediately repaired, as we will see shortly. 


Telomere Length Varies by Cell Type and with Age 


In addition to attracting telomerase, the repetitive DNA sequences found 
within telomeres attract other telomere-binding proteins that not only 
physically protect chromosome ends, but help maintain telomere length. 


Cells that divide at a rapid rate throughout the life of the organism— 
those that line the gut or generate blood cells in the bone marrow, for 
example—keep their telomerase fully active. Many other cell types, how- 
ever, gradually turn down their telomerase activity. After many rounds 
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of cell division, the telomeres in these descendent cells will shrink, until 
they essentially disappear. At this point, these cells will cease divid- 
ing. In theory, such a mechanism could provide a safeguard against the 
uncontrolled proliferation of cells—including abnormal cells that have 
accumulated mutations that could promote the development of cancer. 


DNA REPAIR 


The diversity of living organisms and their success in colonizing almost 
every part of the Earth’s surface depend on genetic changes accumulated 
gradually over billions of years. A small subset of these changes will be 
beneficial, allowing the affected organisms to adapt to changing condi- 
tions and to thrive in new habitats. However, most of these changes will 
be of little consequence or even deleterious. 


In the short term, and from the perspective of an individual organism, 
such genetic alterations—called mutations—are kept to a minimum: to 
survive and reproduce, individuals must be genetically stable. This stabil- 
ity is achieved not only through the extremely accurate mechanism for 
replicating DNA that we have just discussed, but also through the work of 
a variety of protein machines that continually scan the genome for DNA 
damage and fix it when it occurs. Although some changes arise from rare 
mistakes in the replication process, the majority of DNA damage is an 
unintended consequence of the vast number of chemical reactions that 
occur inside cells. 


Most DNA damage is only temporary, because it is immediately cor- 
rected by processes collectively called DNA repair. The importance of 
these DNA repair processes is evident from the consequences of their 
malfunction. Humans with the genetic disease xeroderma pigmentosum, 
for example, cannot mend the damage done by ultraviolet (UV) radia- 
tion because they have inherited a defective gene for one of the proteins 
involved in this repair process. Such individuals develop severe skin 
lesions, including skin cancer, because of the DNA damage that accu- 
mulates in cells exposed to sunlight and the consequent mutations that 
arise in these cells. 


In this section, we describe a few of the specialized mechanisms cells 
use to repair DNA damage. We then consider examples of what happens 
when these mechanisms fail—and we discuss how the evolutionary his- 
tory of DNA replication and repair is reflected in our genome. 


DNA Damage Occurs Continually in Cells 


Just like any other molecule in the cell, DNA is continually undergoing 
thermal collisions with other molecules, often resulting in major chemi- 
cal changes in the DNA. For example, in the time it takes to read this 
sentence, a total of about a trillion (10!7) purine bases (A and G) will 
be lost from DNA in the cells of your body by a spontaneous reaction 
called depurination (Figure 6-24A). Depurination does not break the 
DNA phosphodiester backbone but instead removes a purine base from a 
nucleotide, giving rise to lesions that resemble missing teeth (see Figure 
6-26B). Another common reaction is the spontaneous loss of an amino 
group (deamination) from a cytosine in DNA to produce the base uracil 
(Figure 6-24B). 


The ultraviolet radiation in sunlight is also damaging to DNA; it promotes 
covalent linkage between two adjacent pyrimidine bases, forming, for 
example, the thymine dimer shown in Figure 6-25. It is the failure to 
repair thymine dimers that spells trouble for individuals with the disease 
xeroderma pigmentosum. 
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QUESTION 6-4 


Discuss the following statement: 
“The DNA repair enzymes that 
fix deamination and depurination 
damage must preferentially 
recognize such damage on newly 
synthesized DNA strands.” 
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Figure 6-24 Depurination and 
deamination are the most frequent 
chemical reactions known to create 
serious DNA damage in cells. 

(A) Depurination can remove guanine 

(or adenine) from DNA. (B) The major 
type of deamination reaction converts 
cytosine to uracil, which, as we have seen, 
is not normally found in DNA. However, 
deamination can occur on other bases as 
well. Both depurination and deamination 
take place on double-helical DNA, 

and neither break the phosphodiester 
backbone. 
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These are only a few of many chemical changes that can occur in our 
DNA. Others are caused by reactive chemicals produced as a normal part 
of cell metabolism. If left unrepaired, DNA damage leads either to the 
substitution of one nucleotide pair for another as a result of incorrect 
base-pairing during replication (Figure 6-26A) or to deletion of one or 
more nucleotide pairs in the daughter DNA strand after DNA replication 
(Figure 6-26B). Some types of DNA damage (thymine dimers, for exam- 
ple) can stall the DNA replication machinery at the site of the damage. 


In addition to this chemical damage, DNA can also be altered by repli- 
cation itself. The replication machinery that copies the DNA can—albeit 
rarely—incorporate an incorrect nucleotide that it fails to correct via 
proofreading (see Figure 6-14). 


For each of these forms of DNA damage, cells possess a mechanism for 
repair, as we discuss next. 
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Figure 6-25 The ultraviolet radiation in sunlight can cause the formation of thymine 
dimers. Two adjacent thymine bases have become covalently attached to each other to 
form a thymine dimer. Skin cells that are exposed to sunlight are especially susceptible to 
this tyoe of DNA damage. 
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Figure 6-26 Chemical modifications of nucleotides, if left unrepaired, produce mutations. (A) Deamination of 
cytosine, if uncorrected, results in the substitution of one base for another when the DNA is replicated. As shown 

in Figure 6-24B, deamination of cytosine produces uracil. Uracil differs from cytosine in its base-pairing properties 
and preferentially base-pairs with adenine. The DNA replication machinery therefore inserts an adenine when it 
encounters a uracil on the template strand. (B) Depurination, if uncorrected, can lead to the loss of a nucleotide pair. 
When the replication machinery encounters a missing purine on the template strand, it can skip to the next complete 
nucleotide, as shown, thus producing a daughter DNA molecule that is missing one nucleotide pair. In other cases, 
the replication machinery places an incorrect nucleotide across from the missing base, again resulting in a mutation 
(not shown). 


Cells Possess a Variety of Mechanisms for Repairing 
DNA 


The thousands of random chemical changes that occur every day in the 
DNA of a human cell—through thermal collisions or exposure to reac- 
tive metabolic by-products, DNA-damaging chemicals, or radiation—are 
repaired by a variety of mechanisms, each catalyzed by a different set of 
enzymes. Nearly all these repair mechanisms depend on the double-helical 
structure of DNA, which provides two copies of the genetic information— 
one in each strand of the double helix. Thus, if the sequence in one strand 
is accidentally damaged, information is not lost irretrievably, because 
a backup version of the altered strand remains in the complementary 
sequence of nucleotides in the other, undamaged strand. Most DNA dam- 
age creates structures that are never encountered in an undamaged DNA 
strand; thus the good strand is easily distinguished from the bad. 


The basic pathway for repairing damage to DNA, illustrated schemati- 
cally in Figure 6-27, involves three basic steps: 


1. The damaged DNA is recognized and removed by one of a variety of 
mechanisms. These involve nucleases, which cleave the covalent 
bonds that join the damaged nucleotides to the rest of the DNA 
strand, leaving a small gap on one strand of the DNA double helix. 

2. A repair DNA polymerase binds to the 3’-hydroxyl end of the 
cut DNA strand. The enzyme then fills in the gap by making a 
complementary copy of the information present in the undamaged 
strand. Although they differ from the DNA polymerase that 
replicates DNA, repair DNA polymerases synthesize DNA strands 
in the same way. For example, they elongate chains in the 5’-to-3' 
direction and have the same type of proofreading activity to ensure 
that the template strand is copied accurately. In many cells, the 
repair polymerase is the same enzyme that fills in the gaps left after 
the RNA primers are removed during the normal DNA replication 
process (see Figure 6-18). 
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Figure 6-27 The basic mechanism of 
DNA repair involves three steps. In step 
1 (excision), the damage is cut out by one 
of a series of nucleases, each specialized 
for a certain type of DNA damage. In 

step 2 (resynthesis), the original DNA 
sequence is restored by a repair DNA 
polymerase, which fills in the gap created 
by the excision events. In step 3 (ligation), 
DNA ligase seals the nick left in the sugar- 
phosphate backbone of the repaired 
strand. Nick sealing, which requires energy 
from ATP hydrolysis, remakes the broken 
phosphodiester bond between the adjacent 
nucleotides (see Figure 6-19). 


3. When the repair DNA polymerase has filled in the gap, a break 
remains in the sugar—-phosphate backbone of the repaired strand. 
This nick in the helix is sealed by DNA ligase, the same enzyme 
that joins the Okazaki fragments during replication of the lagging 
DNA strand (see Figure 6-19). 


A DNA Mismatch Repair System Removes Replication 
Errors That Escape Proofreading 


Although the high fidelity and proofreading abilities of the cell’s replica- 
tion machinery generally prevent replication errors from occurring, rare 
mistakes do happen. Fortunately, the cell has a backup system—called 
mismatch repair—that is dedicated to correcting these errors. The rep- 
lication machine makes approximately one mistake per 10’ nucleotides 
synthesized; DNA mismatch repair corrects 99% of these replication 
errors, increasing the overall accuracy to one mistake in 10? nucleotides 
synthesized. This level of accuracy is much, much higher than that gener- 
ally encountered in our day-to-day lives (Table 6-2). 


Whenever the replication machinery makes a copying mistake, it leaves 
behind a mispaired nucleotide (commonly called a mismatch). If left 
uncorrected, the mismatch will result in a permanent mutation in the 
next round of DNA replication (Figure 6-28). In most cases, however, 
a complex of mismatch repair proteins will detect the DNA mismatch, 
remove a portion of the DNA strand containing the error, and then resyn- 
thesize the missing DNA. This repair mechanism restores the correct 
sequence (Figure 6-29). 


To be effective, the mismatch repair system must be able to recognize 
which of the DNA strands contains the error. Removing a segment from 
the strand that contains the correct sequence would only compound the 
mistake. The way the mismatch system solves this problem is by rec- 
ognizing and removing only the newly made DNA. In bacteria, newly 
synthesized DNA lacks a type of chemical modification (a methyl group 
added to certain adenines) that is present on the preexisting parent DNA. 
Newly synthesized DNA is unmethylated for a short time, during which 
the new and template strands can be easily distinguished. Other cells 
use different strategies for distinguishing their parent DNA from a newly 
replicated strand. 


In humans, mismatch repair plays an important role in preventing can- 
cer. An inherited predisposition to certain cancers (especially some 
types of colon cancer) is caused by mutations in genes that encode mis- 
match repair proteins. Human cells have two copies of these genes (one 
from each parent), and individuals who inherit one damaged mismatch 


TABLE 6-2 ERROR RATES 


A professional typist typing at 120 words 1 mistake per 250 characters 

per minute 

Airline luggage system 1 bag lost, damaged, or delayed per 
400 passengers 

Driving a car in the United States 1 death per 10* people per year 

DNA replication (without proofreading) 1 mistake per 10° nucleotides copied 

DNA replication (with proofreading; 1 mistake per 10’ nucleotides copied 


without mismatch repair) 


DNA replication (with mismatch repair) 1 mistake per 10? nucleotides copied 
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repair gene are unaffected until the undamaged copy of the same gene 
is randomly mutated in a somatic cell. This mutant cell—and all of its 
progeny—are then deficient in mismatch repair; they therefore accumu- 
late mutations more rapidly than do normal cells. Because cancers arise 
from cells that have accumulated multiple mutations, a cell deficient in 
mismatch repair has a greatly enhanced chance of becoming cancerous. 
Thus, inheriting a single damaged mismatch repair gene strongly predis- 
poses an individual to cancer. 


Double-Strand DNA Breaks Require a Different Strategy 
for Repair 


The repair mechanisms we have discussed thus far rely on the genetic 
redundancy built into every DNA double helix. If nucleotides on one 
strand are damaged, they can be repaired using the information present 
in the complementary strand. This feature makes the DNA double helix 
especially well-suited for stably carrying genetic information from one 
generation to the next. 


But what happens when both strands of the double helix are damaged 
at the same time? Mishaps at the replication fork, radiation, and various 
chemical assaults can all fracture DNA, creating a double-strand break. 
Such lesions are particularly dangerous, because they can lead to the 
fragmentation of chromosomes and the subsequent loss of genes. 
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Figure 6-28 Errors made during DNA 
replication must be corrected to avoid 
mutations. If uncorrected, a mismatch will 
lead to a permanent mutation in one of the 
two DNA molecules produced during the 
next round of DNA replication. 
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Figure 6-29 Mismatch repair eliminates replication errors and restores the original DNA sequence. When mistakes occur 
during DNA replication, the repair machinery must replace the incorrect nucleotide on the newly synthesized strand, using the original 
parent strand as its template. This mechanism eliminates the error, and allows the original sequence to be copied during subsequent 


rounds of replication. 
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Figure 6-30 Cells can repair (A) NONHOMOLOGOUS END JOINING (B) HOMOLOGOUS RECOMBINATION 
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the repaired DNA. (B) If a double- 

strand break occurs in one of two 

duplicated DNA double helices 

after DNA replication has occurred, 

but before the chromosome 

copies have been separated, the 

undamaged double helix can be emn 
readily used as a template to repair S 
the damaged double helix deletion of DNA sequence 
through homologous 
recombination. Although more 
complicated than nonhomologous 
end joining, this process 
accurately restores the original 
DNA sequence at the site of the 


break. The detailed mechanism is This type of damage is especially difficult to repair. Every chromosome 
presented in Figure eman: contains unique information; if a chromosome experiences a double- 
strand break, and the broken pieces become separated, the cell has no 
spare copy it can use to reconstruct the information that is now missing. 


DOUBLE-STRAND BREAK ACCURATELY 
REPAIRED USING UNDAMAGED DNA 
AS TEMPLATE 


END JOINING 
BY DNA LIGASE 


To handle this potentially disastrous type of DNA damage, cells have 
evolved two basic strategies. The first involves hurriedly sticking the bro- 
ken ends back together, before the DNA fragments drift apart and get 
lost. This repair mechanism, called nonhomologous end joining, occurs 
in many cell types and is carried out by a specialized group of enzymes 
that “clean” the broken ends and rejoin them by DNA ligation. This “quick 
and dirty” mechanism rapidly seals the break, but it comes with a price: 
in “cleaning” the break to make it ready for ligation, nucleotides are often 
lost at the site of repair (Figure 6-30A and Movie 6.7). If this imperfect 
repair disrupts the activity of a gene, the cell could suffer serious con- 
sequences. Thus, nonhomologous end joining can be a risky strategy 
for fixing broken chromosomes. Fortunately, cells have an alternative, 
error-free strategy for repairing double-strand breaks, called homologous 
recombination (Figure 6-30B), as we discuss next. 


Homologous Recombination Can Flawlessly Repair DNA 
Double-Strand Breaks 


The challenge in repairing a double-strand break, as mentioned pre- 
viously, is finding an intact template to guide the repair. However, if a 
double-strand break occurs in a double helix shortly after that stretch of 
DNA has been replicated, the undamaged copy can serve as a template 
to guide the repair of both broken strands of DNA. The information on the 
undamaged strands of the intact double helix can be used to repair the 
complementary strands in the broken DNA. Because the two DNA mole- 
cules are homologous—they have identical or nearly identical nucleotide 
sequences outside the broken region—this mechanism is known as 
homologous recombination. It results in a flawless repair of the double- 
strand break, with no loss of genetic information (see Figure 6-30B). 


Homologous recombination most often occurs shortly after a cell's 
DNA has been replicated before cell division, when the duplicated hel- 
ices are still physically close to each other (Figure 6-31A). To initiate 


the repair, a recombination-specific nuclease chews back the 5’ ends of 
the two broken strands at the break (Figure 6-31B). Then, with the help 
of specialized enzymes (called recA in bacteria and Rad52 in eukary- 
otes), one of the broken 3’ ends “invades” the unbroken homologous 
DNA duplex and searches for a complementary sequence through base- 
pairing (Figure 6-31C). Once an extensive, accurate match is made, the 
invading strand is elongated by a repair DNA polymerase, using the com- 
plementary undamaged strand as a template (Figure 6-31D). After the 
repair polymerase has passed the point where the break occurred, the 
newly elongated strand rejoins its original partner, forming base pairs 
that hold the two strands of the broken double helix together (Figure 
6-31E). Repair is then completed by additional DNA synthesis at the 3’ 
ends of both strands of the broken double helix (Figure 6-31F), followed 
by DNA ligation (Figure 6-31G). The net result is two intact DNA helices, 
for which the genetic information from one was used as a template to 
repair the other. 


Homologous recombination can also be used to repair many other types 
of DNA damage, making it perhaps the most handy DNA repair mech- 
anism available to the cell: all that is needed is an intact homologous 
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Figure 6-31 Homologous recombination 
flawlessly repairs DNA double-strand 
breaks. This is the preferred method for 
repairing double-strand breaks that arise 
shortly after the DNA has been replicated 
but before the cell has divided. See 

text for details. (Adapted from M. McVey 
etal., Proc. Natl. Acad. Sci. U.S.A. 101: 
15694-15699, 2004.) 
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single DNA strand of 
normal B-globin gene 


GTGCACCTGACTCCTGAGGAG-- 


GTGCACCTGACTCCTGTGGAG- 


single DNA strand of 
mutant B-globin gene 


chromosome to use as a partner—a situation that occurs transiently each 
time a chromosome is duplicated. The “all-purpose” nature of homolo- 
gous recombinational repair probably explains why this mechanism, and 
the proteins that carry it out, have been conserved in virtually all cells on 
Earth. 


Homologous recombination is versatile, and it also has a crucial role in 
the exchange of genetic information that occurs during the formation 
of the gametes—sperm and eggs. This exchange, during the specialized 
form of cell division called meiosis, enhances the generation of genetic 
diversity within a species during sexual reproduction. We will discuss it 
when we talk about sex in Chapter 19. 


Failure to Repair DNA Damage Can Have Severe 
Consequences for a Cell or Organism 


On occasion, the cell’s DNA replication and repair processes fail and 
allow a mutation to arise. This permanent change in the DNA sequence 
can have profound consequences. If the change occurs in a particular 
position in the DNA sequence, it could alter the amino acid sequence 
of a protein in a way that reduces or eliminates that protein’s ability to 
function. For example, mutation of a single nucleotide in the human 
hemoglobin gene can cause the disease sickle-cell anemia. The hemo- 
globin protein is used to transport oxygen in the blood (see Figure 4-24). 
Mutations in the hemoglobin gene can produce a protein that is less solu- 
ble than normal hemoglobin and forms fibrous intracellular precipitates, 
which produce the characteristic sickle shape of affected red blood cells 
(Figure 6-32). Because these cells are more fragile and frequently tear 
as they travel through the bloodstream, patients with this potentially 
life-threatening disease have fewer red blood cells than usual—that is, 
they are anemic. Moreover, the abnormal red blood cells that remain 
can aggregate and block small vessels, causing pain and organ failure. 
We know about sickle-cell hemoglobin because individuals with the 
mutation survive; the mutation even provides a benefit—an increased 
resistance to malaria, as we discuss in Chapter 19. 


The example of sickle-cell anemia, which is an inherited disease, illus- 
trates the consequences of mutations arising in the reproductive germ-line 
cells. A mutation in a germ-line cell will be passed on to all the cells in 
the body of the multicellular organism that develop from it, including the 
gametes responsible for the production of the next generation. 


Figure 6-32 A single nucleotide change causes the disease sickle- 
cell anemia. (A) B-globin is one of the two types of protein subunits 
that form hemoglobin (see Figure 4-24). A single mutation in the 
B-globin gene produces a B-globin subunit that differs from normal 
B-globin by a change from glutamic acid to valine at the sixth amino 
acid position. (Only a portion of the gene is shown here; the B-globin 
subunit contains a total of 146 amino acids. The complete sequence of 
the B-globin gene is shown in Figure 5-11.) Humans carry two copies 
of each gene (one inherited from each parent); a sickle-cell mutation 
in one of the two B-globin genes generally causes no harm to the 
individual, as it is compensated for by the normal gene. However, an 
individual who inherits two copies of the mutant B-globin gene will 
have sickle-cell anemia. (B and C) Normal red blood cells are shown 

in (B), and those from an individual suffering from sickle-cell anemia 

in (C). Although sickle-cell anemia can be a life-threatening disease, 
the responsible mutation can also be beneficial. People with the 
disease, or those who carry one normal gene and one sickle-cell gene, 
are more resistant to malaria than unaffected individuals, because 

the parasite that causes malaria grows poorly in red blood cells that 
contain the sickle-cell form of hemoglobin. 


The many other cells in a multicellular organism (its somatic cells) must 
also be protected against mutation—in this case, against mutations that 
arise during the life of the individual. Nucleotide changes that occur in 
somatic cells can give rise to variant cells, some of which grow and divide 
in an uncontrolled fashion at the expense of the other cells in the organ- 
ism. In the extreme case, an unchecked cell proliferation known as cancer 
results. Cancers are responsible for about 30% of the deaths that occur in 
Europe and North America, and they are caused primarily by a gradual 
accumulation of random mutations in a somatic cell and its descendants 
(Figure 6-33). Increasing the mutation frequency even two- or threefold 
could cause a disastrous increase in the incidence of cancer by accelerat- 
ing the rate at which such somatic cell variants arise. 


Thus, the high fidelity with which DNA sequences are replicated and 
maintained is important both for germ-line cells, which transmit the 
genes to the next generation, and for somatic cells, which normally func- 
tion as carefully regulated members of the complex community of cells 
in a multicellular organism. We should therefore not be surprised to find 
that all cells possess a very sophisticated set of mechanisms to reduce 
the number of mutations that occur in their DNA, devoting hundreds of 
genes to these repair processes. 


A Record of the Fidelity of DNA Replication and Repair Is 
Preserved in Genome Sequences 


Although the majority of mutations do neither harm nor good to an 
organism, those that have severely harmful consequences are usually 
eliminated through natural selection; individuals carrying the altered DNA 
may die or experience decreased fertility, in which case these changes 
will be gradually lost from the population. By contrast, favorable changes 
will tend to persist and spread. 


But even where no selection operates—at the many sites in the DNA 
where a change of nucleotide has no effect on the fitness of the organ- 
ism—the genetic message has been faithfully preserved over tens of 
millions of years. Thus humans and chimpanzees, after about 5 million 
years of divergent evolution, still have DNA sequences that are at least 
98% identical. Even humans and whales, after 10 or 20 times this amount 
of time, have chromosomes that are unmistakably similar in their DNA 
sequence (Figure 6-34). Thus our genome—and those of our relatives— 
contains a message from the distant past. Thanks to the faithfulness of 
DNA replication and repair, 100 million years of evolution have scarcely 
changed its essential content. 


anen 
= 


= 
a 


whale GIGTGGTCTCGTGATCAAAGGCGAAAGGTGGCTCTAGAGAATCCC 
human GTGTGGTCTCGCGATCAGAGGCGCAAGATGGCTCTAGAGAATCCC 
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Figure 6-33 Cancer incidence increases 
dramatically with age. The number of 
newly diagnosed cases of colon cancer in 
women in England and Wales in a single 
year is plotted as a function of age at 
diagnosis. Colon cancer, like most human 
cancers, is caused by the accumulation 

of multiple mutations. Because cells 

are continually experiencing accidental 
changes to their DNA—which accumulate 
and are passed on to progeny cells when 
the mutated cells divide—the chance that 
a cell will become cancerous increases 
greatly with age. (Data from C. Muir et al., 
Cancer Incidence in Five Continents, Vol. V. 
Lyon: International Agency for Research on 
Cancer, 1987.) 


Figure 6-34 The sex-determination genes 
from humans and whales are noticeably 
similar. Despite the many millions of years 
that have passed since humans and whales 
diverged from a common ancestor, the 
nucleotide sequences of many of their 
genes remain closely related. The DNA 
sequences of a part of the gene that 
determines maleness in both humans and 
whales are lined up, one above the other; 
the positions where the two sequences are 
identical are shaded in gray. 
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ESSENTIAL CONCEPTS 


Before a cell divides, it must accurately replicate the vast quantity of 
genetic information carried in its DNA. 


Because the two strands of a DNA double helix are complementary, 
each strand can act as a template for the synthesis of the other. Thus 
DNA replication produces two identical, double-helical DNA mol- 
ecules, enabling genetic information to be copied and passed on from 
a cell to its daughter cells and from a parent to its offspring. 


During replication, the two strands of a DNA double helix are pulled 
apart at a replication origin to form two Y-shaped replication forks. 
DNA polymerases at each fork produce a new, complementary DNA 
strand on each parental strand. 


DNA polymerase replicates a DNA template with remarkable fidel- 
ity, making only about one error in every 10’ nucleotides copied. 
This accuracy is made possible, in part, by a proofreading process in 
which the enzyme corrects its own mistakes as it moves along the 
DNA. 


Because DNA polymerase synthesizes new DNA in the 5'-to-3' direc- 
tion, only the leading strand at the replication fork can be synthesized 
in a continuous fashion. On the lagging strand, DNA is synthesized in 
a discontinuous backstitching process, producing short fragments of 
DNA that are later joined together by DNA ligase. 


DNA polymerase is incapable of starting a new DNA strand from 
scratch. Instead, DNA synthesis is primed by an RNA polymerase 
called primase, which makes short lengths of RNA primers that are 
then elongated by DNA polymerase. These primers are subsequently 
removed and replaced with DNA. 


DNA replication requires the cooperation of many proteins that form 
a multienzyme replication machine that pries open the double helix 
and copies the information contained in both DNA strands. 


In eukaryotes, a special enzyme called telomerase replicates the DNA 
at the ends of the chromosomes, particularly in rapidly dividing cells. 


The rare copying mistakes that escape proofreading are dealt with by 
mismatch repair proteins, which increase the accuracy of DNA repli- 
cation to one mistake per 10? nucleotides copied. 


Damage to one of the two DNA strands, caused by unavoidable 
chemical reactions, is repaired by a variety of DNA repair enzymes 
that recognize damaged DNA and excise a short stretch of the dam- 
aged strand. The missing DNA is then resynthesized by a repair DNA 
polymerase, using the undamaged strand as a template. 


If both DNA strands are broken, the double-strand break can be rap- 
idly repaired by nonhomologous end joining. Nucleotides are often 
lost in the process, altering the DNA sequence at the repair site. 


Homologous recombination can flawlessly repair double-strand 
breaks (and many other types of DNA damage) using an undamaged 
homologous double helix as a template. 


Highly accurate DNA replication and DNA repair processes play a key 
role in protecting us from the uncontrolled growth of somatic cells 
known as cancer. 
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DNA replication 
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replication fork 


replication origin 


lagging strand RNA (ribonucleic acid) 
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QUESTIONS 


QUESTION 6-5 


DNA mismatch repair enzymes preferentially repair bases 
on the newly synthesized DNA strand, using the old DNA 
strand as a template. If mismatches were simply repaired 
without regard for which strand served as template, would 
this reduce replication errors as effectively? Explain your 
answer. 


QUESTION 6-6 


Suppose a mutation affects an enzyme that is required to 
repair the damage to DNA caused by the loss of purine 
bases. The loss of a purine occurs about 5000 times in 

the DNA of each of your cells per day. As the average 
difference in DNA sequence between humans and 
chimpanzees is about 1%, how long will it take you to turn 
into an ape? Or would this transformation be unlikely to 
occur? 


QUESTION 6-7 


Which of the following statements are correct? Explain your 
answers. 


A. A bacterial replication fork is asymmetrical because 
it contains two DNA polymerase molecules that are 
structurally distinct. 


B. Okazaki fragments are removed by a nuclease that 
degrades RNA. 


C. The error rate of DNA replication is reduced both by 
proofreading by DNA polymerase and by DNA mismatch 
repair. 


D. In the absence of DNA repair, genes become less stable. 


E. None of the aberrant bases formed by deamination 
occur naturally in DNA. 


F. Cancer can result from the accumulation of mutations in 
somatic cells. 


QUESTION 6-8 


The speed of DNA replication at a replication fork is about 
100 nucleotides per second in human cells. What is the 

minimum number of origins of replication that a human cell 
must have if it is to replicate its DNA once every 24 hours? 


Recall that a human cell contains two copies of the human 
genome—one inherited from the mother, the other from the 
father—each consisting of 3 x 10? nucleotide pairs. 


QUESTION 6-9 


Look carefully at Figure 6—11 and at the structures of the 
compounds shown in Figure Q6-9. 


A. What would you expect if ddCTP were added to a DNA 
replication reaction in large excess over the concentration of 
the available dCTP, the normal deoxycytidine triphosphate? 


deoxycytidine 
triphosphate (dCTP) OH 


®-8-®—0—cH, 


dideoxycytidine 
triphosphate (ddCTP) H H 


—CH, 


dideoxycytidine 
monophosphate (ddCMP) 


Figure Q6-9 H H 
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B. What would happen if it were added at 10% of the 
concentration of the available dCTP? 


C. What effects would you expect if ddCMP were added 
under the same conditions? 


QUESTION 6-10 


Figure Q6—10 shows a snapshot of a replication fork in 
which the RNA primer has just been added to the lagging 
strand. Using this diagram as a guide, sketch the path of the 
DNA as the next Okazaki fragment is synthesized. Indicate 
the sliding clamp and the single-strand DNA-binding protein 
as appropriate. 


Saa, 


next primer 


A 


l YY. Figure Q6-10 


QUESTION 6-11 


Approximately how many high-energy bonds does DNA 
polymerase use to replicate a bacterial chromosome 
(ignoring helicase and other enzymes associated with the 
replication fork)? Compared with its own dry weight of 
10-12 g, how much glucose does a single bacterium need to 
provide enough energy to copy its DNA once? The number 
of nucleotide pairs in the bacterial chromosome is 3 x 10°. 
Oxidation of one glucose molecule yields about 30 high- 
energy phosphate bonds. The molecular weight of glucose 
is 180 g/mole. (Recall from Figure 2-3 that a mole consists 
of 6 x 1023 molecules.) 


QUESTION 6-12 


What, if anything, is wrong with the following statement: 
“DNA stability in both reproductive cells and somatic cells is 
essential for the survival of a species.” Explain your answer. 


Figure Q6-13 
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QUESTION 6-13 


A common type of chemical damage to DNA is produced 

by a spontaneous reaction termed deamination, in which 

a nucleotide base loses an amino group (NH2). The amino 
group is replaced with a keto group (C=O) by the general 
reaction shown in Figure Q6—13. Write the structures of the 
bases A, G, C, T, and U and predict the products that will 

be produced by deamination. By looking at the products of 
this reaction—and remembering that, in the cell, these will 
need to be recognized and repaired—can you propose an 
explanation for why DNA does not contain uracil? 


QUESTION 6-14 


A. Explain why telomeres and telomerase are needed 
for replication of eukaryotic chromosomes but not for 
replication of circular bacterial chromosomes. Draw a 
diagram to illustrate your explanation. 


B. Would you still need telomeres and telomerase to 
complete eukaryotic chromosome replication if primase 
always laid down the RNA primer at the very 3' end of the 
template for the lagging strand? 

QUESTION 6-15 


Describe the consequences that would arise if a eukaryotic 
chromosome: 


A. contained only one origin of replication: 
(i) at the exact center of the chromosome. 
(ii) at one end of the chromosome. 

B. lacked telomeres. 

C. lacked a centromere. 


Assume that the chromosome is 150 million nucleotide pairs 
in length, a typical size for an animal chromosome, and that 
DNA replication in animal cells proceeds at about 

100 nucleotides per second. 


From DNA to Protein: 
How Cells Read the Genome 


Once the double-helical structure of DNA (deoxyribonucleic acid) had 
been determined in the early 1950s, it became clear that the hereditary 
information in cells is encoded in the linear order—or sequence—of the 
four different nucleotide subunits that make up the DNA. We saw in 
Chapter 6 how this information can be passed on unchanged from a cell 
to its descendants through the process of DNA replication. But how does 
the cell decode and use the information? How do genetic instructions 
written in an alphabet of just four “letters” direct the formation of a bac- 
terium, a fruit fly, or a human? We still have a lot to learn about how the 
information stored in an organism's genes produces even the simplest 
unicellular bacterium, let alone how it directs the development of com- 
plex multicellular organisms like ourselves. But the DNA code itself has 
been deciphered, and we have come a long way in understanding how 
cells read it. 


Even before the code was broken, it was known that the information 
contained in genes somehow directed the synthesis of proteins. Proteins 
are the principal constituents of cells and determine not only cell struc- 
ture but also cell function. In previous chapters, we encountered some 
of the thousands of different kinds of proteins that cells can make. We 
saw in Chapter 4 that the properties and function of a protein molecule 
are determined by the sequence of the 20 different amino acid subunits 
in its polypeptide chain: each type of protein has its own unique amino 
acid sequence, which dictates how the chain will fold to form a molecule 
with a distinctive shape and chemistry. The genetic instructions carried 
by DNA must therefore specify the amino acid sequences of proteins. We 
will see in this chapter exactly how this happens. 


FROM DNA TO RNA 


FROM RNA TO PROTEIN 


RNA AND THE ORIGINS OF LIFE 
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Figure 7-1 Genetic information directs 
the synthesis of proteins. The flow of 
genetic information from DNA to RNA 
(transcription) and from RNA to protein 
(translation) occurs in all living cells. DNA 
can also be copied—or replicated—to 
produce new DNA molecules, as we saw 
in Chapter 6. The segments of DNA that 
are transcribed into RNA are called genes 
(orange). 


QUESTION 7-1 


Consider the expression “central 
dogma,” which refers to the flow 
of genetic information from DNA 
to RNA to protein. Is the word 
“dogma” appropriate in this 
context? 


Figure 7-2 A cell can express different 
genes at different rates. In this and later 
figures, the portions of the DNA that are not 
transcribed are shown in gray. 


DNA does not synthesize proteins on its own: it acts more like a man- 
ager, delegating the various tasks to a team of workers. When a particular 
protein is needed by the cell, the nucleotide sequence of the appropriate 
segment of a DNA molecule is first copied into another type of nucleic 
acid—RNA (ribonucleic acid). That segment of DNA is called a gene, and 
the resulting RNA copies are then used to direct the synthesis of the pro- 
tein. Many thousands of these conversions from DNA to protein occur 
every second in each cell in our body. The flow of genetic information 
in cells is therefore from DNA to RNA to protein (Figure 7—1). All cells, 
from bacteria to those in humans, express their genetic information in 
this way—a principle so fundamental that it has been termed the central 
dogma of molecular biology. 


In this chapter, we explain the mechanisms by which cells copy DNA 
into RNA (a process called transcription) and then use the information 
in RNA to make protein (a process called translation). We also discuss 
a few of the key variations on this basic scheme. Principal among these 
is RNA splicing, a process in eukaryotic cells in which segments of an 
RNA transcript are removed—and the remaining segments stitched back 
together—before the RNA is translated into protein. We will also learn 
that, for some genes, it is the RNA, not a protein, that is the final product. 
In the final section, we consider how the present scheme of information 
storage, transcription, and translation might have arisen from much sim- 
pler systems in the earliest stages of cell evolution. 


FROM DNA TO RNA 


The first step in gene expression, the process by which cells read out the 
instructions in their genes, is transcription. Many identical RNA copies can 
be made from the same gene. For most genes, RNA serves solely as an 
intermediary on the pathway to making a protein. For these genes, each 
RNA molecule can direct the synthesis, or translation, of many identical 
protein molecules. This successive amplification enables cells to rapidly 
synthesize large amounts of protein whenever necessary. At the same 
time, each gene can be transcribed, and its RNA translated, at different 
rates, providing the cell with a way to make vast quantities of some pro- 
teins and tiny quantities of others (Figure 7-2). Moreover, as we discuss 
in Chapter 8, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment. In this section, we focus on 
the production of RNA. We describe how the transcriptional machinery 
recognizes genes and copies the instructions they contain into molecules 
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Figure 7-3 The chemical structure of RNA differs slightly from 

that of DNA. (A) RNA contains the sugar ribose, which differs from 
deoxyribose, the sugar used in DNA, by the presence of an additional 
-OH group. (B) RNA contains the base uracil, which differs from 
thymine, the equivalent base in DNA, by the absence of a -CH3 group. 
(C) A short length of RNA. The chemical linkage between nucleotides 
in RNA—a phosphodiester bond—is the same as that in DNA. 


of RNA. We then discuss how these RNAs are processed, the variety of 
roles they play in the cell, and, ultimately, how they are removed from 
circulation. 


Portions of DNA Sequence Are Transcribed into RNA 


The first step a cell takes in expressing one of its many thousands of 
genes is to copy the nucleotide sequence of that gene into RNA. The pro- 
cess is called transcription because the information, though copied into 
another chemical form, is still written in essentially the same language— 
the language of nucleotides. Like DNA, RNA is a linear polymer made 
of four different nucleotide subunits, linked together by phosphodiester 
bonds. It differs from DNA chemically in two respects: (1) the nucleotides 
in RNA are ribonucleotides—that is, they contain the sugar ribose (hence 
the name ribonucleic acid) rather than the deoxyribose found in DNA; 
and (2) although, like DNA, RNA contains the bases adenine (A), gua- 
nine (G), and cytosine (C), it contains uracil (U) instead of the thymine (T) 
found in DNA (Figure 7-3). Because U, like T, can base-pair by hydrogen- 
bonding with A (Figure 7-4), the complementary base-pairing properties 
described for DNA in Chapter 5 apply also to RNA. 


Although their chemical differences are small, DNA and RNA differ quite 
dramatically in overall structure. Whereas DNA always occurs in cells 
as a double-stranded helix, RNA is largely single-stranded. This differ- 
ence has important functional consequences. Because an RNA chain is 
single-stranded, it can fold up into a variety of shapes, just as a poly- 
peptide chain folds up to form the final shape of a protein (Figure 7-5); 


Figure 7-4 Uracil forms a base pair with adenine. The hydrogen 
bonds that hold the base pair together are shown in red. Uracil has the 
same base-pairing properties as thymine. Thus U-A base pairs in RNA 
closely resemble T-A base pairs in DNA (see Figure 5—4A). 
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Figure 7-5 RNA molecules can fold into specific structures that are held together by hydrogen bonds between 
different base pairs. RNA is largely single-stranded, but it often contains short stretches of nucleotides that can 
base-pair with complementary sequences found elsewhere on the same molecule. These interactions—along with 
some “nonconventional” base-pair interactions (e.g., A-G)—allow an RNA molecule to fold into a three-dimensional 
structure that is determined by its sequence of nucleotides. (A) A diagram of a hypothetical, folded RNA structure 
showing only conventional (G-C and A-U) base-pair interactions (red). (B) Formation of nonconventional base-pair 
interactions (green) folds the structure of the hypothetical RNA shown in (A) even further. (C) Structure of an actual 
RNA molecule that is involved in RNA splicing. The considerable amount of double-helical structure displayed by 
this RNA is produced by conventional base pairing. For an additional view of RNA structure, see Movie 7.1. 
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Figure 7-6 Transcription of a gene 
produces an RNA complementary to one 
strand of DNA. The bottom strand of DNA 
in this example is called the template strand 
because it is used to guide the synthesis 

of the RNA molecule. The nontemplate 
strand of the gene (here, shown at the 

top) is sometimes called the coding strand 
because its sequence is equivalent to the 
RNA product, as shown. Which DNA strand 
serves as the template varies, depending on 
the gene, as we discuss later. By convention, 
an RNA molecule is usually depicted with its 
5’ end—the first part to be synthesized—to 
the left. 


double-stranded DNA cannot fold in this fashion. As we discuss later in 
the chapter, the ability to fold into a complex three-dimensional shape 
allows RNA to carry out various functions in cells, in addition to con- 
veying information between DNA and protein. Whereas DNA functions 
solely as an information store, some RNAs have structural, regulatory, or 
catalytic roles. 


Transcription Produces RNA That Is Complementary to 
One Strand of DNA 


All the RNA in a cell is made by transcription, a process that has certain 
similarities to DNA replication (discussed in Chapter 6). Transcription 
begins with the opening of a small portion of the DNA double helix to 
expose the bases on each DNA strand. One of the two strands of the 
DNA double helix then serves as a template for the synthesis of RNA. 
Ribonucleotides are added, one by one, to the growing RNA chain; as in 
DNA replication, the nucleotide sequence of the RNA chain is determined 
by complementary base-pairing with the DNA template strand. When a 
good match is made, the incoming ribonucleoside triphosphate is cova- 
lently linked to the growing RNA chain by the enzyme RNA polymerase. 
The RNA chain produced by transcription—the RNA transcript—there- 
fore has a nucleotide sequence exactly complementary to the strand of 
DNA used as the template (Figure 7-6). 


Transcription differs from DNA replication, however, in several crucial 
respects. Unlike a newly formed DNA strand, the RNA strand does not 
remain hydrogen-bonded to the DNA template strand. Instead, just behind 
the region where the ribonucleotides are being added, the RNA chain 
is displaced and the DNA helix re-forms. For this reason—and because 
only one strand of the DNA molecule is transcribed—RNA molecules are 
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single-stranded. Furthermore, a given RNA molecule is copied from only 
a limited region of DNA, making it much shorter than the DNA molecule 
from which it is made. A DNA molecule in a human chromosome can 
be up to 250 million nucleotide pairs long, whereas most mature RNAs 
are no more than a few thousand nucleotides long, and many are much 
shorter than that. 


Like the DNA polymerase that carries out DNA replication (discussed in 
Chapter 6), RNA polymerases catalyze the formation of the phosphodies- 
ter bonds that link the nucleotides together and form the sugar-phosphate 
backbone of the RNA chain (see Figure 7-3). The RNA polymerase moves 
stepwise along the DNA, unwinding the DNA helix just ahead to expose 
a new region of the template strand for complementary base-pairing. 
In this way, the growing RNA chain is elongated by one nucleotide at 
a time in the 5'-to-3' direction (Figure 7-7). The incoming ribonucleo- 
side triphosphates (ATP, CTP, UTP, and GTP) provide the energy needed 
to drive the reaction forward, analogous to the process of DNA synthesis 
(see Figure 6-11). 


The almost immediate release of the RNA strand from the DNA as it is syn- 
thesized means that many RNA copies can be made from the same gene 
in a relatively short time; the synthesis of the next RNA is usually started 
before the first RNA has been completed (Figure 7-8). A medium-sized 
gene—say, 1500 nucleotide pairs—requires approximately 50 seconds for 
a molecule of RNA polymerase to transcribe it (Movie 7.2). At any given 
time, there could be dozens of polymerases speeding along this single 
stretch of DNA, hard on one another's heels, allowing more than 1000 
transcripts to be synthesized in an hour. For most genes, however, the 
amount of transcription is much less than this. 
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Figure 7-7 DNA is transcribed into 

RNA by the enzyme RNA polymerase. 
(A) RNA polymerase (pale blue) moves 
stepwise along the DNA, unwinding the 
DNA helix in front of it. As it progresses, the 
polymerase adds ribonucleotides one-by- 
one to the RNA chain, using an exposed 
DNA strand as a template. The resulting 
RNA transcript is thus single-stranded and 
complementary to the template strand 
(see Figure 7-6). As the polymerase moves 
along the DNA template, it displaces 

the newly formed RNA, allowing the two 
strands of DNA behind the polymerase 

to rewind. A short region of hybrid DNA/ 
RNA helix (approximately nine nucleotides 
in length) therefore forms only transiently, 
causing a “window” of DNA/RNA helix to 
move along the DNA with the polymerase. 
Note that although the primase discussed 
in Chapter 6 and RNA polymerase both 
synthesize RNA using a DNA template, 
they are different enzymes, encoded by 
different genes. 


QUESTION 7-2 


In the electron micrograph in Figure 
7-8, are the RNA polymerase 
molecules moving from right to left 
or from left to right? Why are the 
RNA transcripts so much shorter 
than the DNA segments (genes) that 
encode them? 


Figure 7-8 Many molecules of RNA 
polymerase can simultaneously transcribe 
the same gene. Shown in this electron 
micrograph are two adjacent ribosomal 
genes on a single DNA molecule. Molecules 
of RNA polymerase are barely visible as 

a series of tiny dots along the spine of 

the DNA molecule; each polymerase has 

an RNA transcript (a short, fine thread) 
radiating from it. The RNA molecules being 
transcribed from the two ribosomal genes— 
ribosomal RNAs (rRNAs)—are not translated 
into protein, but are instead used directly as 
components of ribosomes, macromolecular 
machines made of RNA and protein. The 
large particles that can be seen at the free, 
5’ end of each rRNA transcript are ribosomal 
proteins that have assembled on the ends 
of the growing transcripts. These proteins 
will be discussed later in the chapter. 
(Courtesy of Ulrich Scheer.) 
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From DNA to Protein: How Cells Read the Genome 


Although RNA polymerase catalyzes essentially the same chemical reac- 
tion as DNA polymerase, there are some important differences between 
the two enzymes. First, and most obviously, RNA polymerase uses ribo- 
nucleoside triphosphates as substrates, so it catalyzes the linkage of 
ribonucleotides, not deoxyribonucleotides. Second, unlike the DNA poly- 
merase involved in DNA replication, RNA polymerases can start an RNA 
chain without a primer and do not accurately proofread their work. This 
sloppiness is tolerated because RNA, unlike DNA, is not used as the per- 
manent storage form of genetic information in cells, so mistakes in RNA 
transcripts have relatively minor consequences for a cell. RNA polymer- 
ases make about one mistake for every 104 nucleotides copied into RNA, 
whereas DNA polymerase makes only one mistake for every 107 nucleo- 
tides copied. 


Cells Produce Various Types of RNA 


The majority of genes carried in a cell's DNA specify the amino acid 
sequences of proteins. The RNA molecules encoded by these genes— 
which ultimately direct the synthesis of proteins—are called messenger 
RNAs (mRNAs). In eukaryotes, each MRNA typically carries information 
transcribed from just one gene, which codes for a single protein; in bac- 
teria, a set of adjacent genes is often transcribed as a single MRNA, which 
therefore carries the information for several different proteins. 


The final product of other genes, however, is the RNA itself. As we see 
later, these noncoding RNAs, like proteins, have various roles, serving 
as regulatory, structural, and catalytic components of cells. They play 
key parts, for example, in translating the genetic message into protein: 
ribosomal RNAs (rRNAs) form the structural and catalytic core of the ribo- 
somes, which translate mRNAs into protein, and transfer RNAs (tRNAs) act 
as adaptors that select specific amino acids and hold them in place on a 
ribosome for their incorporation into protein. Other small RNAs, called 
microRNAs (miRNAs), serve as key regulators of eukaryotic gene expres- 
sion, as we discuss in Chapter 8. The most common types of RNA are 
summarized in Table 7-1. 


In the broadest sense, the term gene expression refers to the process 
by which the information encoded in a DNA sequence is converted into 
a product, whether RNA or protein, that has some effect on a cell or 
organism. In cases where the final product of the gene is a protein, gene 
expression includes both transcription and translation. When an RNA 
molecule is the gene's final product, however, gene expression does not 
require translation. 


TABLE 7-1 TYPES OF RNA PRODUCED IN CELLS 
Tiree [Rincon 


messenger RNAs (mRNAs) code for proteins 


ribosomal RNAs (rRNAs) form the core of the ribosome's structure and 
catalyze protein synthesis 


microRNAs (miRNAs) regulate gene expression 


transfer RNAs (tRNAs) serve as adaptors between mRNA and amino acids 
during protein synthesis 


Other noncoding RNAs used in RNA splicing, gene regulation, telomere 
maintenance, and many other processes 
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Signals in the DNA Tell RNA Polymerase Where to Start 
and Stop Transcription 


The initiation of transcription is an especially critical process because it is 
the main point at which the cell selects which RNAs are to be produced. 
To begin transcription, RNA polymerase must be able to recognize the 
start of a gene and bind firmly to the DNA at this site. The way in which 
RNA polymerases recognize the transcription start site of a gene differs 
somewhat between bacteria and eukaryotes. Because the situation in 
bacteria is simpler, we describe it first. 


When an RNA polymerase collides randomly with a DNA molecule, the 
enzyme sticks weakly to the double helix and then slides rapidly along its 
length. RNA polymerase latches on tightly only after it has encountered 
a gene region called a promoter, which contains a specific sequence of 
nucleotides that lies immediately upstream of the starting point for RNA 
synthesis. As it binds tightly to this sequence, the RNA polymerase opens 
up the double helix immediately in front of the promoter to expose the 
nucleotides on each strand of a short stretch of DNA. One of the two 
exposed DNA strands then acts as a template for complementary base- 
pairing with incoming ribonucleoside triphosphates, two of which are 
joined together by the polymerase to begin synthesis of the RNA strand. 
Elongation then continues until the enzyme encounters a second sig- 
nal in the DNA, the terminator (or stop site), where the polymerase halts 
and releases both the DNA template and the newly made RNA transcript 
(Figure 7-9). The terminator sequence itself is also transcribed, and it is 
the interaction of this 3’ segment of RNA with the polymerase that causes 
the enzyme to let go of the template DNA. 
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Figure 7-10 Bacterial promoters 
and terminators have specific 
nucleotide sequences that are 
recognized by RNA polymerase. 

(A) The green-shaded regions 
represent the nucleotide sequences 
that specify a promoter. The numbers 
above the DNA indicate the positions 
of nucleotides counting from the 

first nucleotide transcribed, which is 
designated +1. The polarity of the 
promoter orients the polymerase 

and determines which DNA strand is 
transcribed. All bacterial promoters 
contain DNA sequences at —10 and 
-35 that closely resemble those shown 
here. (B) The red-shaded regions 
represent sequences in the gene 

that signal the RNA polymerase to 
terminate transcription. Note that the 
regions transcribed into RNA contain 
the terminator but not the promoter 
nucleotide sequences. 


Figure 7-11 On an individual 
chromosome, some genes are transcribed 
using one DNA strand as a template, and 
others are transcribed from the other 
DNA strand. RNA polymerase always 
moves in the 3'-to-5' direction with respect 
to the template DNA strand. Which strand 
will serve as the template is determined 

by the polarity of the promoter sequences 
(green arrowheads) at the beginning of 
each gene. In this drawing, gene a, which 

is transcribed from left to right, uses the 
bottom DNA strand as its template (see 
Figure 7-10); gene b, which is transcribed 
from right to left, uses the top strand as its 
template. 
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Because the polymerase must bind tightly to DNA before transcription 
can begin, a segment of DNA will be transcribed only if it is preceded 
by a promoter. This ensures that only those portions of a DNA molecule 
that contain a gene will be transcribed into RNA. The nucleotide sequences 
of a typical promoter—and a typical terminator—are presented in 
Figure 7-10. 


In bacteria, it is a subunit of RNA polymerase, the sigma (o) factor (see 
Figure 7-9), that is primarily responsible for recognizing the promoter 
sequence on the DNA. But how can this factor “see” the promoter, given 
that the base pairs in question are situated in the interior of the DNA 
double helix? It turns out that each base presents unique features to the 
outside of the double helix, allowing the sigma factor to initially identify 
the promoter sequence without having to separate the entwined DNA 
strands. As it begins to open the DNA double helix, the sigma factor then 
binds to the exposed base pairs, keeping the double helix open. 


The next problem an RNA polymerase faces is determining which of 
the two DNA strands to use as a template for transcription: each strand 
has a different nucleotide sequence and would produce a different RNA 
transcript. The secret lies in the structure of the promoter itself. Every 
promoter has a certain polarity: it contains two different nucleotide 
sequences, laid out in a specific 5’-to-3’ order, upstream of the tran- 
scriptional start site. These asymmetric sequences position the RNA 
polymerase such that it binds to the promoter in the correct orientation 
(see Figure 7-10A). Because the polymerase can only synthesize RNA 
in the 5'-to-3’ direction, once the enzyme is bound it must use the DNA 
strand that is oriented in the 3’-to-5' direction as its template. 


This selection of a template strand does not mean that on a given chro- 
mosome, all transcription proceeds in the same direction. With respect 
to the chromosome as a whole, the direction of transcription can vary 
from one gene to the next. But because each gene typically has only one 
promoter, the orientation of its promoter determines in which direction 
that gene is transcribed and therefore which strand is the template strand 
(Figure 7-11). 
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Initiation of Eukaryotic Gene Transcription Is a Complex 
Process 


Many of the principles we just outlined for bacterial transcription also 
apply to eukaryotes. However, the initiation of transcription in eukary- 
otes differs in several important ways from the process in bacteria: 


1. While bacteria use a single type of RNA polymerase for transcription, 
eukaryotic cells employ three: RNA polymerase I, RNA polymerase II, 
and RNA polymerase III. These polymerases are responsible for 
transcribing different types of genes. RNA polymerases | and III 
transcribe the genes encoding transfer RNA, ribosomal RNA, and 
various other RNAs that play structural and catalytic roles in the 
cell (Table 7-2). RNA polymerase II transcribes the rest, including all 
those that encode proteins—which constitutes the majority of genes 
in eukaryotes (Movie 7.3). Our subsequent discussion will therefore 
focus on RNA polymerase II. 

2. Whereas the bacterial RNA polymerase (along with its sigma subunit) 
is able to initiate transcription on its own, eukaryotic RNA polymerases 
require the assistance of a large set of accessory proteins. Principal 
among these are the general transcription factors, which must assemble 
at each promoter, along with the polymerase, before transcription can 
begin. 

3. The mechanisms that control the initiation of transcription in 
eukaryotes are much more elaborate than those that operate in 
prokaryotes—a point we discuss in detail in Chapter 8. In bacteria, 
genes tend to lie very close to one another, with only very short 
lengths of nontranscribed DNA between them. But in plants and 
animals, including humans, individual genes are spread out along 
the DNA, with stretches of up to 100,000 nucleotide pairs between 
one gene and the next. This architecture allows a single gene to be 
controlled by a large variety of regulatory DNA sequences scattered 
along the DNA, and it enables eukaryotes to engage in more complex 
forms of transcriptional regulation than do bacteria. 

4. Eukaryotic transcription initiation must deal with the packing of DNA 
into nucleosomes and higher-order forms of chromatin structure, as 
we describe in Chapter 8. 

To begin our discussion of eukaryotic transcription, we take a look at the 

general transcription factors and see how they help RNA polymerase II 

initiate the process. 


Eukaryotic RNA Polymerase Requires General 
Transcription Factors 


The initial finding that, unlike bacterial RNA polymerase, purified eukary- 
otic RNA polymerase II cannot initiate transcription on its own in a test 
tube led to the discovery and purification of the general transcription 


TABLE 7-2 THE THREE RNA POLYMERASES IN EUKARYOTIC CELLS 


RNA polymerase | most rRNA genes 


RNA polymerase II all protein-coding genes, miRNA genes, plus genes for 
other noncoding RNAs (e.g., those of the spliceosome) 


RNA polymerase III tRNA genes, 5S rRNA gene, genes for many other small 
RNAs 
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QUESTION 7-3 


Could the RNA polymerase used for 
transcription also be used to make 
the RNA primers required for DNA 
replication (discussed in Chapter 6)? 
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Figure 7-12 To begin transcription, eukaryotic RNA polymerase 

Il requires a set of general transcription factors. These factors are 
designated TFIIB, TFIID, and so on. (A) Most eukaryotic promoters 
contain a DNA sequence called the TATA box. (B) The TATA box is 
recognized by a subunit of the general transcription factor TFIID, 
called the TATA-binding protein (TBP). For simplicity, the DNA 
distortion produced by the binding of the TBP (see Figure 7-13) is not 
shown. (C) The binding of TFIID enables the adjacent binding of TFIIB. 
(D) The rest of the general transcription factors, as well as the RNA 
polymerase itself, then assemble at the promoter. (E) TFIIH pries apart 
the double helix at the transcription start point, using the energy of 
ATP hydrolysis, which exposes the template strand of the gene. TFIIH 
also phosphorylates RNA polymerase II, releasing the polymerase from 
most of the general transcription factors, so it can begin transcription. 
The site of phosphorylation is a long polypeptide “tail” that extends 
from the polymerase. Once the polymerase moves away from the 
promoter, most of the general transcription factors are released 

from the DNA; the exception is TFIID, which remains bound through 
multiple rounds of transcription initiation. 


factors. These accessory proteins assemble on the promoter, where 
they position the RNA polymerase and pull apart the DNA double helix 
to expose the template strand, allowing the polymerase to begin tran- 
scription. Thus, the general transcription factors have a similar role in 
eukaryotic transcription as sigma factor has in bacterial transcription. 


Figure 7-12 shows the assembly of the general transcription factors at a 
promoter used by RNA polymerase II. The process begins with the bind- 
ing of the general transcription factor TFIID to a short segment of DNA 
double helix composed primarily of T and A nucleotides; because of its 
composition, this part of the promoter is known as the TATA box. Upon 
binding to DNA, TFIID causes a dramatic local distortion in the DNA dou- 
ble helix (Figure 7-13); this structure helps to serve as a landmark for the 
subsequent assembly of other proteins at the promoter. The TATA box 
is a key component of many promoters used by RNA polymerase II, and 
it is typically located about 30 nucleotides upstream from the transcrip- 
tion start site. Once TFIID has bound to the TATA box, the other factors 
assemble, along with RNA polymerase II, to form a complete transcription 
initiation complex. Although Figure 7-12 shows the general transcription 
factors loading onto the promoter in a certain sequence, the actual order 
of assembly probably differs somewhat from one promoter to the next. 
Like bacterial promoters, eukaryotic promoters are composed of several 
distinct DNA sequences; these direct the general transcription factors 
where to assemble, and they orient the RNA polymerase so that it will 
begin transcription in the correct direction and on the correct DNA tem- 
plate strand (Figure 7-14). 


Once RNA polymerase II has been positioned on the promoter, it must 
be released from the complex of general transcription factors to begin 
its task of making an RNA molecule. A key step in liberating the RNA 
polymerase is the addition of phosphate groups to its “tail” (see Figure 


Figure 7-13 TATA-binding protein (TBP) binds to the TATA box 
(indicated by letters) and bends the DNA double helix. TBP, a 
subunit of TFIID (see Figure 7-12), distorts the DNA when it binds. 
TBP is a single polypeptide chain that is folded into two very similar 
domains (blue and green). The protein sits atop the DNA double helix 
like a saddle on a bucking horse (Movie 7.4). 
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7-12E). This action is initiated by the general transcription factor TFIIH, 
which contains a protein kinase as one of its subunits. Once transcrip- 
tion has begun, most of the general transcription factors dissociate from 
the DNA and then are available to initiate another round of transcription 
with a new RNA polymerase molecule. When RNA polymerase II finishes 
transcribing a gene, it too is released from the DNA; the phosphates on its 
tail are stripped off by protein phosphatases, and the polymerase is then 
ready to find a new promoter. Only the dephosphorylated form of RNA 
polymerase II can re-initiate RNA synthesis. 


Eukaryotic mRNAs Are Processed in the Nucleus 


The principle of templating, by which DNA is transcribed into RNA, is the 
same in all organisms; however, the way in which the resulting RNA tran- 
scripts are handled before they are translated into protein differs between 
bacteria and eukaryotes. Because bacteria lack a nucleus, their DNA is 
directly exposed to the cytosol, which contains the ribosomes on which 
protein synthesis takes place. As an MRNA molecule in a bacterium starts 
to be synthesized, ribosomes immediately attach to the free 5’ end of the 
RNA transcript and begin translating it into protein. 


In eukaryotic cells, by contrast, DNA is enclosed within the nucleus, which 
is where transcription takes place. Translation, however, occurs on ribo- 
somes that are located in the cytosol. So, before a eukaryotic mRNA 
can be translated into protein, it must be transported out of the nucleus 
through small pores in the nuclear envelope (Figure 7-15). And before it 
can be exported to the cytosol, a eukaryotic RNA must go through several 
RNA processing steps, which include capping, splicing, and polyadenyla- 
tion, as we discuss shortly. These steps take place as the RNA is being 
synthesized. The enzymes responsible for RNA processing ride on the 
phosphorylated tail of eukaryotic RNA polymerase II as it synthesizes an 
RNA molecule (see Figure 7-12), and they process the transcript as it 
emerges from the polymerase (Figure 7-16). 


Figure 7-15 Before they can be translated, mRNA molecules made 
in the nucleus must be exported to the cytosol via pores in the 
nuclear envelope (red arrows). Shown here is a section of a liver cell 
nucleus. The nucleolus is where ribosomal RNAs are synthesized and 
combined with proteins to form ribosomes, which are then exported 
to the cytosol. (From D.W. Fawcett, A Textbook of Histology, 12th ed. 
1994. With permission from Taylor & Francis Books UK.) 
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Figure 7-14 Eukaryotic promoters 


contain sequences that promote the 
binding of the general transcription 
factors. The location of each sequence 
and the general transcription factor that 
recognizes it are indicated. N stands for 
any nucleotide, and a slash (/) indicates 
that either nucleotide can be found at the 
indicated position. For most start sites 
transcribed by RNA polymerase Il, only two 
or three of the four sequences are needed. 
Although most of these DNA sequences 
are located upstream of the transcription 
start site, one, at +30, is located within the 
transcribed region of the gene. 
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Figure 7-17 Eukaryotic mRNA 
molecules are modified by capping and 
polyadenylation. (A) A eukaryotic mRNA 
has a cap at the 5' end and a poly-A tail at 
the 3' end. In addition to the nucleotide 
sequences that code for protein, most 
mRNAs also contain extra, noncoding 
sequences, as shown. The noncoding 
portion at the 5' end is called the 

5’ untranslated region, or 5’ UTR, and that 
at the 3’ end is called the 3’ UTR. (B) The 
structure of the 5’ cap. Many eukaryotic 


mRNA caps carry an additional modification: 


the 2'-hydroxyl group on the second ribose 
sugar in the mRNA is methylated (not 
shown). 
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Figure 7-16 Phosphorylation of the tail of RNA polymerase II 
allows RNA-processing proteins to assemble there. Capping, 
polyadenylation, and splicing are all modifications that occur as the 
RNA is being synthesized. Note that the phosphates shown here 
are in addition to the ones required for transcription initiation (see 
Figure 7-12). 


Two of these processing steps, capping and polyadenylation, occur on all 
RNA transcripts destined to become mRNA molecules. 


1. RNA capping modifies the 5’ end of the RNA transcript, the part of 
the RNA that is synthesized first. The RNA cap includes an atypical 
nucleotide: a guanine (G) nucleotide bearing a methyl group is 
attached to the 5’ end of the RNA in an unusual way (Figure 7-17). 
In bacteria, by contrast, the 5’ end of an mRNA molecule is simply 
the first nucleotide of the transcript. In eukaryotic cells, capping takes 
place after RNA polymerase II has produced about 25 nucleotides of 
RNA, long before it has completed transcribing the whole gene. 

2. Polyadenylation provides a newly transcribed mRNA with a 
special structure at its 3’ end. In contrast with bacteria, where the 
3’ end of an MRNA is simply the end of the chain synthesized by the 
RNA polymerase, the 3’ end of a eukaryotic MRNA is first trimmed 
by an enzyme that cuts the RNA chain at a particular sequence of 
nucleotides. The transcript is then finished off by a second enzyme 
that adds a series of repeated adenine (A) nucleotides to the trimmed 
end. This poly-A tail is generally a few hundred nucleotides long (see 
Figure 7-17A). 

These two modifications—capping and polyadenylation—increase the 
stability of a eukaryotic mRNA molecule, facilitate its export from the 
nucleus to the cytosol, and generally mark the RNA molecule as an 
mRNA. They are also used by the protein-synthesis machinery to make 
sure that both ends of the mRNA are present and that the message is 
therefore complete before protein synthesis begins. 
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In Eukaryotes, Protein-Coding Genes Are Interrupted by 
Noncoding Sequences Called Introns 


Most eukaryotic mRNAs have to undergo an additional processing step 
before they become functional. This step involves a far more radical 
modification of the RNA transcript than capping or polyadenylation, and 
it is the consequence of a surprising feature of most eukaryotic genes. 
In bacteria, most proteins are encoded by an uninterrupted stretch of 
DNA sequence that is transcribed into an MRNA that, without any further 
processing, can be translated into protein. Most protein-coding eukary- 
otic genes, in contrast, have their coding sequences interrupted by long, 
noncoding, intervening sequences called introns. The scattered pieces 
of coding sequence—called expressed sequences or exons—are usually 
shorter than the introns, and they often represent only a small fraction of 
the total length of the gene (Figure 7-18). Introns range in length from a 
single nucleotide to more than 10,000 nucleotides. Some protein-coding 
eukaryotic genes lack introns altogether, some have only a few, but most 
have many (Figure 7-19). Note that the terms “exon” and “intron” apply 
to both the DNA and the corresponding RNA sequences. 


Introns Are Removed from Pre-mRNAs by RNA Splicing 


To produce an mRNA in a eukaryotic cell, the entire length of the gene, 
introns as well as exons, is transcribed into RNA. After capping, and as 
RNA polymerase II continues to transcribe the gene, RNA splicing begins. 
In this process, the introns are removed from the newly synthesized RNA 
and the exons are stitched together. Each transcript ultimately receives a 
poly-A tail; in many cases, this happens after splicing, whereas in other 
cases, it occurs before the final splicing reactions have been completed. 
Once a transcript has been spliced and its 5' and 3’ ends have been mod- 
ified, the RNA is now a functional mRNA molecule that can leave the 
nucleus and be translated into protein. Before these steps are completed, 
the RNA transcript is known as a precursor-MRNA or pre-mRNA for short. 


How does the cell determine which parts of the RNA transcript to remove 
during splicing? Unlike the coding sequence of an exon, most of the 
nucleotide sequence of an intron is unimportant. Although there is lit- 
tle overall resemblance between the nucleotide sequences of different 
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Figure 7-18 Eukaryotic and bacterial 
genes are organized differently. A 
bacterial gene consists of a single stretch 

of uninterrupted nucleotide sequence 

that encodes the amino acid sequence of 

a protein. In contrast, the protein-coding 
sequences of most eukaryotic genes (exons) 
are interrupted by noncoding sequences 


(introns). Promoter sequences are indicated 


in green. 
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Figure 7-19 Most protein- 
coding human genes are 
broken into multiple exons 
and introns. (A) The B-globin 
gene, which encodes one of 
the subunits of the oxygen- 
carrying protein hemoglobin, 
contains 3 exons. (B) The 
gene that encodes Factor 
VIII, a protein that functions 
in the blood-clotting 
pathway, contains 26 exons. 
Mutations in this large gene 
are responsible for the most 
prevalent form of the blood 
disorder hemophilia. 
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Figure 7-20 Special nucleotide sequences in a pre-mRNA transcript signal the 
beginning and the end of an intron. Only the nucleotide sequences shown are 
required to remove an intron; the other positions in an intron can be occupied by 
any nucleotide. The special sequences are recognized primarily by small nuclear 
ribonucleoproteins (snRNPs), which direct the cleavage of the RNA at the intron- 
exon borders and catalyze the covalent linkage of the exon sequences. Here, in 
addition to the standard symbols for nucleotides (A, C, G, U), R stands for either A 

or G; Y stands for either C or U; and N stands for any nucleotide. The A shown in red 
forms the branch point of the lariat produced in the splicing reaction shown in Figure 
7-21. The distances along the RNA between the three splicing sequences are highly 
variable; however, the distance between the branch point and the 5’ splice junction is 
typically much longer than that between the 3’ splice junction and the branch point 
(see Figure 7-21). The splicing sequences shown are from humans; similar sequences 
direct RNA splicing in other eukaryotes. 


introns, each intron contains a few short nucleotide sequences that act 
as cues for its removal from the pre-mRNA. These special sequences are 
found at or near each end of the intron and are the same or very similar in 
all introns (Figure 7-20). Guided by these sequences, an elaborate splic- 
ing machine cuts out the intron in the form of a “lariat” structure (Figure 
7-21), formed by the reaction of an adenine nucleotide, highlighted in red 
in both Figures 7-20 and 7-21, with the beginning of the intron. 


Although we will not describe the splicing process in detail, itis worthwhile 
to note that, unlike the other steps of mRNA production, RNA splicing is 
carried out largely by RNA molecules rather than proteins. These RNA 
molecules, called small nuclear RNAs (snRNAs), are packaged with 
additional proteins to form small nuclear ribonucleoproteins (snRNPs, pro- 
nounced “snurps”). The snRNPs recognize splice-site sequences through 
complementary base-pairing between their RNA components and the 
sequences in the pre-mRNA, and they carry out the chemistry of splic- 
ing (Figure 7-22). RNA molecules that catalyze reactions in this way are 
known as ribozymes, and we discuss them in more detail later in the 
chapter. Together, these snRNPs form the core of the spliceosome, the 
large assembly of RNA and protein molecules that carries out RNA splic- 
ing in the nucleus. To watch the spliceosome in action, see Movie 7.5. 


Figure 7-21 An intron in a pre-mRNA molecule forms a branched 
structure during RNA splicing. In the first step, the branch-point 
adenine (red A) in the intron sequence attacks the 5’ splice site and 
cuts the sugar-phosphate backbone of the RNA at this point (this is the 
same A highlighted in red in Figure 7-20). In this process, the released 
5' end of the intron becomes covalently linked to the 2'-OH group of 
the ribose of the adenine nucleotide to form a branched structure. In 
the second step of splicing, the free 3’-OH end of the exon sequence 
reacts with the start of the next exon sequence, joining the two exons 
together into a continuous coding sequence. The intron is released as 
a lariat structure, which is eventually degraded in the nucleus. 
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The intron-exon type of gene arrangement in eukaryotes might seem 
wasteful, but it does provide some important benefits. First, the tran- 
scripts of many eukaryotic genes can be spliced in different ways, each of 
which can produce a distinct protein. Such alternative splicing thereby 
allows many different proteins to be produced from the same gene 
(Figure 7-23). About 95% of human genes are thought to undergo alter- 
native splicing. Thus RNA splicing enables eukaryotes to increase the 
already enormous coding potential of their genomes. In Chapter 9, we 
will encounter another advantage of splicing—the production of novel 
proteins—when we discuss how proteins evolve. 
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Figure 7-23 Some pre-mRNAs undergo alternative RNA splicing to produce 
different mRNAs and proteins from the same gene. Whereas all exons are 
transcribed, they can be skipped over by the spliceosome to produce alternatively 
spliced mRNAs, as shown. Such skipping occurs when the splicing signals at the 

5' end of one intron are paired up with the branch-point and 3’ end of a different 
intron. An important feature of alternative splicing is that exons can be skipped 

or included; however, their order—which is specified in the DNA sequence—cannot 
be rearranged. 
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Figure 7-22 Splicing is carried out by a 
collection of RNA-protein complexes 
called snRNPs. Although there are five 
snRNPs and about 200 additional proteins 
required for splicing, only the three most 
important snRNPs—called U1, U2, and 
Ué—are shown here. In the first steps of 
splicing, U1 recognizes the 5’ splice site 
and U2 recognizes the lariat branch-point 
site through complementary base-pairing. 
U6 then “re-checks” the 5’ splice site by 
displacing U1 and base-pairing with this 
intron sequence itself. This “re-reading” 
step improves the accuracy of splicing by 
double-checking the 5’ splice site before 
carrying out the splicing reaction. In the 
next steps, conformational changes in U2 
and Ué—triggered by the hydrolysis of ATP 
by spliceosomal proteins (not shown)— 
drive the formation of the spliceosome 
active site. Once the splicing reactions 
have occurred (see Figure 7-21), the 
spliceosome deposits a group of RNA- 
binding proteins, known as the exon 
junction complex (red), on the mRNA 

to mark the splice site as successfully 
completed. 
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Figure 7-24 RNAs are produced by 
factories within the nucleus. RNAs are 
synthesized and processed (red) and 
DNA is replicated (green) in intracellular 
condensates that form discrete 


compartments within a mammalian nucleus. 


In this micrograph, these loose aggregates 
of protein and nucleic acid were visualized 
by detecting newly synthesized DNA and 
RNA. In some instances, both replication 
and transcription are taking place at the 
same site (yellow). (From D.G. Wansink 
etal., J. Cell Sci. 107:1449-1456, 1994. 
With permission from The Company of 
Biologists.) 
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RNA Synthesis and Processing Takes Place in “Factories” 
Within the Nucleus 


RNA synthesis and processing in eukaryotes requires the coordinated 
action of a large number of proteins, from the RNA polymerases and 
accessory proteins that carry out transcription to the enzymes responsi- 
ble for capping, polyadenylation, and splicing. With so many components 
required to produce and process every one of the RNA molecules that are 
being transcribed, how do all these factors manage to find one another? 


We have already seen that the enzymes responsible for RNA processing 
ride on the phosphorylated tail of eukaryotic RNA polymerase II as it syn- 
thesizes an RNA molecule, so that the RNA transcript can be processed 
as it is being synthesized (see Figure 7-16). In addition to this associa- 
tion, RNA polymerases and RNA-processing proteins also form loose 
molecular aggregates—generally termed intracellular condensates—that 
act as “factories” for the production of RNA. These factories, which bring 
together the numerous RNA polymerases, RNA-processing components, 
and the genes being expressed, are large enough to be seen microscopi- 
cally (Figure 7-24). 


The aggregation of components needed to perform a specific task is not 
unique to RNA transcription. Proteins involved in DNA replication and 
repair also converge to form functional factories dedicated to their spe- 
cific tasks. And genes encoding ribosomal RNAs cluster together in the 
nucleolus (see Figure 5-17), where their RNA products are combined 
with proteins to form ribosomes. These ribosomes, along with the mature 
mRNAs they will decode, must then be exported to the cytosol, where 
translation will take place. 


Mature Eukaryotic mRNAs Are Exported from the Nucleus 


Of all the pre-mRNA that is synthesized by a cell, only a small fraction— 
the sequences contained within mature mRNAs—will be useful. The 
remaining RNA fragments—excised introns, broken RNAs, and aberrantly 
spliced transcripts—are not only useless, but they could be dangerous to 
the cell if allowed to leave the nucleus. How, then, does the cell distin- 
guish between the relatively rare mature MRNA molecules it needs to 
export to the cytosol and the overwhelming amount of debris generated 
by RNA processing? 


The answer is that the transport of mRNA from the nucleus to the cyto- 
sol is highly selective: only correctly processed mRNAs are exported and 
therefore available to be translated. This selective transport is medi- 
ated by nuclear pore complexes, which connect the nucleoplasm with the 
cytosol and act as gates that control which macromolecules can enter 
or leave the nucleus (discussed in Chapter 15). To be “export ready,” an 
mRNA molecule must be bound to an appropriate set of proteins, each 
of which recognizes different parts of a mature mRNA molecule. These 
proteins include poly-A-binding proteins, a cap-binding complex, and 
proteins that bind to mRNAs that have been appropriately spliced (Figure 
7-25). The entire set of bound proteins, rather than any single protein, 
ultimately determines whether an mRNA molecule will leave the nucleus. 
The “waste RNAs” that remain behind in the nucleus are degraded there, 
and their nucleotide building blocks are reused for transcription. 


mRNA Molecules Are Eventually Degraded in the Cytosol 


Because a single mRNA molecule can be translated into protein many 
times (see Figure 7-2), the length of time that a mature mRNA molecule 
persists in the cell greatly influences the amount of protein it produces. 
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Figure 7-25 A specialized set of RNA-binding proteins signals that a completed mRNA is ready for export 
to the cytosol. As indicated on the left, the 5’ cap and poly-A tail of a mature mRNA molecule are “marked” by 


proteins that recognize these modifications. Successful splices are marked by exon junction complexes (see Figure 
7-22). Once an mRNA is deemed “export ready,” a nuclear transport receptor (discussed in Chapter 15) associates 
with the mRNA and guides it through the nuclear pore. In the cytosol, the mRNA can shed some of these proteins 

and bind new ones, which, along with poly-A-binding protein, act as initiation factors for protein synthesis, as we 


discuss in the next section of the chapter. 


Each mRNA molecule is eventually degraded into nucleotides by ribo- 
nucleases (RNAses) present in the cytosol, but the lifespans of mRNA 
molecules differ considerably—depending on the nucleotide sequence of 
the MRNA and the type of cell. In bacteria, most mRNAs are degraded 
rapidly, having a typical lifespan of about 3 minutes. The mRNAs in eukar- 
yotic cells usually persist longer: some, such as those encoding f-globin, 
have lifespans of more than 10 hours, whereas others stick around for 
less than 30 minutes. 


These different lifespans are in part controlled by nucleotide sequences 
in the mRNA itself, most often in the portion of RNA called the 3’ untrans- 
lated region, which lies between the 3’ end of the coding sequence and 
the poly-A tail (see Figure 7-17). The lifespans of different mRNAs help 
the cell control how much protein will be produced. In general, proteins 
made in large amounts, such as ß-globin, are translated from mRNAs 
that have long lifespans, whereas proteins made in smaller amounts, or 
whose levels must change rapidly in response to signals, are typically 
synthesized from short-lived mRNAs. 


The synthesis, processing, and degradation of RNA in eukaryotes and 
prokaryotes is summarized and compared in Figure 7-26. 


FROM RNA TO PROTEIN 


By the end of the 1950s, biologists had demonstrated that the information 
encoded in DNA is copied first into RNA and then into protein. The debate 
then shifted to the “coding problem”: How is the information in a linear 
sequence of nucleotides in an RNA molecule translated into the linear 
sequence of a chemically quite different set of subunits—the amino acids 
in a protein? This fascinating question intrigued scientists from many dif- 
ferent disciplines, including physics, mathematics, and chemistry. Here 
was a cryptogram set up by nature that, after more than 3 billion years 
of evolution, could finally be solved by one of the products of evolution— 
the human brain! Indeed, scientists have not only cracked the code but 
have revealed, in atomic detail, the precise workings of the machinery by 
which cells read this code. 
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Figure 7-26 Producing mRNA molecules 
is more complex in eukaryotes than it 

is in prokaryotes. (A) In eukaryotic cells, 
the pre-mRNA molecule produced by 
transcription contains both intron and 
exon sequences. Its two ends are modified 
by capping and polyadenylation, and the 
introns are removed by RNA splicing. The 
completed mRNA is then transported 
from the nucleus to the cytosol, where 

it is translated into protein. Although 
these steps are depicted as occurring 

one after the other, in reality they occur 
simultaneously. For example, the RNA cap 
is usually added and splicing usually begins 
before transcription has been completed. 
Because of this overlap, transcripts of 

the entire gene (including all introns 

and exons) do not typically exist in the 
cell. Ultimately, mRNAs are degraded by 
RNAses in the cytosol and their nucleotide 
building blocks are reused for transcription. 
(B) In prokaryotes, the production of 
mRNA molecules is simpler. The 5’ end 

of an mRNA molecule is produced by 

the initiation of transcription by RNA 
polymerase, and the 3’ end is produced 
by the termination of transcription. 
Because prokaryotic cells lack a nucleus, 
transcription and translation—as well as 
degradation—take place in a common 
compartment. Translation of a prokaryotic 
mRNA can therefore begin before its 
synthesis has been completed. In both 
eukaryotes and prokaryotes, the amount of 
a protein in a cell depends on the rates of 
each of these steps, as well as on the rates 
of degradation of the mRNA and protein 
molecules. 


An mRNA Sequence Is Decoded in Sets of Three 
Nucleotides 


Transcription as a means of information transfer is simple to understand: 
DNA and RNA are chemically and structurally similar, and DNA can act as 
a direct template for the synthesis of RNA through complementary base- 
pairing. As the term transcription signifies, it is as if a message written 
out by hand were being converted, say, into a typewritten text. The lan- 
guage itself and the form of the message do not change, and the symbols 
used are closely related. 


In contrast, the conversion of the information from RNA into protein rep- 
resents a translation of the information into another language that uses 
different symbols. Because there are only 4 different nucleotides in mRNA 
but 20 different types of amino acids in a protein, this translation can- 
not be accounted for by a direct one-to-one correspondence between a 
nucleotide in RNA and an amino acid in protein. The set of rules by which 
the nucleotide sequence of a gene, through an intermediary mRNA mol- 
ecule, is translated into the amino acid sequence of a protein is known 
as the genetic code. 


In 1961, it was discovered that the sequence of nucleotides in an mRNA 
molecule is read consecutively in groups of three. And because RNA is 
made of 4 different nucleotides, there are 4 x 4 x 4 = 64 possible combi- 
nations of three nucleotides: AAA, AUA, AUG, and so on. However, only 
20 different amino acids are commonly found in proteins. Either some 
nucleotide triplets are never used, or the code is redundant, with some 
amino acids being specified by more than one triplet. The second pos- 
sibility turned out to be correct, as shown by the completely deciphered 
genetic code shown in Figure 7-27. Each group of three consecutive 
nucleotides in RNA is called a codon, and each codon specifies one 
amino acid. The strategy by which this code was cracked is described in 
How We Know, pp. 246-247. 


The same basic genetic code is used in all present-day organisms. 
Although a few slight differences have been found, these occur chiefly in 


From RNA to Protein 245 


AGA UUA AGC 
AGG UUG AGU 
GCA CGA GGA CUA CCA UCA ACA GUA 
codons | Ecc cac GGC AUA CUC CcC UCC ACC GUC UAA 
GCG CGG GAC AAC UGC GAA CAA GGG CAC AUC CUG AAA UUC CCG UCG ACG UAC GUG UAG 
GCU CGU GAU AAU UGU GAG CAG GGU CAU AUU CUU AAG AUG UUU CCU UCU ACU UGG UAU GUU UGA 
amino Ala Arg Asp Asn Cys Glu Gln Gly His Ile Leu Lys Met Phe Pro Ser Thr Trp Tyr Val stop 
acids 
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Figure 7-27 The nucleotide sequence of an mRNA is translated into the amino acid sequence of a protein via 
the genetic code. All of the three-nucleotide codons in mRNAs that specify a given amino acid are listed above 
that amino acid, which is given in both its three-letter and one-letter abbreviations (see Panel 2-6, pp. 76-77, for the 
full name of each amino acid and its structure). Like RNA molecules, codons are usually written with the 5'-terminal 
nucleotide to the left. Note that most amino acids are represented by more than one codon, and there are some 
regularities in the set of codons that specify each amino acid. For example, codons for the same amino acid tend to 
contain the same nucleotides at the first and second positions and vary at the third position. There are three codons 
that do not specify any amino acid but act as termination sites (stop codons), signaling the end of the protein-coding 
sequence in an MRNA. One codon—AUG—acts both as an initiation codon, signaling the start of a protein-coding 


message, and as the codon that specifies the amino acid methionine. 


the mRNA of mitochondria and of some fungi and protozoa. Mitochondria 
have their own DNA replication, transcription, and protein-synthesis 
machinery, which operates independently of the corresponding machin- 
ery in the rest of the cell (discussed in Chapter 14), and they have been 
able to accommodate minor changes to the otherwise universal genetic 
code. Even in fungi and protozoa, the similarities in the code far out- 
weigh the differences. 


In principle, an MRNA sequence can be translated in any one of three dif- 
ferent reading frames, depending on where the decoding process begins 
(Figure 7-28). However, only one of the three possible reading frames 
in an MRNA specifies the correct protein. We discuss later how a special 
signal at the beginning of each MRNA molecule sets the correct reading 
frame. 


tRNA Molecules Match Amino Acids to Codons in mRNA 


The codons in an mRNA molecule do not directly recognize the amino 
acids they specify: the set of three nucleotides does not, for example, bind 
directly to the amino acid. Rather, the translation of mRNA into protein 
depends on adaptor molecules that bind to a codon with one part of the 
adaptor and to an amino acid with another. These adaptors consist of 
a set of small RNA molecules known as transfer RNAs (tRNAs), each 
about 80 nucleotides in length. 


We saw earlier that an RNA molecule generally folds into a three-dimen- 
sional structure by forming internal base pairs between different regions 
of the molecule. If the base-paired regions are sufficiently extensive, they 
will fold back on themselves to form a double-helical structure, like that of 
double-stranded DNA. Such is the case for the tRNA molecule. Four short 
segments of the folded tRNA are double-helical, producing a distinctive 


Figure 7-28 In principle, an mRNA molecule can be translated 

in three possible reading frames. In the process of translating a 
nucleotide sequence (blue) into an amino acid sequence (red), the 
sequence of nucleotides in an mRNA molecule is read from the 5! 

to the 3’ end in sequential sets of three nucleotides. In principle, 
therefore, the same mRNA sequence can specify three completely 
different amino acid sequences, depending on the nucleotide at 
which translation begins—that is, on the reading frame used. In 
reality, however, only one of these reading frames encodes the actual 
message, as we discuss later. 


5 3 

CUC j; AGC GUU ACC AU 
1 l i i M ee) 
Thr 


Ser Val 


Leu 


g UCA GCG UUA; CCA g 


Ala 


Ser Leu Pro 


CUJ CAG CGU j UAC CAU 
3 pie eines) ee fete an 


Gln His 


Arg Tyr 


“ HOW WE KNOW 
CRACKING THE GENETIC CODE 


By the beginning of the 1960s, the central dogma had 
been accepted as the pathway along which informa- 
tion flows from gene to protein. It was clear that genes 
encode proteins, that genes are made of DNA, and that 
mRNA serves as an intermediary, carrying the infor- 
mation from DNA to the ribosome, where the RNA is 
translated into protein. 


Even the general format of the genetic code had been 
worked out: each of the 20 amino acids found in pro- 
teins is represented by a triplet codon in an mRNA 
molecule. But an even greater challenge remained: 
biologists, chemists, and even physicists set their sights 
on breaking the genetic code—attempting to figure out 
which amino acid each of the 64 possible nucleotide 
triplets designates. The most straightforward path to 
the solution would have been to compare the sequence 
of a segment of DNA or of mRNA with its corresponding 
polypeptide product. Techniques for sequencing nucleic 
acids, however, would not be developed for another 
decade. 


So researchers decided that, to crack the genetic code, 
they would have to synthesize their own simple RNA 
molecules. If they could feed these RNA molecules to 
ribosomes—the machines that make proteins—and 
then analyze the resulting polypeptide product, they 
would be on their way to deciphering which triplets 
encode which amino acids. 


Losing the cells 


Before researchers could test their synthetic mRNAs, 
they needed to perfect a cell-free system for protein 
synthesis. This would allow them to translate their 
messages into polypeptides in a test tube. (Generally 
speaking, when working in the laboratory, the simpler 
the system, the easier it is to interpret the results.) To 
isolate the molecular machinery they needed for such 
a cell-free translation system, researchers broke open 
E. coli cells and loaded their contents into a centrifuge 
tube. Spinning these samples at high speed caused the 
membranes and other large chunks of cellular debris to 
be dragged to the bottom of the tube; the lighter cellular 
components required for protein synthesis—including 
mRNA, the tRNA adaptors, ribosomes, enzymes, and 
other small molecules—were left floating near the top 
of the tube (see Panel 4-3, pp. 164-165). Researchers 
found that simply adding radioactive amino acids to 
this cell “soup” would trigger the production of radi- 
olabeled polypeptides. By centrifuging this material 
again, at a higher speed, the researchers could force 
the ribosomes, and any newly synthesized peptides 
attached to them, to the bottom of the tube; the labeled 
polypeptides could then be detected by measuring the 
radioactivity in the sediment remaining in the tube after 
the fluid layer above it had been discarded. 


The trouble with this particular system was that the pro- 
teins it produced were those encoded by the cell’s own 
mRNAs, already present in the extract. But research- 
ers wanted to use their own synthetic messages to 
direct protein synthesis. This problem was solved when 
Marshall Nirenberg discovered that he could destroy 
the cells’ mRNA in the extract by adding a small amount 
of ribonuclease—an enzyme that degrades RNA—to the 
mix. Now all he needed to do was prepare large quanti- 
ties of synthetic mRNA, add it to the cell-free system, 
and see what peptides came out. 


Faking the message 


Producing a synthetic polynucleotide with a defined 
sequence was not as simple as it sounds. Again, it 
would be years before chemists and bioengineers devel- 
oped machines that could synthesize any given string 
of nucleic acids quickly and cheaply. Nirenberg decided 
to use polynucleotide phosphorylase, an enzyme that 
would join ribonucleotides together in the absence of a 
template. The sequence of the resulting RNA would then 
depend entirely on which nucleotides were presented 
to the enzyme. A mixture of nucleotides would be sewn 
into a random sequence; but a single type of nucleotide 
would yield a homogeneous polymer containing only 
that one nucleotide. Thus Nirenberg, working with his 
collaborator Heinrich Matthaei, first produced synthetic 
mRNAs made entirely of uracil—poly U. 


Together, the researchers fed this poly U to their cell- 
free translation system. They then added a single type 
of radioactively labeled amino acid to the mix. After 
testing each amino acid—one at a time, in 20 differ- 
ent experiments—they determined that poly U directs 
the synthesis of a polypeptide containing only phenyl- 
alanine (Figure 7-29). With this electrifying result, the 
first word in the genetic code had been deciphered. 


Nirenberg and Matthaei then repeated the experiment 
with poly A and poly C and determined that AAA codes 
for lysine and CCC for proline. The meaning of poly G 
could not be ascertained by this method because, as we 
now know, this polynucleotide forms an aberrant struc- 
ture that gums up the system. 


Feeding ribosomes with synthetic RNA seemed a 
fruitful technique. But with the single-nucleotide pos- 
sibilities exhausted, researchers had nailed down only 
three codons; they had 61 still to go. The other codons, 
however, were harder to decipher, and a new synthetic 
approach was needed. In the 1950s, the organic chem- 
ist Gobind Khorana had been developing methods for 
preparing mixed polynucleotides of defined sequence— 
but his techniques worked only for DNA. When he 
learned of Nirenberg’s work with synthetic RNAs, 
Khorana directed his energies and skills to producing 
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polyribonucleotides. He found that if he started out by 
making DNAs of a defined sequence, he could then use 
RNA polymerase to produce RNAs from those. In this 
way, Khorana prepared a collection of different RNAs 
of defined repeating sequence: he generated sequences 
of repeating dinucleotides (such as poly UC), trinucleo- 
tides (such as poly UUC), or tetranucleotides (such as 
poly UAUC). 


These mixed polynucleotides, however, yielded results 
that were much more difficult to decode than the mono- 
nucleotide messages that Nirenberg had used. Take poly 
UG, for example. When this repeating dinucleotide was 
added to the translation system, researchers discovered 
that it codes for a polypeptide of alternating cysteines 
and valines. The RNA, of course, contains two different, 
alternating codons: UGU and GUG. So the research- 
ers could say that UGU and GUG code for cysteine and 
valine, although they could not tell which went with 
which. Thus these mixed messages provided useful 
information, but they did not definitively reveal which 
codons specified which amino acids (Figure 7-30). 


Trapping the triplets 


These final ambiguities in the code were resolved when 
Nirenberg and a young medical graduate named Phil 
Leder discovered that RNA fragments that were only 
three nucleotides in length—the size of a single codon— 
could bind to a ribosome and attract the appropriate 
amino-acid-containing tRNA molecule. These com- 
plexes—containing one ribosome, one MRNA codon, 
and one radiolabeled aminoacyl-tRNA—could then be 
captured on a piece of filter paper and the attached 
amino acid identified. 


Their trial run with UUU—the first word—worked splen- 
didly. Leder and Nirenberg primed the usual cell-free 
translation system with snippets of UUU. These tri- 
nucleotides bound to the ribosomes, and Phe-tRNAs 
bound to the UUU. The new system was up and running, 
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Figure 7-29 UUU codes for 
phenylalanine. Synthetic 
mRNAs are fed into a cell-free 
translation system containing 
bacterial ribosomes, tRNAs, 
enzymes, and other small 
molecules. Radioactive amino 
acids were added to this mix, 
one per experiment; when 

the “correct” amino acid was 
added, a radioactive polypeptide 
would be produced. In this case, 
poly U is shown to encode a 
polypeptide containing only 
phenylalanine. 


and the researchers had confirmed that UUU codes for 
phenylalanine. 


All that remained was for researchers to produce all 64 
possible codons—a task that was quickly accomplished 
in both Nirenberg’s and Khorana’s laboratories. Because 
these small trinucleotides were much simpler to syn- 
thesize chemically, and the triplet-trapping tests were 
easier to perform and analyze than the previous decod- 
ing experiments, the researchers were able to work out 
the complete genetic code within the next year. 
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* One codon specifies Cys, the other Val, but which is which? 
The same ambiguity exists for the other codon assignments 
shown here. 


Figure 7-30 Using synthetic RNAs of mixed, repeating 
ribonucleotide sequences, scientists further narrowed 

the coding possibilities. Because these mixed messages 
produced mixed polypeptides, they did not permit the 
unambiguous assignment of a single codon to a specific amino 
acid. For example, the results of the poly-UG experiment 
cannot distinguish whether UGU or GUG encodes cysteine. 

As indicated, the same type of ambiguity confounded the 
interpretation of all the experiments using di-, tri-, and 
tetranucleotides. 
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structure that looks like a cloverleaf when drawn schematically (Figure 
7-31A). As shown in the figure, for example, a 5'-GCUC-3' sequence 
in one part of a polynucleotide chain can base-pair with a 5'-GAGC-3' 
sequence in another region of the same molecule. The cloverleaf under- 
goes further folding to form a compact, L-shaped structure that is held 
together by additional hydrogen bonds between different regions of the 
molecule (Figure 7-31B-D). 


Two regions of unpaired nucleotides situated at either end of the L-shaped 
tRNA molecule are crucial to the function of tRNAs in protein synthesis. 
One of these regions forms the anticodon, a set of three consecutive 
nucleotides that bind, through base-pairing, to the complementary codon 
in an MRNA molecule (Figure 7—31E). The other is a short, single-stranded 
region at the 3’ end of the molecule; this is the site where the amino acid 
that matches the codon is covalently attached to the tRNA. 


We saw in the previous section that the genetic code is redundant; that 
is, several different codons can specify a single amino acid (see Figure 
7-27). This redundancy implies either that there is more than one tRNA 
for many of the amino acids or that some tRNA molecules can base-pair 
with more than one codon. In fact, both situations occur. Some amino 
acids have more than one tRNA, and some tRNAS require accurate base- 
pairing only at the first two positions of the codon and can tolerate a 
mismatch (or wobble) at the third position. This wobble base-pairing 
explains why so many of the alternative codons for an amino acid dif- 
fer only in their third nucleotide (see Figure 7-27). Wobble base-pairings 
make it possible to fit the 20 amino acids to their 61 codons with as few 
as 31 kinds of tRNA molecules. The exact number of different kinds 
of tRNAs, however, differs from one species to the next. For example, 
attached amino i ‘ i A 
acid (Phe) humans have approximately 500 different tRNA genes, but this collection 
3' end includes only 48 different anticodons. 


G 
C 
G 
G 
A 
U 
U 
U 
A 


anticodon 
loop 


anticodon a cloverleaf 


(A) (B) (9 (D) 


5' GCGGAUUUAGCUCS GAGCGCCAGACUGAAYAYCUGGAGGUCCUGUGTYCGAUCCACAGAAUUCGCACCA 3' 


E | 
(E) anticodon 


Figure 7-31 tRNA molecules are molecular adaptors, linking amino acids to codons. In this series of diagrams, the same tRNA 
molecule—in this case, a tRNA specific for the amino acid phenylalanine (Phe)—is depicted in various ways. (A) The conventional 
“cloverleaf” structure shows the complementary base-pairing (red lines) that creates the double-helical regions of the molecule. 

The anticodon loop (blue) contains the sequence of three nucleotides (red letters) that base-pairs with the Phe codon in mRNA. The 
amino acid matching the anticodon is attached at the 3’ end of the tRNA. tRNAs contain some unusual bases, which are produced by 
chemical modification after the tRNA has been synthesized. The bases denoted y (for pseudouridine) and D (for dihydrouridine) are 
derived from uracil. (B and C) Views of the actual L-shaped molecule, based on x-ray diffraction analysis. These two images are rotated 
90° with respect to each other. (D) The schematic representation of tRNA that will be used in subsequent figures emphasizes the 
anticodon. (E) The linear nucleotide sequence of the tRNA molecule, color-coded to match (A), (B), and (C). 


Specific Enzymes Couple tRNAs to the Correct 
Amino Acid 


For atRNA molecule to carry out its role as an adaptor, it must be linked— 
or charged—with the correct amino acid. How does each tRNA molecule 
recognize the one amino acid in 20 that is its proper partner? Recognition 
and attachment of the correct amino acid depend on enzymes called 
aminoacyl-tRNA synthetases, which covalently couple each amino 
acid to the appropriate set of tRNA molecules. In most organisms, there 
is a different synthetase enzyme for each amino acid. That means that 
there are 20 synthetases in all: one attaches glycine to all tRNAs that rec- 
ognize codons for glycine, another attaches phenylalanine to all tRNAs 
that recognize codons for phenylalanine, and so on. Each synthetase 
enzyme recognizes its designated amino acid, as well as nucleotides in 
the anticodon loop and in the amino-acid-accepting arm that are specific 
to the correct tRNA (Figure 7-32 and Movie 7.6). The synthetases are 
thus equal in importance to the tRNAs in the decoding process, because 
it is the combined action of the synthetases and tRNAs that allows each 
codon in the mRNA molecule to be correctly matched to its amino acid 
(Figure 7-33). 


The synthetase-catalyzed reaction that attaches the amino acid to the 
3’ end of the tRNA is one of many reactions in cells that is coupled to 
the energy-releasing hydrolysis of ATP (see Figure 3-32). The reaction 
produces a high-energy bond between the charged tRNA and the amino 
acid. The energy of this bond is later used to link the amino acid cova- 
lently to the growing polypeptide chain. 


The mRNA Message Is Decoded on Ribosomes 


The recognition of a codon by the anticodon on a tRNA molecule depends 
on the same type of complementary base-pairing used in DNA replica- 
tion and transcription. However, accurate and rapid translation of mRNA 
into protein requires a molecular machine that can latch onto an MRNA, 
capture and position the correct tRNA molecules, and then covalently 
link the amino acids that they carry to form a polypeptide chain. In both 
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Figure 7-32 Each aminoacyl-tRNA 
synthetase makes multiple contacts with 
its tRNA molecule. For this tRNA, which 

is specific for the amino acid glutamine, 
nucleotides in both the anticodon 

loop (dark blue) and the amino-acid- 
accepting arm (green) are recognized by 
the synthetase (yellow-green). The ATP 
molecule that will be hydrolyzed to provide 
the energy needed to attach the amino acid 
to the tRNA is shown in red. 
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Figure 7-33 The genetic code is translated by aminoacyl-tRNA synthetases and tRNAs. Each synthetase couples a particular amino 
acid to its corresponding tRNAs, a process called charging. The anticodon on the charged tRNA molecule then forms base pairs with 
the appropriate codon on the mRNA. An error in either the charging step or the binding of the charged tRNA to its codon will cause 
the wrong amino acid to be incorporated into a polypeptide chain. In the sequence of events shown, the amino acid tryptophan (Trp) is 
specified by the codon UGG on the mRNA. 
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Figure 7-34 Ribosomes are located in endoplasmic reticulum 
the cytoplasm of eukaryotic cells. This 
electron micrograph shows a thin section of 
a small region of cytoplasm. The ribosomes 
appear as small gray blobs. Some are free 
in the cytoplasm (red arrows); others are 
attached to membranes of the endoplasmic 
reticulum (green arrows). (Courtesy of 
George Palade.) 


prokaryotes and eukaryotes, the machine that gets the job done is the 
QUESTION 7-4 ribosome—a large complex made from dozens of small proteins (the 


ribosomal proteins) and several RNA molecules called ribosomal RNAs 


In a clever experiment performed in (rRNAs). A typical eukaryotic cell contains millions of ribosomes in its 
1962, a cysteine already attached to cytosol (Figure 7-34). 

its tRNA was chemically converted 

to an alanine. These “hybrid” tRNA Eukaryotic and prokaryotic ribosomes are very similar in structure and 
molecules were then added to a cell- function. Both are composed of one large subunit and one small subunit, 
free translation system from which which fit together to form a complete ribosome with a mass of several 
the normal cysteine-tRNAs had million daltons (Figure 7-35); for comparison, an average-sized protein 


been removed. When the resulting 
protein was analyzed, it was found 
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Figure 7-35 The eukaryotic ribosome 

is a large complex of four rRNAs and eae 

more than 80 small proteins. Prokaryotic . r 

ribosomes are very similar: both are formed pr oben aaa 

from a large and small subunit, which only small 

come together after the small subunit has subunit 

bound an mRNA. The RNAs account for 

most of the mass of the ribosome and give complete eukaryotic ribosome 


it its overall shape and structure. MW = 4,200,000 


has a mass of 30,000 daltons. The small ribosomal subunit matches the 
tRNAs to the codons of the mRNA, while the large subunit catalyzes 
the formation of the peptide bonds that covalently link the amino acids 
together into a polypeptide chain. These two subunits come together on 
an mRNA molecule near its 5’ end to start the synthesis of a protein. The 
mRNA is then pulled through the ribosome like a long piece of tape. As 
the mRNA inches forward in a 5’-to-3’ direction, the ribosome translates 
its nucleotide sequence into an amino acid sequence, one codon at a 
time, using the tRNAs as adaptors. Each amino acid is thereby added 
in the correct sequence to the end of the growing polypeptide chain 
(Movie 7.7). When synthesis of the protein is finished, the two subunits 
of the ribosome separate. Ribosomes operate with remarkable efficiency: 
a eukaryotic ribosome adds about 2 amino acids to a polypeptide chain 
each second; a bacterial ribosome operates even faster, adding about 20 
amino acids per second. 


How does the ribosome choreograph all the movements required for 
translation? In addition to a binding site for an mRNA molecule, each 
ribosome contains three binding sites for tRNA molecules, called the A 
site, the P site, and the E site (Figure 7-36). To add an amino acid to a 
growing peptide chain, a charged tRNA enters the A site by base-pairing 
with the complementary codon on the mRNA molecule. Its amino acid is 
then linked to the growing peptide chain, which is held in place by the 
tRNA in the neighboring P site. Next, the large ribosomal subunit shifts 
forward, moving the spent tRNA to the E site before ejecting it (Figure 
7-37). This cycle of reactions is repeated each time an amino acid is 
added to the polypeptide chain, with the new protein growing from its 
amino to its carboxyl end until a stop codon in the mRNA is encountered 
and the protein is released. 
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Figure 7-36 Each ribosome has a binding site for an mRNA molecule and three 
binding sites for tRNAs. The tRNA sites are designated the A, P, and E sites (short 
for aminoacyl-tRNA, peptidyl-tRNA, and exit, respectively). (A) Three-dimensional 
structure of a bacterial ribosome, as determined by x-ray crystallography, with the 
small subunit in dark green and the large subunit in light green. Both the rRNAs and 
the ribosomal proteins are shown in green. tRNAs are shown bound in the E site 
(red), the P site (orange), and the A site (yellow). Although all three of the tRNA sites 
shown here are filled, during protein synthesis only two of these sites are occupied 
by a tRNA at any one time (see Figure 7-37). (B) Highly schematized representation 
of a ribosome, in the same orientation as (A), which is used in subsequent figures. 
Note that both the large and small subunits are involved in forming the A, P, and E 
sites, while only the small subunit contains the binding site for an mRNA. (A, adapted 
from M.M. Yusupov et al., Science 292:883-896, 2001. Courtesy of Albion A. Bausom 
and Harry Noller.) 
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Figure 7-38 Ribosomal RNAs give the 
ribosome its overall shape. Shown here are 
the detailed structures of the two rRNAs that 
form the core of the large subunit of a bacterial 
ribosome—the 23S rRNA (blue) and the 5S 
rRNA (purple). One of the protein subunits of 
the ribosome (L1) is included as a reference 
point, as this protein forms a characteristic 
protrusion on the ribosome surface. 

Ribosomal RNAs are commonly designated 

by their “S values,” which refer to their rate of 
sedimentation in an ultracentrifuge. The larger 
the S value, the larger the size of the molecule. 
(Adapted from N. Ban et al., Science 289: 
905-920, 2000.) 


Figure 7-37 Translation takes place in a four-step cycle, which 

is repeated over and over during the synthesis of a protein. In 

step 1, a charged tRNA carrying the next amino acid to be added 

to the polypeptide chain binds to the vacant A site on the ribosome 
by forming base pairs with the mRNA codon that is exposed there. 
Only a matching tRNA molecule can base-pair with this codon, which 
determines the specific amino acid added. The A and P sites are 
sufficiently close together that their two tRNA molecules are forced to 
form base pairs with codons that are contiguous, with no stray bases 
in-between. This positioning of the tRNAs ensures that the correct 
reading frame will be preserved throughout the synthesis of the 
protein. In step 2, the carboxyl end of the polypeptide chain (amino 
acid 3 in step 1) is uncoupled from the tRNA at the P site and joined by 
a peptide bond to the free amino group of the amino acid linked to the 
tRNA at the A site. This reaction is carried out by a catalytic site in the 
large subunit. In step 3, a shift of the large subunit relative to the small 
subunit moves the two bound tRNAs into the E and P sites of the large 
subunit. In step 4, the small subunit moves exactly three nucleotides 
along the mRNA molecule, bringing it back to its original position 
relative to the large subunit. This movement ejects the spent tRNA 
and resets the ribosome with an empty A site so that the next charged 
tRNA molecule can bind (Movie 7.8). 

As indicated, the mRNA is translated in the 5'-to-3' direction, and the 
N-terminal end of a protein is made first, with each cycle adding one 
amino acid to the C-terminus of the polypeptide chain. To watch the 
translation cycle in atomic detail, see Movie 7.9. 


The Ribosome Is a Ribozyme 


The ribosome is one of the largest and most complex structures in the cell, 
composed of two-thirds RNA and one-third protein by weight. The deter- 
mination of the entire three-dimensional structure of its large and small 
subunits in 2000 was a major triumph of modern biology. The structure 
confirmed earlier evidence that the rRNAs—not the proteins—are respon- 
sible for the ribosome’s overall structure and its ability to choreograph 
and catalyze protein synthesis. 


The rRNAs are folded into highly compact, precise three-dimensional 
structures that form the core of the ribosome (Figure 7-38). In contrast 
to the central positioning of the rRNAs, the ribosomal proteins are gener- 
ally located on the surface, where they fill the gaps and crevices of the 
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folded RNA. The main role of the ribosomal proteins seems to be to help 
fold and stabilize the RNA core, while permitting the changes in rRNA 
conformation that are necessary for this RNA to catalyze efficient protein 
synthesis. 


Not only are the three tRNA-binding sites (the A, P, and E sites) on the 
ribosome formed primarily by the rRNAs, but the catalytic site for peptide 
bond formation is formed by the 23S rRNA of the large subunit; the near- 
est ribosomal protein is located too far away to make contact with the 
incoming amino acid or with the growing polypeptide chain. The cata- 
lytic site in this RNA—a peptidyl transferase—is similar in many respects 
to that found in some protein enzymes: it is a highly structured pocket 
that precisely orients the two reactants—the elongating polypeptide and 
the amino acid carried by the incoming tRNA—thereby greatly increasing 
the likelihood of a productive reaction. 


RNA molecules that possess catalytic activity are called ribozymes. In 
the final section of this chapter, we will consider other ribozymes and 
discuss what the existence of RNA-based catalysis might mean for the 
early evolution of life on Earth. Here, we need only note that there is good 
reason to suspect that RNA rather than protein molecules served as the 
first catalysts for living cells. If so, the ribosome, with its catalytic RNA 
core, could be viewed as a relic of an earlier time in life’s history, when 
cells were run almost entirely by RNAs. 


Specific Codons in an MRNA Signal the Ribosome 
Where to Start and to Stop Protein Synthesis 


In a test tube, ribosomes can be forced to translate any RNA molecule 
(see How We Know, pp. 246-247). In a cell, however, a specific start sig- 
nal is required to initiate translation. The site at which protein synthesis 
begins on an mRNA is crucial, because it sets the reading frame for the 
entire message. An error of one nucleotide either way at this stage will 
cause every subsequent codon in the mRNA to be misread, resulting in a 
nonfunctional protein with a garbled sequence of amino acids (see Figure 
7-28). Furthermore, the rate of initiation has a major impact on the over- 
all rate at which the protein is synthesized from the mRNA. 


The translation of an mRNA begins with the codon AUG, for which a 
special charged tRNA is required. This initiator tRNA always carries 
the amino acid methionine (or a modified form of methionine, formyl- 
methionine, in bacteria). Thus newly made proteins all have methionine 
as the first amino acid at their N-terminal end, the end of a protein that 
is synthesized first. This methionine is usually removed later by a specific 
protease. 


In eukaryotes, an initiator tRNA, charged with methionine, is first loaded 
into the P site of the small ribosomal subunit, along with additional pro- 
teins called translation initiation factors (Figure 7-39). The initiator 
tRNA is distinct from the tRNA that normally carries methionine. Of all 
the tRNAs in the cell, only a charged initiator tRNA molecule is capable of 
binding tightly to the P site in the absence of the large ribosomal subunit. 


Figure 7-39 Initiation of protein synthesis in eukaryotes requires 
translation initiation factors and a special initiator tRNA. Although 
not shown here, efficient translation initiation also requires additional 
proteins that are bound at the 5’ cap and poly-A tail of the mRNA 
(see Figure 7-25). In this way, the translation apparatus can ascertain 
that both ends of the mRNA are intact before initiating translation. 
Following initiation, the protein is elongated by the reactions outlined 
in Figure 7-37. 
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Figure 7-40 A single prokaryotic mRNA molecule can encode several different 
proteins. In prokaryotes, genes directing the different steps in a process are often 
organized into clusters (operons) that are transcribed together into a single mRNA. 
A prokaryotic mRNA does not have the same sort of 5’ cap as a eukaryotic mRNA, 
but instead has a triphosphate at its 5’ end. Prokaryotic ribosomes initiate translation 
at ribosome-binding sites (dark blue), which can be located in the interior of an 
mRNA molecule. This feature enables prokaryotes to simultaneously synthesize 
different proteins from a single MRNA molecule, with each protein made by a 
different ribosome. 


Next, the small ribosomal subunit loaded with the initiator tRNA binds 
to the 5’ end of an MRNA molecule, which is marked by the 5’ cap that is 
present on all eukaryotic mRNAs (see Figure 7-17). The small ribosomal 
subunit then scans the mRNA, in the 5'-to-3' direction, until it encounters 
the first AUG. When this AUG is recognized by the initiator tRNA, several 
of the initiation factors dissociate from the small ribosomal subunit to 
make way for the large ribosomal subunit to bind and complete riboso- 
mal assembly. Because the initiator tRNA is bound to the P site, protein 
synthesis is ready to begin with the addition of the next charged tRNA to 
the A site (see Figure 7-37). 


The mechanism for selecting a start codon is different in bacteria. Bacterial 
mRNAs have no 5’ caps to tell the ribosome where to begin searching for 
the start of translation. Instead, each MRNA molecule contains a specific 
ribosome-binding sequence, approximately six nucleotides long, located 
a few nucleotides upstream of the AUG at which translation is to begin. 
Unlike a eukaryotic ribosome, a prokaryotic ribosome can readily bind 
directly to a start codon that lies in the interior of an MRNA, as long as a 
ribosome-binding site precedes it by several nucleotides. Such ribosome- 
binding sequences are necessary in bacteria, as prokaryotic mRNAs are 
often polycistronic—that is, they encode several different proteins on the 
same mRNA molecule; these transcripts contain a separate ribosome- 
binding site for each protein-coding sequence (Figure 7-40). In contrast, 
a eukaryotic MRNA usually carries the information for a single protein, 
and so it can rely on the 5’ cap—and the proteins that recognize it—to 
position the ribosome for its AUG search. 


The end of translation in both prokaryotes and eukaryotes is signaled by 
the presence of one of several codons, called stop codons, in the mRNA 
(see Figure 7-27). The stop codons—UAA, UAG, and UGA—are not recog- 
nized by a tRNA and do not specify an amino acid, but instead signal to 
the ribosome to stop translation. Proteins known as release factors bind 
to any stop codon that reaches the A site on the ribosome; this binding 
alters the activity of the peptidyl transferase in the ribosome, causing it 
to catalyze the addition of a water molecule instead of an amino acid to 
the peptidyl-tRNA (Figure 7-41). This reaction frees the carboxyl end of 
the polypeptide chain from its attachment to a tRNA molecule; because 
this is the only attachment that holds the growing polypeptide to the 


Figure 7-41 Translation halts at a stop codon. In the final phase of 
protein synthesis, the binding of release factor to an A site bearing 

a stop codon terminates translation of an mRNA molecule. The 
completed polypeptide is released, and the ribosome dissociates into 
its two separate subunits. 


ribosome, the completed protein chain is immediately released. At this 
point, the ribosome also releases the mRNA and dissociates into its two 
separate subunits, which can then assemble on another mRNA molecule 
to begin a new round of protein synthesis. 


Proteins Are Produced on Polyribosomes 


The synthesis of most protein molecules takes between 20 seconds and 
several minutes. But even during this short period, multiple ribosomes 
usually bind to each MRNA molecule being translated. If an mRNA is 
being translated efficiently, a new ribosome will hop onto its 5’ end 
almost as soon as the preceding ribosome has translated enough of the 
nucleotide sequence to move out of the way. The mRNA molecules being 
translated are therefore usually found in the form of polyribosomes, also 
known as polysomes. These large cytosolic assemblies are made up of 
many ribosomes spaced as close as 80 nucleotides apart along a single 
mRNA molecule (Figure 7-42). With multiple ribosomes working simul- 
taneously on a single mRNA, many more protein molecules can be made 
in a given time than would be possible if each polypeptide had to be com- 
pleted before the next could be started. 


Polysomes operate in both bacteria and eukaryotes, but bacteria can 
speed up the rate of protein synthesis even further. Because bacterial 
mRNA does not need to be processed and is also physically accessible to 
ribosomes while it is being synthesized, ribosomes will typically attach 
to the free end of a bacterial mRNA molecule and start translating it even 
before the transcription of that RNA is complete; these ribosomes follow 
closely behind the RNA polymerase as it moves along DNA. 


Inhibitors of Prokaryotic Protein Synthesis Are Used as 
Antibiotics 


The ability to translate mRNAs accurately into proteins is a fundamental 
feature of all life on Earth. Although the ribosome and other molecules 
that carry out this complex task are very similar among organisms, we 
have seen that there are some subtle differences in the way that bacteria 
and eukaryotes synthesize RNA and proteins. Although they represent 
a quirk of evolution, these differences form the basis of one of the most 
important advances in modern medicine. 
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Figure 7-42 Proteins are synthesized on 
polyribosomes. (A) Schematic drawing 
showing how a series of ribosomes can 
simultaneously translate the same mRNA 
molecule (Movie 7.10). (B) Electron 
micrograph of a polyribosome in the cytosol 
of a eukaryotic cell. (B, courtesy of John 
Heuser.) 
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TABLE 7-3 ANTIBIOTICS THAT INHIBIT BACTERIAL PROTEIN OR RNA 
SYNTHESIS 


Antibiotic Specific Effect 


Tetracycline blocks binding of aminoacyl-tRNA to A site of ribosome 
(step 1 in Figure 7-37) 


Streptomycin prevents the transition from initiation complex to chain 
elongation (see Figure 7-39); also causes miscoding 

Chloramphenicol blocks the peptidyl transferase reaction on ribosomes 
(step 2 in Figure 7-37) 

Cycloheximide blocks the translocation step in translation (step 3 in 
Figure 7-37) 

Rifamycin blocks initiation of transcription by binding to and 


inhibiting RNA polymerase 


Many of our most effective antibiotics are compounds that act by inhibit- 
ing bacterial, but not eukaryotic, gene expression. Some of these drugs 
exploit the small structural and functional differences between bacterial 
and eukaryotic ribosomes, so that they interfere preferentially with bacte- 
rial protein synthesis. These compounds can thus be taken in doses high 
enough to kill bacteria without being toxic to humans. Because different 
antibiotics bind to different regions of the bacterial ribosome, these drugs 
often inhibit different steps in protein synthesis. A few of the antibiotics 
that inhibit bacterial gene expression are listed in Table 7-3. 


Many common antibiotics were first isolated from fungi. Fungi and bac- 
teria often occupy the same ecological niches, and to gain a competitive 
edge, fungi have evolved, over time, potent toxins that kill bacteria but 
are harmless to themselves. Because fungi and humans are both eukary- 
otes, and are thus much more closely related to each other than either is 
to bacteria (see Figure 1-29), we have been able to borrow these weap- 
ons to combat our own bacterial foes. At the same time, bacteria have 
unfortunately evolved a resistance to many of these drugs, as we discuss 
in Chapter 9. Thus it remains a continual challenge for us to remain one 
step ahead of our microbial foes. 


Controlled Protein Breakdown Helps Regulate the 
Amount of Each Protein in a Cell 


After a protein is released from the ribosome, a cell can control its activity 
and longevity in various ways. The number of copies of a protein in a cell 
depends, like the number of organisms in a population, not only on how 
quickly new individuals arise but also on how long they survive. Proteins 
vary enormously in their lifespan. Structural proteins that become part 
of a relatively stable tissue such as bone or muscle may last for months 
or even years, whereas other proteins, such as metabolic enzymes and 
those that regulate cell growth and division (discussed in Chapter 18), 
last only for days, hours, or even seconds. But what determines the 
lifespan of a protein—and how does a protein “die”? 


Cells produce many proteins whose job it is to break other proteins down 
into their constituent amino acids (a process termed proteolysis). These 
enzymes, which degrade proteins, first to short peptides and finally to 
individual amino acids, are known collectively as proteases. Proteases 
act by cutting (hydrolyzing) the peptide bonds between amino acids (see 
Panel 2-6, pp. 76-77). One function of proteolytic pathways is to rapidly 


degrade those proteins whose lifetime must be kept short. Another is 
to recognize and remove proteins that are damaged or misfolded. 
Eliminating improperly folded proteins is critical for an organism, as mis- 
folded proteins tend to aggregate, and protein aggregates can damage 
cells and even trigger cell death. Eventually, all proteins—even long-lived 
ones—accumulate damage and are degraded by proteolysis. The amino 
acids produced by this proteolysis can then be re-used by the cell to make 
new proteins. 


In eukaryotic cells, proteins are broken down by large protein machines 
called proteasomes, present in both the cytosol and the nucleus. A pro- 
teasome contains a central cylinder formed from proteases whose active 
sites face into an inner chamber. Each end of the cylinder is plugged by 
a large protein complex formed from at least 10 types of protein subunits 
(Figure 7-43). These stoppers bind the proteins destined for degradation 
and then—using ATP hydrolysis to fuel this activity—unfold the doomed 
proteins and thread them into the inner chamber of the cylinder. Once 
the proteins are inside, proteases chop them into short peptides, which 
are then jettisoned from either end of the proteasome. Housing proteases 
inside these molecular destruction chambers makes sense, as it prevents 
the enzymes from running rampant in the cell. 


How do proteasomes select which proteins in the cell should be degraded? 
In eukaryotes, proteasomes act primarily on proteins that have been 
marked for destruction by the covalent attachment of a small protein 
called ubiquitin. Specialized enzymes tag those proteins that are destined 
for rapid degradation with a short chain of ubiquitin molecules; these 
ubiquitylated proteins are then recognized, unfolded, and fed into protea- 
somes by proteins within the stopper (Figure 7-44). 


Proteins that are meant to be short-lived often contain a short amino 
acid sequence that identifies the protein as one to be ubiquitylated and 
degraded in proteasomes. Damaged or misfolded proteins, as well as 
proteins containing oxidized or otherwise abnormal amino acids, are 
also recognized and degraded by this ubiquitin-dependent proteolytic 
system. The enzymes that add a polyubiquitin chain to such proteins 
recognize signals that become exposed on these proteins as a result of 
the misfolding or chemical damage—for example, amino acid sequences 
or conformational motifs that are typically buried and inaccessible in a 
“healthy” protein. 


There Are Many Steps Between DNA and Protein 


We have seen that many steps are required to produce a functional 
protein from the information contained in a gene. In a eukaryotic cell, 
mRNAs must be synthesized, processed, and exported to the cytosol 
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Figure 7-43 Proteins are degraded by the 
proteasome. The structures depicted here 
were determined by x-ray crystallography. 
(A) This drawing shows a cut-away view of 
the central cylinder of the proteasome, with 
the active sites of the proteases indicated 
by red dots. (B) The structure of the entire 
proteasome, in which access to the central 
cylinder (yellow) is regulated by a stopper 
(blue) at each end. (B, from P.C.A. da 
Fonseca et al., Mol. Cell 46:54-66, 2012. 
With permission from Elsevier.) 


Figure 7-44 Proteins marked by a 
polyubiquitin chain are degraded by 

the proteasome. Proteins in the stopper 

of a proteasome (blue) recognize proteins 
marked by a specific type of polyubiquitin 
chain (red). The stopper unfolds the target 
protein and threads it into the proteasome’s 
central cylinder (yellow), which is lined with 
proteases that chop the protein to pieces. 
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Figure 7-45 Many proteins require post- 
translational modifications to become 
fully functional. To be useful to the cell, a 
completed polypeptide must fold correctly 
into its three-dimensional conformation 
and then bind any required cofactors (red) 
and protein partners—all via noncovalent 
bonding. Many proteins also require one 
or more covalent modifications to become 
active—or to be recruited to specific 
membranes or organelles (not shown). 
Although phosphorylation and glycosylation 
are the most common, more than 100 types 
of covalent modifications of proteins are 
known. 


Figure 7-46 Protein production in a 
eukaryotic cell requires many steps. The 
final concentration of each protein depends 
on the rate of each step depicted. Even after 
an mRNA and its corresponding protein have 
been produced, their concentrations can be 
regulated by degradation. 


where they are translated to produce a protein. But the process does not 
end there. Proteins must then fold into the correct, three-dimensional 
shape (as we discuss in Chapter 4). Some proteins do so spontaneously, 
as they emerge from the ribosome. Most, however, require the assistance 
of chaperone proteins, which steer them along productive folding path- 
ways and prevent them from aggregating inside the cell (see Figures 4-8 
and 4-9). 


In addition to folding properly, many proteins—once they leave the ribo- 
some—require further adjustments before they are useful to the cell. As 
we discussed in Chapter 4, some proteins are covalently modified—for 
example, by phosphorylation or glycosylation. Others bind to small- 
molecule cofactors or associate with additional protein subunits. Such 
post-translational modifications are often needed for a newly synthesized 
protein to become fully functional (Figure 7-45). The final concentration 
of a protein, therefore, depends on the rate at which each of these steps— 
from DNA to mature, functional protein—is carried out (Figure 7-46). 


In principle, any one of these steps can be controlled by cells as they 
adjust the concentrations of their proteins to suit their needs. However, 
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as we will discuss thoroughly in the next chapter, the initiation of tran- 
scription is the most common point for a cell to regulate the expression 
of its genes. 


RNA AND THE ORIGINS OF LIFE 


The central dogma—that DNA makes RNA, which makes protein—pre- 
sented evolutionary biologists with a knotty puzzle: if nucleic acids are 
required to direct the synthesis of proteins, and proteins are required to 
synthesize nucleic acids, how could this system of interdependent com- 
ponents have arisen? The prevailing view is that an RNA world existed 
on Earth before cells containing DNA and proteins appeared. According 
to this hypothesis, RNA—which today serves largely as an intermediate 
between genes and proteins—both stored genetic information and cata- 
lyzed chemical reactions in primitive cells. Only later in evolutionary time 
did DNA take over as the genetic material and proteins become the major 
catalysts and structural components of cells (Figure 7-47). As we have 
seen, RNA still catalyzes several fundamental reactions in modern cells, 
including protein synthesis and RNA splicing. These ribozymes are like 
molecular fossils, holdovers from an earlier RNA world. 


Life Requires Autocatalysis 


The origin of life requires molecules that possess, if only to a small extent, 
one crucial property: the ability to catalyze reactions that lead—directly 
or indirectly—to the production of more molecules like themselves. 
Catalysts with this self-reproducing property, once they had arisen by 
chance, would divert raw materials from the production of other sub- 
stances to make more of themselves. In this way, one can envisage the 
gradual development of an increasingly complex chemical system of 
organic monomers and polymers that function together to generate more 
molecules of the same types, fueled by a supply of simple raw materi- 
als in the primitive environment on Earth. Such an autocatalytic system 
would have many of the properties we think of as characteristic of living 
matter: the system would contain a far-from-random selection of inter- 
acting molecules; it would tend to reproduce itself; it would compete with 
other systems dependent on the same raw materials; and, if deprived of 
its raw materials or maintained at a temperature that upset the balance 
of reaction rates, it would decay toward chemical equilibrium and “die.” 


But what molecules could have had such autocatalytic properties? In 
present-day living cells, the most versatile catalysts are proteins, which 
are able to adopt diverse three-dimensional forms that bristle with chem- 
ically reactive sites on their surface. However, there is no known way in 
which a protein can reproduce itself directly. RNA molecules, by contrast, 
possess properties that—at least, in principle—could be exploited to cata- 
lyze their own synthesis. 
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Figure 7-47 An RNA world may have existed before modern cells arose. 
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Figure 7-48 An RNA molecule can in 
principle guide the formation of an 
exact copy of itself. In the first step, 

the original RNA molecule acts as a 
template to produce an RNA molecule of 
complementary sequence. In the second 
step, this complementary RNA molecule 
itself acts as a template to produce an RNA 
molecule of the original sequence. Since 
each template molecule can produce many 
copies of the complementary strand, these 
reactions can result in the amplification of 
the original sequence. 
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RNA Can Store Information and Catalyze Chemical 
Reactions 


We have seen that complementary base-pairing enables one nucleic acid 
to act as a template for the formation of another. Thus a single strand of 
RNA or DNA contains the information needed to specify the sequence of a 
complementary polynucleotide, which, in turn, can specify the sequence 
of the original molecule, allowing the original nucleic acid to be repli- 
cated (Figure 7-48). Such complementary templating mechanisms lie at 
the heart of both DNA replication and transcription in modern-day cells. 


But the efficient synthesis of polynucleotides by such complementary 
templating mechanisms also requires catalysts to promote the polymeri- 
zation reaction: without catalysts, polymer formation is slow, error-prone, 
and inefficient. Today, nucleotide polymerization is catalyzed by protein 
enzymes—such as DNA and RNA polymerases. But how could this reac- 
tion be catalyzed before proteins with the appropriate catalytic ability 
existed? The beginnings of an answer were obtained in 1982, when it 
was discovered that RNA molecules themselves can act as catalysts. 


In present-day cells, RNA is synthesized as a single-stranded molecule, 
and we have seen that complementary base-pairing can occur between 
nucleotides in the same chain. This base-pairing, along with noncon- 
ventional hydrogen bonds, can cause each RNA molecule to fold up in 
a unique way that is determined by its nucleotide sequence (see Figure 
7-5). Such associations produce complex three-dimensional shapes. 


Protein enzymes are able to catalyze biochemical reactions because they 
have surfaces with unique contours and chemical properties, as we dis- 
cuss in Chapter 4. In the same way, RNA molecules, with their unique 
folded shapes, can serve as catalysts (Figure 7-49). Catalytic RNAs do not 
have the same structural and functional diversity as do protein enzymes; 
they are, after all, built from only four different subunits. Nonetheless, 
ribozymes can catalyze many types of chemical reactions. Although rela- 
tively few catalytic RNAs operate in present-day cells, they play major 
roles in some of the most fundamental steps in the expression of genetic 
information—specifically those steps where RNA molecules themselves 
are spliced or translated into protein. Additional ribozymes, with other 
catalytic capabilities, have been generated in the laboratory and selected 
for their activity in a test tube (Table 7-4). 


RNA, therefore, has all the properties required of an information-con- 
taining molecule that could also catalyze its own synthesis (Figure 7-50). 
Although self-replicating systems of RNA molecules have not been found 
in nature, scientists appear to be well on the way to constructing them 
in the laboratory. This achievement would not prove that self-replicating 
RNA molecules were essential to the origin of life on Earth, but it would 
demonstrate that such a scenario is possible. 
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TABLE 7-4 BIOCHEMICAL REACTIONS THAT CAN BE C 
RIBOZYMES 


LYZED BY 


K 


Peptide bond formation in protein ribosomal RNA ribozyme 
synthesis 
; 3! 
RNA splicing small nuclear RNAs (snRNAs), self-splicing 2 + 
RNAs 3° 
bstrat 
DNA ligation in vitro selected RNA RA G 
RNA polymerization in vitro se ected RNA BASE-PAIRING BETWEEN 
RIBOZYME AND SUBSTRATE 
RNA phosphorylation in vitro selected RNA 


RNA aminoacylation in vitro selected RNA 


RNA alkylation in vitro selected RNA 


C-C bond rotation (isomerization) in vitro selected RNA 


RNA Is Thought to Predate DNA in Evolution  sssmoccnncs 


If the evolutionary role for RNA proposed above is correct, the first cells 
on Earth would have stored their genetic information in RNA rather than 
DNA. And based on the chemical differences between these polynucleo- 
tides, it appears that RNA could indeed have arisen before DNA. Ribose 
(see Figure 7-3A), like glucose and other simple carbohydrates, is readily 
formed from formaldehyde (HCHO), which is one of the principal products 
of experiments simulating conditions on the primitive Earth. The sugar 
deoxyribose is harder to make, and in present-day cells it is produced 
from ribose in a reaction catalyzed by a protein enzyme, suggesting that 
ribose predates deoxyribose in cells. Presumably, DNA appeared on the 
scene after RNA, and then proved better suited than RNA as a perma- 
nent repository of genetic information. In particular, the deoxyribose in g 


its sugar-phosphate backbone makes chains of DNA chemically much " — 


| PRODUCT RELEASE 


more stable than chains of RNA, so that DNA can grow to greater lengths + 


without breakage. 
ribozyme 


The other differences between RNA and DNA—the double-helical struc- 
ture of DNA and the use of thymine rather than uracil—further enhance 
DNA stability by making the molecule easier to repair. We saw in Chapter 
6 that a damaged nucleotide on one strand of the double helix can be 
repaired by using the other strand as a template. Furthermore, deamina- 
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Figure 7-49 A ribozyme is an RNA 


molecule that possesses catalytic activity. 


The RNA molecule shown catalyzes the 


tion, one of the most common detrimental chemical changes occurring cleavage ofa second RNA ata specific 


site. Such ribozymes are found embedded 
in large RNA genomes—called viroids— 
that infect plants, where the cleavage 
reaction is one step in the replication of the 
viroid. (Adapted from T.R. Cech and O.C. 
Uhlenbeck, Nature 372:39-40, 1994. With 
permission from Macmillan Publishers Ltd.) 
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Figure 7-50 Could an RNA molecule catalyze 
its own synthesis? The process would require that 
the RNA catalyze the self-templated amplification 
steps shown in Figure 7-48. The red rays represent 
the active site of this hypothetical ribozyme. 
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Figure 7-51 RNA may have preceded 
DNA and proteins in evolution. According 
to this hypothesis, RNA molecules provided 
genetic, structural, and catalytic functions in 
the earliest cells. DNA is now the repository 
of genetic information, and proteins carry 
out almost all catalysis in cells. RNA now 
functions mainly as a go-between in protein 
synthesis, while remaining a catalyst for 

a few crucial reactions (including protein 
synthesis). 


QUESTION 7—6 


Discuss the following: “During the 
evolution of life on Earth, RNA lost 
its glorious position as the first self- 
replicating catalyst. Its role now is as 
a mere messenger in the information 
flow from DNA to protein.” 
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in polynucleotides, is easier to detect and repair in DNA than in RNA (see 
Figure 6-24). This is because the product of the deamination of cytosine 
is, by chance, uracil, which already exists in RNA, so that such damage 
would be impossible for repair enzymes to detect in an RNA molecule. 
However, in DNA, which has thymine rather than uracil, any uracil pro- 
duced by the accidental deamination of cytosine is easily detected and 
repaired. 


Taken together, the evidence we have discussed supports the idea that 
RNA—with its ability to provide genetic, structural, and catalytic func- 
tions—preceded DNA in evolution. As cells more closely resembling 
present-day cells appeared, it is believed that RNAs were relieved of many 
of the duties they had originally performed: DNA took over the primary 
storage of genetic information, and proteins became the major catalysts, 
while RNA remained primarily as the intermediary connecting the two 
(Figure 7-51). With the rise of DNA, cells were able to become more com- 
plex, for they could then carry and transmit more genetic information 
than could be stably maintained by RNA alone. Because of the greater 
chemical complexity of proteins and the variety of chemical reactions 
they can catalyze, the shift from RNA to proteins (albeit incomplete) also 
provided a much richer source of structural components and enzymes, 
enabling cells to evolve the great diversity of appearance and function 
that we see today. 


ESSENTIAL CONCEPTS 


e The flow of genetic information in all living cells is DNA — RNA — 
protein. The conversion of the genetic instructions in DNA into RNAs 
and proteins is termed gene expression. 


e To express the genetic information carried in DNA, the nucleotide 
sequence of a gene is first transcribed into RNA. Transcription is 
catalyzed by the enzyme RNA polymerase, which uses nucleotide 
sequences in the DNA molecule to determine which strand to use as 
a template, and where to start and stop transcribing. 


e RNA differs in several respects from DNA. It contains the sugar ribose 
instead of deoxyribose and the base uracil (U) instead of thymine (T). 
RNAs in cells are synthesized as single-stranded molecules, which 
often fold up into complex three-dimensional shapes. 


e Cells make several functional types of RNAs, including messenger 
RNAs (mRNAs), which carry the instructions for making proteins; 
ribosomal RNAs (rRNAs), which are the crucial components of ribo- 
somes; and transfer RNAs (tRNAS), which act as adaptor molecules in 
protein synthesis. 


e To begin transcription, RNA polymerase binds to specific DNA sites 
called promoters that lie immediately upstream of genes. To initiate 
transcription, eukaryotic RNA polymerases require the assembly of 
a complex of general transcription factors at the promoter, whereas 
bacterial RNA polymerase requires only an additional subunit, called 
sigma factor. 


e Most protein-coding genes in eukaryotic cells are composed of a 
number of coding regions, called exons, interspersed with larger, 
noncoding regions, called introns. When a eukaryotic gene is tran- 
scribed from DNA into RNA, both the exons and introns are copied. 


e Introns are removed from the RNA transcripts in the nucleus by RNA 
splicing, a reaction catalyzed by small ribonucleoprotein complexes 
known as snRNPs. Splicing removes the introns from the RNA and 
joins together the exons—often in a variety of combinations, allowing 
multiple proteins to be produced from the same gene. 
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e Eukaryotic pre-mRNAs go through several additional RNA process- 
ing steps before they leave the nucleus as mRNAs, including 5’ RNA 
capping and 3’ polyadenylation. These reactions, along with splicing, 
take place as the pre-mRNA is being transcribed. 


e Translation of the nucleotide sequence of an mRNA into a protein 
takes place in the cytoplasm on large ribonucleoprotein assemblies 
called ribosomes. As the mRNA moves through the ribosome, its 
message is translated into protein. 


e The nucleotide sequence in MRNA is read in consecutive sets of three 
nucleotides called codons; each codon corresponds to one amino 
acid. 


e The correspondence between amino acids and codons is specified 
by the genetic code. The possible combinations of the 4 different 
nucleotides in RNA give 64 different codons in the genetic code. Most 
amino acids are specified by more than one codon. 


e tRNAs act as adaptor molecules in protein synthesis. Enzymes called 
aminoacyl-tRNA synthetases covalently link amino acids to their 
appropriate tRNAs. Each tRNA contains a sequence of three nucleo- 
tides, the anticodon, which recognizes a codon in an mRNA through 
complementary base-pairing. 


e Protein synthesis begins when a ribosome assembles at an initia- 
tion codon (AUG) in an mRNA molecule, a process that depends on 
proteins called translation initiation factors. The completed protein 
chain is released from the ribosome when a stop codon (UAA, UAG, 
or UGA) in the mRNA is reached. 


e The stepwise linking of amino acids into a polypeptide chain is cata- 
lyzed by an rRNA molecule in the large ribosomal subunit, which thus 
acts as a ribozyme. 


e The concentration of a protein in a cell depends on the rates at 
which the mRNA and protein are synthesized and degraded. Protein 
degradation in the cytosol and nucleus occurs inside large protein 
complexes called proteasomes. 


e From our knowledge of present-day organisms and the molecules 
they contain, it seems likely that life on Earth began with the evolu- 
tion of RNA molecules that could catalyze their own replication. 


e It has been proposed that RNA served as both the genome and the 
catalysts in the first cells, before DNA replaced RNA as a more stable 
molecule for storing genetic information, and proteins replaced RNAs 
as the major catalytic and structural components. RNA catalysts in 
modern cells are thought to provide a glimpse into an ancient, RNA- 
based world. 


KEY TERMS 


alternative splicing messenger RNA (mRNA) RNA polymerase 
aminoacyl-tRNA synthetase polyadenylation RNA processing 

anticodon promoter RNA splicing 

codon protease RNA transcript 

exon proteasome RNA world 

gene reading frame small nuclear RNA (snRNA) 
gene expression ribosomal RNA (rRNA) spliceosome 

general transcription factors ribosome transcription 

genetic code ribozyme transfer RNA (tRNA) 
initiator tRNA RNA translation 


intron RNA capping translation initiation factor 
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QUESTION 7-7 


Which of the following statements are correct? Explain your 
answers. 


A. An individual ribosome can make only one type of 
protein. 


B. All mRNAs fold into particular three-dimensional 
structures that are required for their translation. 


C. The large and small subunits of an individual ribosome 
always stay together and never exchange partners. 


D. Ribosomes are cytoplasmic organelles that are 
encapsulated by a single membrane. 


E. Because the two strands of DNA are complementary, 
the mRNA of a given gene can be synthesized using either 
strand as a template. 


F. An mRNA may contain the sequence 
ATTGACCCCGGTCAA. 


G. The amount of a protein present in a cell depends on 
its rate of synthesis, its catalytic activity, and its rate of 
degradation. 


QUESTION 7-8 


The Lacheinmal protein is a hypothetical protein that causes 
people to smile more often. It is inactive in many chronically 
unhappy people. The mRNA isolated from a number of 
different unhappy individuals in the same family was found 
to lack an internal stretch of 173 nucleotides that is present 
in the Lacheinmal mRNA isolated from happy members of 
the same family. The DNA sequences of the Lacheinmal 
genes from the happy and unhappy family members were 
determined and compared. They differed by a single 
nucleotide substitution, which lay in an intron. What can you 
say about the molecular basis of unhappiness in this family? 

(Hints: [1] Can you hypothesize a molecular mechanism by 
which a single nucleotide substitution in a gene could cause 
the observed deletion in the mRNA? Note that the deletion 
is internal to the mRNA. [2] Assuming the 173-base-pair 
deletion removes coding sequences from the Lacheinmal 
mRNA, how would the Lacheinmal protein differ between 
the happy and unhappy people?) 


QUESTION 7-9 


Use the genetic code shown in Figure 7-27 to identify which 
of the following nucleotide sequences would code for the 
polypeptide sequence arginine-glycine-aspartate: 


1. 5'-AGA-GGA-GAU-3’ 
2. 5'-ACA-CCC-ACU-3’ 
3. 5'-GGG-AAA-UUU-3’ 
4. 5'-CGG-GGU-GAC-3’ 


QUESTION 7-10 


“The bonds that form between the anticodon of a tRNA 
molecule and the three nucleotides of a codon in mRNA 
are ." Complete this sentence with each of the 
following options and explain whether each of the resulting 
statements is correct or incorrect. 
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A. covalent bonds formed by GTP hydrolysis 


B. hydrogen bonds that form when the tRNA is at the A 
site 


C. broken by the translocation of the ribosome along the 
mRNA 


QUESTION 7-11 


List the ordinary, dictionary definitions of the terms 
replication, transcription, and translation. By their side, list 
the special meaning each term has when applied to the 
living cell. 


QUESTION 7-12 


In an alien world, the genetic code is written in pairs of 
nucleotides. How many amino acids could such a code 
specify? In a different world, a triplet code is used, but the 
order of nucleotides is not important; it only matters which 
nucleotides are present. How many amino acids could this 
code specify? Would you expect to encounter any problems 
translating these codes? 


QUESTION 7-13 


One remarkable feature of the genetic code is that amino 
acids with similar chemical properties often have similar 
codons. Thus codons with U or C as the second nucleotide 
tend to specify hydrophobic amino acids. Can you suggest 
a possible explanation for this phenomenon in terms of the 
early evolution of the protein-synthesis machinery? 


QUESTION 7-14 


A mutation in DNA generates a UGA stop codon in the 
middle of the mRNA coding for a particular protein. 

A second mutation in the cell’s DNA leads to a single 
nucleotide change in a tRNA that allows the correct 
translation of this protein; that is, the second mutation 
“suppresses” the defect caused by the first. The altered 
tRNA translates the UGA as tryptophan. What nucleotide 
change has probably occurred in the mutant tRNA 
molecule? What consequences would the presence of such 
a mutant tRNA have for the translation of the normal genes 
in this cell? 


QUESTION 7-15 


The charging of a tRNA with an amino acid can be 
represented by the following equation: 


amino acid + tRNA + ATP > aminoacyl-tRNA + AMP + PP; 


where PP; is pyrophosphate (see Figure 3—41). In 

the aminoacyl-tRNA, the amino acid and tRNA are linked 
with a high-energy covalent bond; a large portion of the 
energy derived from the hydrolysis of ATP is thus stored in 
this bond and is available to drive peptide bond formation 
during the later stages of protein synthesis. The free-energy 
change of the charging reaction shown in the equation is 
close to zero and therefore would not be expected to 
favor attachment of the amino acid to tRNA. Can you 
suggest a further step that could drive the reaction to 
completion? 


QUESTION 7-16 


A. The average molecular weight of a protein in the cell is 
about 30,000 daltons. A few proteins, however, are much 
larger. The largest known polypeptide chain made by any 
cell is a protein called titin (made by mammalian muscle 
cells), and it has a molecular weight of 3,000,000 daltons. 
Estimate how long it will take a muscle cell to translate 

an mRNA coding for titin (assume the average molecular 
weight of an amino acid to be 120, and a translation rate of 
two amino acids per second for eukaryotic cells). 


B. Protein synthesis is very accurate: for every 10,000 
amino acids joined together, only one mistake is made. 
What is the fraction of average-sized protein molecules and 
of titin molecules that are synthesized without any errors? 
[Hint: the probability P of obtaining an error-free protein is 
given by P = (1 - E)”, where E is the error frequency and n 
the number of amino acids.] 


C. The combined molecular weight of the eukaryotic 
ribosomal proteins is about 2.5 x 10° daltons. Would it be 
advantageous to synthesize them as a single protein? 


D. Transcription occurs at a rate of about 30 nucleotides 
per second. Is it possible to calculate the time required to 
synthesize a titin mRNA from the information given here? 


QUESTION 7-17 


Which of the following types of mutations would be 
predicted to harm an organism? Explain your answers. 


A. Insertion of a single nucleotide near the end of the 
coding sequence. 


B. Removal of a single nucleotide near the beginning of the 
coding sequence. 


C. Deletion of three consecutive nucleotides in the middle 
of the coding sequence. 


D. Deletion of four consecutive nucleotides in the middle of 
the coding sequence. 


E. Substitution of one nucleotide for another in the middle 
of the coding sequence. 


QUESTION 7-18 


Figure 7—8 shows many molecules of RNA polymerase 
simultaneously transcribing two adjacent genes on a single 
DNA molecule. Looking at this figure, label the 5’ and 3’ 
ends of the DNA template strand and the sets of RNA 
molecules being transcribed. 


Questions 
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An organism's DNA encodes all of the RNA and protein molecules that 
are needed to make its cells. Yet a complete description of the DNA 
sequence of an organism—be it the few million nucleotides of a bacte- 
rium or the few billion nucleotides in each human cell—does not enable 
us to reconstruct that organism any more than a list of all the English 
words in a dictionary enables us to reconstruct a Shakespeare play. We 
need to know how the elements in the DNA sequence or the words on a 
list work together to produce the masterpiece. 


For cells, the answer comes down to gene expression. Even the sim- 
plest single-celled bacterium can use its genes selectively—for example, 
switching genes on and off to make the enzymes needed to digest what- 
ever food sources are available. In multicellular plants and animals, 
gene expression is even more elaborate. Over the course of embryonic 
development, a fertilized egg cell gives rise to many cell types that dif- 
fer dramatically in both structure and function. The differences between 
an information-processing nerve cell and toxin-neutralizing liver cell, for 
example, are so extreme that it is difficult to imagine that the two cells 
contain the same DNA ( ). For this reason, and because cells in 
an adult organism rarely lose their distinctive characteristics, biologists 
originally suspected that certain genes might be selectively eliminated 
from cells as they become specialized. We now know, however, that 
nearly all the cells of a multicellular organism contain the same genome. 
Cell differentiation is instead achieved by changes in gene expression. 


In mammals, hundreds of different cell types carry out a range of spe- 
cialized functions that depend upon genes that are switched on in that 
cell type but not in most others: for example, the p cells of the pancreas 
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Figure 8-1 A neuron and a liver cell share the same genome. 

The long branches of this neuron from the retina enable it to receive 
electrical signals from numerous other neurons and pass these signals 
along to many neighboring neurons. The liver cell, which is drawn to 
the same scale, is involved in many metabolic processes, including 
digestion and the detoxification of alcohol and other drugs. Both of 
these mammalian cells contain the same genome, but they express 
different RNAs and proteins. (Neuron adapted from S. Ramón y Cajal, 
Histologie du Système Nerveux de l'Homme et de Vertébrés, 1909- 
1911. Paris: Maloine; reprinted, Madrid: C.S.I.C., 1972.) 


make the protein hormone insulin, while the a cells of the pancreas make 
the hormone glucagon; the B lymphocytes of the immune system make 
antibodies, while developing red blood cells make the oxygen-transport 
protein hemoglobin. The differences between a neuron, a white blood 
cell, a pancreatic B cell, and a red blood cell depend on the precise con- 
trol of gene expression. A typical differentiated cell expresses only about 
half the genes in its total repertoire. This selection, which differs from one 
cell type to the next, is the basis for the specialized properties of each cell 
type. 

In this chapter, we discuss the main ways in which gene expression is 
regulated, with a focus on those genes that encode proteins as their 
final product. Although some of these control mechanisms apply to both 
eukaryotes and prokaryotes, eukaryotic cells—with their larger number 
of genes and more complex chromosomes—have some additional ways 
of controlling gene expression that are not found in bacteria. 


AN OVERVIEW OF GENE EXPRESSION 


Gene expression is a complex process by which cells selectively direct 
the synthesis of the many thousands of proteins and RNAs encoded in 
their genome. But how do cells coordinate and control such an intri- 
cate process—and how does an individual cell specify which of its genes 
to express? This decision is an especially important problem for ani- 
mals because, as they develop, their cells become highly specialized, 
ultimately producing an array of muscle, nerve, and blood cells, along 
with the hundreds of other cell types seen in the adult. Such cell 
differentiation arises because cells make and accumulate different sets 
of RNA and protein molecules: that is, they express different genes. 


The Different Cell Types of a Multicellular Organism 
Contain the Same DNA 


The evidence that cells have the ability to change which genes they 
express without altering the nucleotide sequence of their DNA comes 
from experiments in which the genome from a differentiated cell is made 
to direct the development of a complete organism. If the chromosomes of 
the differentiated cell were altered irreversibly during development—for 
example, by jettisoning some of their genes—they would not be able to 
accomplish this feat. 


Consider, for example, an experiment in which the nucleus is taken from 
a skin cell in an adult frog and injected into a frog egg from which the 
nucleus has been removed. In at least some cases, that doctored egg 
will develop into a normal tadpole (Figure 8-2). Thus, the nucleus from 
the transplanted skin cell cannot have lost any critical DNA sequences. 
Nuclear transplantation experiments carried out with differentiated cells 
taken from adult mammals—including sheep, cows, pigs, goats, and 
mice—have shown similar results. And in plants, individual cells removed 
from a carrot, for example, can regenerate an entire adult carrot plant. 


An Overview of Gene Expression 269 


(A) 
nucleus in 
pipette 
skin cells in N 
adult frog culture dish 
UV SAR 
yee nucleus normal embryo 
—_> injected 
into egg 
unfertilized frog egg nucleus destroyed 
by UV light 
(B) 
= == _ 
m 
section proliferating separated single clone of young young carrot 
of carrot cell mass cells in rich cell dividing embryo plant 
liquid cells 
medium 
DONOR CELL 
© PLACED NEXT TO 


ENUCLEATED EGG 


— © 
OPO) 
AY a 


— | 


ELECTRIC 


wnat PULSE CAUSES = 
á DONOR CELL reconstructed embryo embryo placed in calf 
TO FUSE WITH zygote foster mother 
— —= 


ENUCLEATED 
EGG CELL 
unfertilized MEIOTIC SPINDLE 
egg cell AND ASSOCIATED 
CHROMOSOMES 
REMOVED 


Figure 8-2 Differentiated cells contain all the genetic instructions needed to direct the formation of a 
complete organism. (A) The nucleus of a skin cell from an adult frog transplanted into an “enucleated” egg—one 
whose nucleus has been destroyed—can give rise to an entire tadpole. The broken arrow indicates that to give the 
transplanted genome time to adjust to an embryonic environment, a further transfer step is required in which one of 
the nuclei is taken from the early embryo that begins to develop and is put back into a second enucleated egg. (B) In 
many types of plants, differentiated cells retain the ability to “de-differentiate,” so that a single cell can proliferate to 
form a clone of progeny cells that later give rise to an entire plant. (C) A nucleus removed from a differentiated cell 
of an adult cow can be introduced into an enucleated egg from a different cow to give rise to a calf. Different calves 
produced from the same differentiated cell donor are all clones of the donor and are therefore genetically identical. 
The cloned sheep Dolly was produced by this type of nuclear transplantation. (A, modified from J.B. Gurdon, Sci. 
Am. 219:24-35, 1968.) 


These experiments all demonstrate that the DNA in specialized cell types 
of multicellular organisms still contains the entire set of instructions 
needed to form a whole organism. The various cell types of an organism 
therefore differ not because they contain different genes, but because 
they express them differently. 


Different Cell Types Produce Different Sets of Proteins 


The extent of the differences in gene expression between different 
cell types may be roughly gauged by comparing the protein composi- 
tion of cells in liver, heart, brain, and so on. In the past, such analysis 
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was performed by two-dimensional gel electrophoresis (see Panel 4-5, 
p. 167). Nowadays, the total protein content of a cell can be rapidly 
analyzed by a method called mass spectrometry (see Figure 4-56). This 
technique is much more sensitive than electrophoresis and it enables the 
detection of proteins that are produced even in minor quantities. 


Both techniques reveal that many proteins are common to all the cells 
of a multicellular organism. These housekeeping proteins include, for 
example, RNA polymerases, DNA repair enzymes, ribosomal proteins, 
enzymes involved in glycolysis and other basic metabolic processes, and 
many of the proteins that form the cytoskeleton. In addition, each differ- 
ent cell type also produces specialized proteins that are responsible for 
the cell's distinctive properties. In mammals, for example, hemoglobin is 
made almost exclusively in developing red blood cells. 


Gene expression can also be studied by cataloging a cell’s RNA molecules, 
including the mRNAs that encode protein. The most comprehensive 
methods for such analyses involve determining the nucleotide sequence 
of all RNAs made by the cell, an approach that can also reveal the rela- 
tive abundance of each. Estimates of the number of different mRNA 
sequences in human cells suggest that, at any one time, a typical dif- 
ferentiated human cell expresses perhaps 5000-15,000 protein-coding 
genes from a total of about 19,000. And studies of a variety of tissue 
types confirm that the collection of expressed mRNAs differs from one 
cell type to the next. 


A Cell Can Change the Expression of Its Genes in 
Response to External Signals 


Although each cell type in a multicellular organism expresses its own 
group of genes, these collections are not static. Specialized cells are 
capable of altering their patterns of gene expression in response to 
extracellular cues. For example, if a liver cell is exposed to the steroid 
hormone cortisol, the production of several proteins is dramatically 
increased. Released by the adrenal gland during periods of starvation, 
intense exercise, or prolonged stress, cortisol signals liver cells to boost 
the production of glucose from amino acids and other small molecules. 
The set of proteins whose production is induced by cortisol includes 
enzymes such as tyrosine aminotransferase, which helps convert tyros- 
ine to glucose. When the hormone is no longer present, the production of 
these proteins returns to its resting level. 


Other cell types respond to cortisol differently. In fat cells, for example, 
the production of tyrosine aminotransferase is reduced; some other cell 
types do not respond to cortisol at all. The fact that different cell types 
often respond in different ways to the same extracellular signal contrib- 
utes to the specialization that gives each cell type its distinctive character. 


Gene Expression Can Be Regulated at Various Steps 
from DNA to RNA to Protein 


If differences among the various cell types of an organism depend on 
the particular genes that each cell expresses, at what level is this control 
of gene expression exercised? As we discussed in the previous chapter, 
there are many steps in the pathway leading from DNA to protein, and 
each of them can in principle be regulated. Thus a cell can control the 
proteins it contains by (1) controlling when and how often a given gene is 
transcribed, (2) controlling how an RNA transcript is spliced or otherwise 
processed, (3) selecting which mRNAs are exported from the nucleus 
to the cytosol, (4) regulating how quickly certain mRNA molecules are 
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degraded, (5) selecting which mRNAs are translated into protein by ribo- 
somes, or (6) regulating how rapidly specific proteins are destroyed after 
they have been made; in addition, the activity of individual proteins, once 
they have been synthesized, can be further regulated in a variety of ways. 


In eukaryotic cells, gene expression can be regulated at each of these 
steps (Figure 8-3). For most genes, however, the control of transcription 
(shown in step 1) is paramount. This makes sense because only tran- 
scriptional control can ensure that no unnecessary intermediates are 
synthesized. Thus it is the regulation of transcription—and the DNA and 
protein components that determine which genes a cell transcribes into 
RNA—that we address first. 


HOW TRANSCRIPTION IS REGULATED 


Until 50 years ago, the idea that genes could be switched on and off was 
revolutionary. This concept was a major advance, and it came originally 
from studies of how E. coli bacteria adapt to changes in the composition 
of their growth medium. Many of the same principles apply to eukaryotic 
cells. However, the enormous complexity of gene regulation in organ- 
isms that possess a nucleus, combined with the packaging of their DNA 
into chromatin, creates special challenges and some novel opportunities 
for control—as we will see. We begin with a discussion of the transcrip- 
tion regulators (often loosely referred to as transcription factors), proteins 
that bind to specific DNA sequences and control gene transcription. 


Transcription Regulators Bind to Regulatory DNA 
Sequences 


Nearly all genes, whether bacterial or eukaryotic, contain sequences 
that direct and control their transcription. In Chapter 7, we saw that the 
promoter region of a gene binds the enzyme RNA polymerase and cor- 
rectly orients the enzyme to begin its task of making an RNA copy of 
the gene. The promoters of both bacterial and eukaryotic genes include 
a transcription initiation site, where RNA synthesis begins, plus nearby 
sequences that contain recognition sites for proteins that associate with 
RNA polymerase: sigma factor in bacteria (see Figure 7-9) or the general 
transcription factors in eukaryotes (see Figure 7-12). 


In addition to the promoter, the vast majority of genes include regula- 
tory DNA sequences that are used to switch the gene on or off. Some 
regulatory DNA sequences are as short as 10 nucleotide pairs and act 
as simple switches that respond to a single signal; such simple regula- 
tory switches predominate in bacteria. Other regulatory DNA sequences, 
especially those in eukaryotes, are very long (sometimes spanning more 
than 100,000 nucleotide pairs) and act as molecular microprocessors, 
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Figure 8-3 Gene expression in eukaryotic 
cells can be controlled at various steps. 
Examples of regulation at each of these 
steps are known, although for most 

genes the main site of control is step 1: 
transcription of a DNA sequence into RNA. 
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Figure 8-4 A transcription regulator interacts with the DNA double helix. (A) The regulator shown recognizes 
DNA via three a helices, drawn as numbered cylinders, which allow the protein to fit into the major groove and 

form tight associations with the base pairs in a short stretch of DNA. This particular structural motif, called a 
homeodomain, is found in many eukaryotic DNA-binding proteins (Movie 8.1). (B) Most of the contacts with the 
DNA bases are made by helix 3 (red), which is shown here end-on. (C) An asparagine side chain from helix 3 forms 
two hydrogen bonds with the adenine in an A-T base pair. The view is end-on, looking down the center of the DNA 
double helix, and the protein contacts the base pair from the major-groove side. Note that the interactions between 
the protein and DNA take place along the edges of the nucleotide base and do not disrupt the hydrogen bonds 
that hold the base pairs together. For simplicity, only one amino acid-base contact is shown; in reality, transcription 
regulators form hydrogen bonds (as shown here), ionic bonds, and hydrophobic interactions with multiple bases. 
Most of these contacts occur in the major groove, but some proteins also interact with bases in the minor groove, 
as shown in (B). Typically, the protein-DNA interface would consist of 10-20 such contacts, each involving a different 
amino acid and each contributing to the overall strength of the protein-DNA interaction. 


integrating information from a variety of signals into a command that 
determines how often transcription of the gene is initiated. 


Regulatory DNA sequences do not work by themselves. To have any 
effect, these sequences must be recognized by proteins called transcrip- 
tion regulators. It is the binding of a transcription regulator to a regula- 
tory DNA sequence that acts as the switch to control transcription. The 
simplest bacterium produces several hundred different transcription reg- 
ulators, each of which recognizes a different DNA sequence and thereby 
regulates a distinct set of genes. Humans make many more—2000 or so— 
indicating the importance and complexity of this form of gene regulation 
in the development and function of a complex organism. 


Proteins that recognize a specific nucleotide sequence do so because 
the surface of the protein fits tightly against the surface features of the 
DNA double helix in that region. Because these surface features will vary 
depending on the nucleotide sequence, different DNA-binding proteins 
will recognize different nucleotide sequences. In most cases, the pro- 
tein inserts into the major groove of the DNA double helix and makes a 
series of intimate, noncovalent molecular contacts with the nucleotide 
pairs within the groove (Figure 8-4, Movie 8.2). Although each individual 
contact is weak, the 10 to 20 contacts that typically form at the protein- 
DNA interface combine to ensure that the interaction is both highly spe- 
cific and very strong; indeed, protein-DNA interactions are among the 
tightest and most specific molecular interactions known in biology. 


Many transcription regulators bind to the DNA helix as dimers. Such 
dimerization roughly doubles the area of contact with the DNA, thereby 
greatly increasing the potential strength and specificity of the protein- 
DNA interaction (Figure 8-5, Movie 8.3). 
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Transcription Switches Allow Cells to Respond to 
Changes in Their Environment 


The simplest and best-understood examples of gene regulation occur in 
bacteria. The genome of the bacterium E. coli consists of a single, circu- 
lar DNA molecule of about 4.6 x 106 nucleotide pairs. This DNA encodes 
approximately 4300 proteins, although only a fraction of these are made 
at any one time. Bacteria regulate the expression of many of their genes 
according to the food sources that are available in the environment. In 
E. coli, for example, five genes code for enzymes that manufacture tryp- 
tophan when this amino acid is scarce. These genes are arranged in a 
cluster on the chromosome and are transcribed from a single promoter 
as one long MRNA molecule; such coordinately transcribed clusters are 
called operons (Figure 8—6). Although operons are common in bacteria 
(see Figure 7-40), they are rare in eukaryotes, where genes are tran- 
scribed and regulated individually. 


When tryptophan concentrations are low, the operon is transcribed; 
the resulting mRNA is translated to produce a full set of biosynthetic 
enzymes, which work in tandem to synthesize the amino acid. When 
tryptophan is abundant, however—for example, when the bacterium is in 
the gut of a mammal that has just eaten a protein-rich meal—the amino 
acid is imported into the cell and shuts down production of the enzymes, 
which are no longer needed. 


We understand in considerable detail how this repression of the trypto- 
phan operon comes about. Within the operon’s promoter is a short DNA 
sequence, called the operator (see Figure 8-6), that is recognized by a 
transcription regulator. When this regulator binds to the operator, it blocks 
access of RNA polymerase to the promoter, thus preventing transcription 
of the operon and, ultimately, the production of the tryptophan-synthe- 
sizing enzymes. The transcription regulator is known as the tryptophan 
repressor, and it is controlled in an ingenious way: the repressor can bind 
to DNA only if it is also bound to tryptophan (Figure 8-7). 


The tryptophan repressor is an allosteric protein (see Figure 4-44): the 
binding of tryptophan causes a subtle change in its three-dimensional 
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Figure 8-5 Many transcription regulators 
bind to DNA as dimers. (A) As shown, 
such dimerization doubles the number 

of protein-DNA contacts. Here, and 
throughout the book, regulatory sequences 
are represented by colored bars; each bar 
represents a double-helical segment of 
DNA, as in Figure 8—4. (B) Shown here is a 
regulatory sequence recognized by Nanog, 
a homeodomain family member that is a 
key regulator in embryonic stem cells. This 
diagram, called a “logo,” represents the 
preferred nucleotide at each position of 
the sequence; the height of each letter is 
proportional to the frequency with which 
this base is found at that position in the 
regulatory sequence. In the first position, 
for example, T is found more often than C, 
while A is the only nucleotide found in the 
second and third position of the sequence. 
Although regulatory sequences in the cell 
are double-stranded, a logo typically shows 
the sequence of only one DNA strand; the 
other strand is simply the complementary 
sequence. Logos are useful because 

they reveal at a glance the range of DNA 
sequences to which a given transcription 
regulator will bind. 


Figure 8-6 A cluster of bacterial 

genes can be transcribed from a single 
promoter. Each of these five genes encodes 
a different enzyme; all of the enzymes 

are needed to synthesize the amino acid 
tryptophan from simpler molecular building 
blocks. The genes are transcribed as a 
single mRNA molecule, a feature that 
allows their expression to be coordinated. 
Such clusters of genes, called operons, are 
common in bacteria. In this case, the entire 
operon is controlled by a single regulatory 
DNA sequence, called the Trp operator 
(green), situated within the promoter. The 
yellow blocks in the promoter represent 
DNA sequences that bind RNA polymerase. 
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Figure 8—7 Genes can be switched off by repressor proteins. If the concentration of tryptophan inside a 
bacterium is low (left), RNA polymerase (blue) binds to the promoter and transcribes the five genes of the tryptophan 
operon. However, if the concentration of tryptophan is high (right), the repressor protein (dark green) becomes active 
and binds to the operator (light green), where it blocks the binding of RNA polymerase to the promoter. Whenever 
the concentration of intracellular tryptophan drops, the repressor falls off the DNA, allowing the polymerase to 

again transcribe the operon. The promoter contains two key blocks of DNA sequence information, the -35 and -10 
regions, highlighted in yellow, which are recognized by RNA polymerase (see Figure 7-10). The complete operon is 


shown in Figure 8-6. 


Figure 8-8 Genes can be switched on by 
activator proteins. An activator protein 
binds to a regulatory sequence on the DNA 
and then interacts with the RNA polymerase 
to help it initiate transcription. Without 

the activator, the promoter fails to initiate 
transcription efficiently. In bacteria, the 
binding of the activator to DNA is often 
controlled by the interaction of a metabolite 
or other small molecule (red circle) with the 
activator protein. 


structure so that the protein can bind to the operator sequence. When 
the concentration of free tryptophan in the bacterium drops, the repres- 
sor no longer binds to DNA, and the tryptophan operon is transcribed. 
The repressor is thus a simple device that switches production of a set of 
biosynthetic enzymes on and off according to the availability of trypto- 
phan—a form of feedback inhibition (see Figure 4-42). 


The tryptophan repressor protein itself is always present in the cell. The 
gene that encodes it is continuously transcribed at a low level, so that a 
small amount of the repressor protein is always being made. Thus the 
bacterium can respond very rapidly to increases and decreases in trypto- 
phan concentration. 


Repressors Turn Genes Off and Activators Turn Them On 


The tryptophan repressor, as its name suggests, is a transcriptional 
repressor protein: in its active form, it switches genes off, or represses 
them. Some bacterial transcription regulators do the opposite: they 
switch genes on, or dctivate them. These transcriptional activator 
proteins work on promoters that—in contrast to the promoter for the 
tryptophan operon—are only marginally able to bind and position RNA 
polymerase on their own. These inefficient promoters can be made fully 
functional by activator proteins that bind to a nearby regulatory sequence 
and make contact with the RNA polymerase, helping it to initiate tran- 
scription (Figure 8-8). 
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Like the tryptophan repressor, activator proteins often have to interact 
with a second molecule to be able to bind DNA. For example, the bacte- 
rial activator protein CAP has to bind cyclic AMP (cAMP) before it can 
bind to DNA (see Figure 4-20). Genes activated by CAP are switched on 
in response to an increase in intracellular cAMP concentration, which 
occurs when glucose, the bacterium’s preferred carbon source, is no 
longer available; as a result, CAP drives the production of enzymes that 
allow the bacterium to digest other sugars. 


The Lac Operon Is Controlled by an Activator and a 
Repressor 


In many instances, the activity of a single promoter is controlled by two 
different transcription regulators. The Lac operon in E. coli, for example, 
is controlled by both the Lac repressor and the CAP activator that we 
just discussed. The Lac operon encodes proteins required to import and 
digest the disaccharide lactose. In the absence of glucose, the bacterium 
makes CAMP, which activates CAP to switch on genes that allow the cell 
to utilize alternative sources of carbon—including lactose. It would be 
wasteful, however, for CAP to induce expression of the Lac operon if lac- 
tose itself were not present. Thus the Lac repressor shuts off the operon 
in the absence of lactose. This arrangement enables the control region 
of the Lac operon to integrate two different signals, so that the operon 
is highly expressed only when two conditions are met: glucose must be 
absent and lactose must be present (Figure 8-9). This circuit thus behaves 
much like a switch that carries out a logic operation in a computer. When 
lactose is present AND glucose is absent, the cell executes the appro- 
priate program—in this case, transcription of the genes that permit the 
uptake and utilization of lactose. None of the other combinations of con- 
ditions produce this result. 


The elegant logic of the Lac operon first attracted the attention of biolo- 
gists more than 50 years ago. The molecular basis of the switch in E. coli 
was uncovered by a combination of genetics and biochemistry, provid- 
ing the first insight into how transcription is controlled. In a eukaryotic 
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QUESTION 8-1 


Bacterial cells can take up the 
amino acid tryptophan (Trp) from 
their surroundings or, if there is an 
insufficient external supply, they can 
synthesize tryptophan from other 
small molecules. The Trp repressor is 
a transcription regulator that shuts 
off the transcription of genes that 
code for the enzymes required for 
the synthesis of tryptophan (see 
Figure 8-7). 

A. What would happen to the 
regulation of the tryptophan operon 
in cells that express a mutant form 
of the tryptophan repressor that 

(1) cannot bind to DNA, (2) cannot 
bind tryptophan, or (3) binds 

to DNA even in the absence of 
tryptophan? 

B. What would happen in 
scenarios (1), (2), and (3) if the 

cells, in addition, produced normal 
tryptophan repressor protein from a 
second, normal gene? 


Figure 8-9 The Lac operon is controlled 
by two transcription regulators, the 

Lac repressor and CAP. When lactose 

is absent, the Lac repressor binds to the 
Lac operator and shuts off expression of 
the operon. Addition of lactose increases 
the intracellular concentration of a related 
compound, allolactose; allolactose binds to 
the Lac repressor, causing it to undergo a 
conformational change that releases its grip 
on the operator DNA (not shown). When 
glucose is absent, cyclic AMP (red circle) is 
produced by the cell, and CAP binds to DNA. 
For the operon to be transcribed, glucose 
must be absent (allowing the CAP activator 
to bind) and lactose must be present 
(releasing the Lac repressor). LacZ, the first 
gene of the operon, encodes the enzyme 
B-galactosidase, which breaks down lactose 
to galactose and glucose (Movie 8.4). 
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QUESTION 8-2 


Explain how DNA-binding proteins 
can make sequence-specific contacts 
to a double-stranded DNA molecule 
without breaking the hydrogen 
bonds that hold the bases together. 
Indicate how, through such contacts, 
a protein can distinguish a T-A from 
a C-G pair. Indicate the parts of the 
nucleotide base pairs that could 
form noncovalent interactions— 
hydrogen bonds, electrostatic 
attractions, or hydrophobic 
interactions (see Panel 2-3, 

pp. 70-71)—with a DNA-binding 
protein. The structures of all the 
base pairs in DNA are given in 
Figure 5-4. 


Figure 8-10 In eukaryotes, gene 
activation can occur at a distance. 

An activator protein bound to a distant 
enhancer attracts RNA polymerase and 
the general transcription factors to the 
promoter. Looping of the intervening DNA 
permits contact between the activator and 
the transcription initiation complex bound 
to the promoter. In the case shown here, 

a large protein complex called Mediator 
serves as a go-between. The broken stretch 
of DNA signifies that the segment of DNA 
between the enhancer and the start of 
transcription varies in length, sometimes 
reaching tens of thousands of nucleotide 
pairs. The TATA box is a DNA recognition 
sequence for the first general transcription 
factor that binds to the promoter (see 
Figure 7-12). Some eukaryotic activator 
proteins bind to DNA as dimers, but others 
bind DNA as monomers, as shown. 


Control of Gene Expression 


cell, similar transcription regulatory devices are combined to generate 
increasingly complex circuits, including those that enable a fertilized egg 
to form the tissues and organs of a multicellular organism. 


Eukaryotic Transcription Regulators Control Gene 
Expression from a Distance 


Eukaryotes, too, use transcription regulators—both activators and 
repressors—to regulate the expression of their genes. The DNA sites to 
which eukaryotic gene activators bind are termed enhancers, because 
their presence dramatically enhances the rate of transcription. However, 
biologists discovered that eukaryotic activator proteins could enhance 
transcription even when they are bound thousands of nucleotide pairs 
upstream—or downstream—of the gene’s promoter. These observations 
raised several questions. How do enhancer sequences and the proteins 
bound to them function over such long distances? How do they commu- 
nicate with a gene’s promoter? 


Many models for this “action at a distance” have been proposed, but the 
simplest of these seems to apply in most cases. The DNA between the 
enhancer and the promoter loops out, bringing the activator protein into 
close proximity with the promoter (Figure 8-10). The DNA thus acts as 
a tether, allowing a protein that is bound to an enhancer—even one that 
is thousands of nucleotide pairs away—to interact with the proteins in 
the vicinity of the promoter (see Figure 7-12). Often, additional proteins 
serve as adaptors to close the loop; the most important of these is a large 
complex of proteins known as Mediator. Together, all of these proteins 
ultimately attract and position the general transcription factors and RNA 
polymerase at the promoter, forming a transcription initiation complex 
(see Figure 8-10). Eukaryotic repressor proteins do the opposite: they 
decrease transcription by preventing the assembly of this complex. 


Eukaryotic Transcription Regulators Help Initiate 
Transcription by Recruiting Chromatin-Modifying Proteins 


In a eukaryotic cell, the proteins that guide the formation of the tran- 
scription initiation complex must also deal with the problem of DNA 
packaging. As discussed in Chapter 5, eukaryotic DNA is wound around 
clusters of histone proteins to form nucleosomes, which, in turn, are 
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folded into higher-order structures. How do transcription regulators, 
general transcription factors, and RNA polymerase gain access to the 
underlying DNA? Although some of these proteins can bind efficiently to 
DNA that is wrapped up in nucleosomes, others are thwarted by these 
compact structures. More critically, nucleosomes that are positioned 
over a promoter can inhibit the initiation of transcription by physically 
blocking the assembly of the general transcription factors and RNA poly- 
merase on the promoter. Such packaging may have evolved in part to 
prevent leaky gene expression by blocking the initiation of transcription 
in the absence of the proper activator proteins. 


In eukaryotic cells, activator and repressor proteins can exploit the mech- 
anisms used to package DNA to help turn genes on and off. As we saw 
in Chapter 5, chromatin structure can be altered by chromatin-remodeling 
complexes and by enzymes that covalently modify the histone proteins 
that form the core of the nucleosome (see Figures 5-24 and 5-25). Many 
gene activators take advantage of these mechanisms by attracting such 
chromatin-modifying proteins to promoters. For example, the recruitment 
of histone acetyltransferases promotes the attachment of acetyl groups to 
selected lysines in the tail of histone proteins; these acetyl groups them- 
selves attract proteins that promote transcription, including some of the 
general transcription factors (Figure 8-11). And the recruitment of chro- 
matin-remodeling complexes makes nearby DNA more accessible. These 
actions enhance the efficiency of transcription initiation. 


In a similar way, gene repressor proteins can modify chromatin in ways 
that reduce the efficiency of transcription initiation. For example, many 
repressors attract histone deacetylases—enzymes that remove the acetyl 
groups from histone tails, thereby reversing the positive effects that 
acetylation has on transcription initiation. Although some eukaryotic 
repressor proteins work on a gene-by-gene basis, others can orchestrate 
the formation of large swathes of transcriptionally inactive chromatin. 
As discussed in Chapter 5, these transcription-resistant regions of DNA 
include the heterochromatin found in interphase chromosomes and the 
inactive X chromosome in the cells of female mammals. 
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QUESTION 8-3 


Some transcription regulators bind 
to DNA and cause the double helix 
to bend at a sharp angle. Such 
“bending proteins” can stimulate 
the initiation of transcription 
without contacting either the RNA 
polymerase, any of the general 
transcription factors, or any other 
transcription regulators. Can you 
devise a plausible explanation for 
how these proteins might work 

to modulate transcription? Draw 

a diagram that illustrates your 
explanation. 


Figure 8-11 Eukaryotic transcriptional 
activators can recruit chromatin- 
modifying proteins to help initiate gene 
transcription. On the left, the recruitment 
of histone-modifying enzymes such as 
histone acetyltransferases adds acetyl 
groups to specific histones, which can 

then serve as binding sites for proteins 

that stimulate transcription initiation (not 
shown). On the right, chromatin-remodeling 
complexes render the DNA packaged in 
nucleosomes more accessible to other 
proteins in the cell, including those required 
for transcription initiation; notice, for 
example, the increased exposure of the 
TATA box. 
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Figure 8-12 Animal and plant 
chromosomes are arranged in DNA loops. 
In this schematic diagram, specialized 
proteins (green) hold chromosomal DNA 
in loops, thereby favoring the association 
of each gene with its proper enhancer. 
The loops, sometimes called topological 
associated domains (TADs), range in 

size between thousands and millions of 
nucleotide pairs and are typically much 
larger than the loops that form between 
regulatory sequences and promoters (see 
Figure 8-10). 
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The Arrangement of Chromosomes into Looped 
Domains Keeps Enhancers in Check 


We have seen that all genes have regulatory regions, which dictate at 
which times, under what conditions, and in what tissues the gene will 
be expressed. We have also seen that eukaryotic transcription regula- 
tors can act across very long stretches of DNA, with the intervening DNA 
looped out. What, then, prevents a transcripton regulator—bound to the 
control region of one gene—from looping in the wrong direction and 
inappropriately influencing the transcription of a neighboring gene? 


To avoid such unwanted cross-talk, the chromosomal DNA of plants and 
animals is arranged in a series of loops that hold individual genes and 
their regulatory regions in rough proximity. This localization restricts the 
action of enhancers, preventing them from wandering across to adjacent 
genes. The chromosomal loops are formed by specialized proteins that 
bind to sequences that are then drawn together to form the base of the 
loop (Figure 8-12). 


The importance of these loops is highlighted by the effects of mutations 
that prevent the loops from properly forming. Such mutations, which 
lead to genes being expressed at the wrong time and place, are found in 
numerous cancers and inherited diseases. 


GENERATING SPECIALIZED CELL TYPES 


All cells must be able to turn genes on and off in response to signals in 
their environment. But the cells of multicellular organisms have taken 
this type of transcriptional control to an extreme, using it in highly spe- 
cialized ways to form organized arrays of differentiated cell types. Such 
decisions present a special challenge: once a cell in a multicellular 
organism becomes committed to differentiate into a specific cell type, 
the choice of fate is generally maintained through subsequent cell divi- 
sions. This means that the changes in gene expression, which are often 
triggered by a transient signal, must be remembered by the cell. Such 
cell memory is a prerequisite for the creation of organized tissues and 
for the maintenance of stably differentiated cell types. In contrast, the 
simplest changes in gene expression in both eukaryotes and bacteria are 
often only transient; the tryptophan repressor, for example, switches off 
the tryptophan operon in bacteria only in the presence of tryptophan; as 
soon as the amino acid is removed from the medium, the genes switch 
back on, and the descendants of the cell will have no memory that their 
ancestors had been exposed to tryptophan. 


In this section, we discuss some of the special features of transcriptional 
regulation that allow multicellular organisms to create and maintain 
specialized cell types. These cell types ultimately produce the tissues 
and organs that give worms, flies, and even humans their distinctive 
characteristics. 


Eukaryotic Genes Are Controlled by Combinations of 
Transcription Regulators 


The genes we have examined thus far have all been controlled by a small 
number of transcription regulators. While this is true for many simple 
bacterial systems, most eukaryotic transcription regulators work as part 
of a large “committee” of regulatory proteins, all of which cooperate to 
express the gene in the right cell type, in response to the right conditions, 
at the right time, and in the required amount. 


The term combinatorial control refers to the process by which groups 
of transcription regulators work together to determine the expression of 
a single gene. The bacterial Lac operon we discussed earlier provides 
a simple example of the use of multiple regulators to control transcrip- 
tion (see Figure 8-9). In eukaryotes, such regulatory inputs have been 
amplified, so that a typical gene is controlled by dozens of transcription 
regulators that bind to regulatory sequences that may be spread over tens 
of thousands of nucleotide pairs. Together, these regulators direct the 
assembly of the Mediator, chromatin-remodeling complexes, histone- 
modifying enzymes, general transcripton factors, and, ultimately, RNA 
polymerase (Figure 8-13). In many cases, multiple repressors and acti- 
vators are bound to the DNA that controls transcription of a given gene; 
how the cell integrates the effects of all of these proteins to determine the 
final level of gene expression is only now beginning to be understood. An 
example of such a complex regulatory system—one that participates in 
the development of a fruit fly from a fertilized egg—is described in How 
We Know, pp. 280-281. 


The Expression of Different Genes Can Be Coordinated 
by a Single Protein 


In addition to being able to switch individual genes on and off, all cells— 
whether prokaryote or eukaryote—need to coordinate the expression of 
different genes. When a eukaryotic cell receives a signal to proliferate, for 
example, a number of hitherto unexpressed genes are turned on together 
to set in motion the events that lead eventually to cell division (discussed 
in Chapter 18). As discussed earlier, bacteria often coordinate the expres- 
sion of a set of genes by having them clustered together in an operon 
under the control of a single promoter (see Figure 8-6). Such clustering is 
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Figure 8-13 Transcription regulators 
work together as a “committee” to 


® control the expression of a eukaryotic 


gene. Whereas the general transcription 
factors that assemble at the promoter 
are the same for all genes transcribed by 
RNA polymerase (see Figure 7-12), the 
transcription regulators and the locations 
of their DNA binding sites relative to the 


These regulators, along with chromatin- 
modifying proteins, are assembled at the 
promoter by the Mediator. The effects 

of multiple transcription regulators 
combine to determine the final rate of 
transcription initiation. The “spacer” DNA 
sequences that separate the regulatory 
DNA sequences are not recognized by any 
transcription regulators. 
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* HOW WE KNOW 


GENE REGULATION—THE STORY OF EVE 


The ability to regulate gene expression is crucial to the 
proper development of a multicellular organism from 
a fertilized egg to an adult. Beginning at the earliest 
moments in development, a succession of transcrip- 
tional programs guides the differential expression of 
genes that allows an animal to form a proper body 
plan—helping to distinguish its back from its belly, and 
its head from its tail. These programs ultimately direct 
the correct placement of a wing or a leg, a mouth or an 
anus, a neuron or a liver cell. 


A central challenge in developmental biology, then, is 
to understand how an organism generates these pat- 
terns of gene expression, which are laid down within 
hours of fertilization. Among the most important genes 
involved in these early stages of development are those 
that encode transcription regulators. By interacting with 
different regulatory DNA sequences, these proteins 
instruct every cell in the embryo to switch on the genes 
that are appropriate for that cell at each time point dur- 
ing development. How can a protein binding to a piece 
of DNA help direct the development of a complex multi- 
cellular organism? To see how we can address that large 
question, we review the story of Eve. 


Seeing Eve 


Even-skipped—Eve, for short—is a gene whose expres- 
sion plays an important part in the development of the 
Drosophila embryo. If this gene is inactivated by muta- 
tion, many parts of the embryo fail to form and the fly 
larva dies early in development. But Eve is not expressed 
uniformly throughout the embryo. Instead, the Eve pro- 
tein is produced in a striking series of seven neat stripes, 
each of which occupies a very precise position along the 
length of the embryo. These seven stripes correspond to 
seven of the fourteen segments that define the body plan 
of the fly—three for the head, three for the thorax, and 
eight for the abdomen. 


This pattern of expression never varies: the Eve protein 
can be found in the very same places in every Drosophila 
embryo (see Figure 8—14B). How can the expression of a 
gene be regulated with such spatial precision—such that 
one cell will produce a protein while a neighboring cell 
does not? To find out, researchers took a trip upstream. 


Dissecting the DNA 


As we have seen in this chapter, regulatory DNA 
sequences control which cells in an organism will 
express a particular gene, and at what point during devel- 
opment that gene will be turned on. In eukaryotes, these 


regulatory sequences are frequently located upstream 
of the gene itself. One way to locate a regulatory DNA 
sequence—and study how it operates—is to remove 
a piece of DNA from the region upstream of a gene of 
interest and insert that DNA upstream of a reporter 
gene—one that encodes a protein with an activity that 
is easy to monitor experimentally. If the piece of DNA 
contains a regulatory sequence, it will drive the expres- 
sion of the reporter gene. When this patchwork piece of 
DNA is subsequently introduced into a cell or organism, 
the reporter gene will be expressed in the same cells and 
tissues that normally express the gene from which the 
regulatory sequence was derived (see Figure 10-24). 


By excising various segments of the DNA sequences 
upstream of Eve, and coupling them to a reporter gene, 
researchers found that the expression of the gene is con- 
trolled by a series of seven regulatory modules—each 
of which specifies a single stripe of Eve expression. In 
this way, researchers identified, for example, a sin- 
gle segment of regulatory DNA that specifies stripe 2. 
They could excise this regulatory segment, link it to a 
reporter gene, and introduce the resulting DNA segment 
into the fly. When they examined embryos that carried 
this engineered DNA, they found that the reporter gene 
is expressed in the precise position of stripe 2 (Figure 
8-14). Similar experiments revealed the existence of six 
other regulatory modules, one for each of the other Eve 
stripes. 


The next question was: How does each of these seven 
regulatory segments direct the formation of a single 
stripe in a specific position? The answer, researchers 
found, is that each segment contains a unique com- 
bination of regulatory sequences that bind different 
combinations of transcription regulators. These regula- 
tors, like the Eve protein itself, are distributed in unique 
patterns within the embryo—some toward the head, 
some toward the rear, some in the middle. 


The regulatory segment that defines stripe 2, for 
example, contains regulatory DNA sequences for four 
transcription regulators: two that activate Eve transcrip- 
tion and two that repress it (Figure 8-15). In the narrow 
band of tissue that constitutes stripe 2, it just so happens 
that the repressor proteins are not present—so the Eve 
gene is expressed; in the bands of tissue on either side of 
the stripe, where the repressors are present, Eve is kept 
quiet. And so a stripe is formed. 


The regulatory segments controlling the other stripes 
are thought to function along similar lines; each regu- 
latory segment reads “positional information” provided 
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Figure 8-14 An experimental approach using a reporter gene reveals the modular construction of the Eve gene 
regulatory region. (A) Expression of the Eve gene is controlled by a series of regulatory segments (orange) that direct 
the production of Eve protein in stripes along the embryo. (B) Embryos stained with antibodies to the Eve protein show 
the seven characteristic stripes of Eve expression. (C) In the laboratory, the regulatory segment that directs the formation 
of stripe 2 can be excised from the DNA shown in part (A) and inserted upstream of the E. coli LacZ gene, which 
encodes the enzyme B-galactosidase (see Figure 8-9). (D) When the engineered DNA containing the stripe 2 regulatory 
segment is introduced into the genome of a fly, the resulting embryo expresses B-galactosidase mRNA precisely in the 
position of the second Eve stripe. This mRNA is visualized by in situ hybridization (see p. 352) using a labeled RNA probe 
that base pairs only with the lacZ mRNA. (B and D, courtesy of Stephen Small and Michael Levine.) 


by some unique combination of transcription regulators 
and expresses Eve on the basis of this information. The 
entire regulatory region is strung out over 20,000 nucle- 
otide pairs of DNA and, altogether, binds more than 20 
transcription regulators. This large regulatory region is 
built from a series of smaller regulatory segments, each 
of which consists of a unique arrangement of regula- 
tory DNA sequences recognized by specific transcription 
regulators. In this way, the Eve gene can respond to an 
enormous combination of inputs. 


The Eve protein is itself a transcription regulator, and 
it—in combination with many other regulatory pro- 
teins—controls key events in the development of the fly. 
This complex organization of a discrete number of regu- 
latory elements begins to explain how the development 
of an entire organism can be orchestrated by repeated 
applications of a few basic principles. 


transcriptional repressors 


] 
Giant 


TA 
J) 


stripe 2 
regulatory 
DNA segment 


1 
Krüppel 


N. d 


Bicoid Hunchback | 


transcriptional activators 


Figure 8-15 The regulatory segment that specifies Eve stripe 2 contains binding sites for four different 
transcription regulators. All four regulators are responsible for the proper expression of Eve in stripe 2. Flies that are 
deficient in the two activators, called Bicoid and Hunchback, fail to form stripe 2 efficiently; in flies deficient in either of 
the two repressors, called Giant and Kruppel, stripe 2 expands and covers an abnormally broad region of the embryo. 
As indicated in the diagram, in some cases the binding sites for the transcription regulators overlap, and the proteins 
compete for binding to the DNA. For example, the binding of Bicoid and Krüppel to the site at the far right is thought to 
be mutually exclusive. The regulatory segment is 480 base pairs in length. 
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Figure 8-16 A single transcription inactive cortisol © cortisol 
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rarely seen in eukaryotic cells, where each gene is transcribed and regu- 
lated individually. So how do eukaryotic cells coordinate the expression 
of multiple genes? In particular, given that a eukaryotic cell uses a com- 
mittee of transcription regulators to control each of its genes, how can it 
rapidly and decisively switch whole groups of genes on or off? 


The answer is that even though control of gene expression is combinato- 
rial, the effect of a single transcription regulator can still be decisive in 
switching any particular gene on or off, simply by completing the combi- 
nation needed to activate or repress that gene. This is like dialing in the 
final number of a combination lock: the lock will spring open if the other 
numbers have been previously entered. And just as the same number 
can complete the combination for different locks, the same protein can 
complete the combination for several different genes. As long as differ- 
ent genes contain regulatory DNA sequences that are recognized by the 
same transcription regulator, they can be switched on or off together as 
a coordinated unit. 


An example of such coordinated regulation in humans is seen in response 
to cortisol (see Table 16-1, p. 536). As discussed earlier in this chapter, 
when this hormone is present, liver cells increase the expression of many 
genes, including those that allow the liver to produce glucose in response 
to starvation or prolonged stress. To switch on such cortisol-responsive 
genes, the cortisol receptor—a transcription regulator—first forms a 
complex with a molecule of cortisol. This cortisol-receptor complex then 
binds to a regulatory sequence in the DNA of each cortisol-responsive 
gene. When the cortisol concentration decreases again, the expression 
of all of these genes drops to normal levels. In this way, a single tran- 
scription regulator can coordinate the expression of many different genes 
(Figure 8-16). 


Combinatorial Control Can Also Generate Different 
Cell Types 


The ability to switch many different genes on or off using a limited number 
of transcription regulators is not only useful in the day-to-day regulation 
of cell function. It is also one of the means by which eukaryotic cells 
diversify into particular types of cells during embryonic development. 


One striking example is the development of muscle cells. A mammalian 
skeletal muscle cell is distinguished from other cells by the production 
of a large number of characteristic proteins, such as the muscle-specific 
forms of actin and myosin that make up the contractile apparatus, as well 
as the receptor proteins and ion channel proteins in the plasma mem- 
brane that allow the muscle cell to contract in response to stimulation by 
nerves (discussed in Chapter 17). The genes encoding this unique array 
of muscle-specific proteins are all switched on coordinately as the mus- 
cle cell differentiates. Studies of developing muscle cells in culture have 
identified a small number of key transcription regulators, expressed only 
in potential muscle cells, that coordinate muscle-specific gene expression 
and are thus crucial for muscle-cell differentiation. This set of regulators 
activates the transcription of the genes that code for muscle-specific pro- 
teins by binding to specific DNA sequences present in their regulatory 
regions. 


In the same way, other sets of transcription regulators can activate the 
expression of genes that are specific for other cell types. How different 
combinations of transcription regulators can tailor the development of 
different cell types is illustrated schematically in Figure 8-17. 


Still other transcription regulators can maintain cells in an undif- 
ferentiated state, like the precursor cell shown in Figure 8-17. Some 
undifferentiated cells are so developmentally flexible they are capable 
of giving rise to all the specialized cell types in the body. The embryonic 
stem (ES) cells we discuss in Chapter 20 retain this remarkable quality, a 
property called pluripotency. 


The differentiation of a particular cell type involves changes in the 
expression of thousands of genes: genes that encode products needed by 
the cell are expressed at high levels, while those that are not needed are 
expressed at low levels or shut down completely. A given transcription 
regulator, therefore, often controls the expression of hundreds or even 
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Figure 8-17 Combinations of a few 
transcription regulators can generate 
many cell types during development. In 
this simple scheme, a “decision” to make 
a new transcription regulator (shown as a 
numbered circle) is made after each cell 
division. Repetition of this simple rule can 
generate eight cell types (A through H) 
using only three transcription regulators. 
Each of these hypothetical cell types would 
then express different sets of genes, as 
dictated by the combination of transcription 
regulators that each cell type produces. 
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Figure 8—18 A set of three transcription 
regulators forms the regulatory network 
that specifies an embryonic stem cell. 

(A) The three transcription regulators— 
KIf4, Oct4, and Sox2—are shown in large 
colored circles. The genes whose regulatory 
sequences contain binding sites for each 

of these regulators are indicated by 

small green dots. The lines that link each 
regulator to a gene represent the binding 
of that regulator to the regulatory region of 
the gene. Note that although each regulator 
controls the expression of a unique set 

of genes, many of these target genes are 
bound by more than one transcription 
regulator—and a substantial set interacts 
with all three. (B) These three regulators 
also control their own expression. As shown 
here, each regulator binds to the regulatory 
region of its own gene, as indicated by 

the feedback loops (red). In addition, 

the regulators also bind to each other's 
regulatory regions (blue). Positive feedback 
loops, a common form of regulation, are 
discussed later in the chapter. 
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Figure 8-19 A master transcription 
regulator can direct the formation of an 
entire organ. Artificially induced expression 
of the Drosophila Ey gene in the precursor 
cells of the leg triggers the misplaced 
development of an eye on a fly's leg. The 
experimentally induced organ appears 

to be structurally normal, containing the 
various types of cells found in a typical fly 
eye. It does not, however, communicate with 
the fly's brain. (Walter Gehring, courtesy of 
Biozentrum, University of Basel.) 


Control of Gene Expression 


thousands of genes (Figure 8—18). Because each gene, in turn, is typically 
controlled by many different transcription regulators, a relatively small 
number of regulators acting in different combinations can form the enor- 
mously complex regulatory networks that generate specialized cell types. 
It is estimated that approximately 1000 transcription regulators are suf- 
ficient to control the 24,000 genes that give rise to an individual human. 


The Formation of an Entire Organ Can Be Triggered by a 
Single Transcription Regulator 


We have seen that transcription regulators, working in combination, can 
control the expression of whole sets of genes and can produce a variety 
of cell types. But in some cases a single transcription regulator can initi- 
ate the formation of not just one cell type but a whole organ. A stunning 
example of such transcriptional control comes from studies of eye devel- 
opment in the fruit fly Drosophila. Here, a single transcription regulator 
called Ey triggers the differentiation of all of the specialized cell types that 
come together to form the eye. Flies with a mutation in the Ey gene have 
no eyes at all, which is how the regulator was discovered. 


How the Ey protein coordinates the specification of each type of cell found 
in the eye—and directs their proper organization in three-dimensional 
space—is an actively studied topic in developmental biology. In essence, 
however, Ey functions like the transcription regulators we have already 
discussed, controlling the expression of multiple genes by binding to 
DNA sequences in their regulatory regions. Some of the genes controlled 
by Ey encode additional transcription regulators that, in turn, control the 
expression of other genes. In this way, the action of this master transcrip- 
tion regulator, which sits at the apex of a regulatory network like the one 
shown in Figure 8-18, produces a cascade of regulators that, working in 
combination, lead to the formation of an organized group of many differ- 
ent types of cells. One can begin to imagine how, by repeated applications 
of this principle, an organism as complex as a fly—or a human—progres- 
sively self-assembles, cell by cell, tissue by tissue, and organ by organ. 


Master regulators such as Ey are so powerful that they can even activate 
their regulatory networks outside the normal location. In the laboratory, 
the Ey gene has been artificially expressed in fruit fly embryos in cells that 
would normally give rise to a leg. When these modified embryos develop 
into adult flies, some have an eye in the middle of a leg (Figure 8-19). 
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Transcription Regulators Can Be Used to Experimentally 
Direct the Formation of Specific Cell Types in Culture 


We have seen that the Ey gene, when introduced into a fly embryo, can 
produce an eye in an unnatural location; this somewhat unusual outcome 
is made possible by the cooperation of numerous transcription regulators 
in a variety of cell types—a situation that is common in a developing 
embryo. Perhaps even more surprising is that some transcription regula- 
tors can convert one specialized cell type to another in a culture dish. 
For example, when the gene encoding the transcription regulator MyoD 
is artificially introduced into fibroblasts cultured from skin, the fibroblasts 
form musclelike cells. It appears that the fibroblasts, which are derived 
from the same broad class of embryonic cells as muscle cells, have 
already accumulated many of the other necessary transcription regula- 
tors required for the combinatorial control of the muscle-specific genes, 
and that addition of MyoD completes the unique combination required to 
direct the cells to become muscle. 


This same type of reprogramming can produce even more impressive 
transformations. For example, a set of nerve-specific transcription regu- 
lators, when artificially expressed in cultured liver cells, can convert them 
into functional neurons (Figure 8—20). And the combination of transcrip- 
tion regulators shown in Figure 8-18 can be used in the laboratory to 
coax differentiated cells to de-differentiate into induced pluripotent 
stem (iPS) cells; these reprogrammed cells behave much like natu- 
rally occurring ES cells, and they can be directed to generate a variety 
of specialized differentiated cells (Figure 8-21). This approach, initially 
performed using cultured fibroblasts, has been adapted to produce iPS 
cells from a variety of specialized cell types, including those taken from 
humans. Differentiated cells produced from human iPS cells are currently 
being used in the study or treatment of disease, as we discuss in Chapter 
20. Taken together, these dramatic demonstrations suggest that it may 
someday be possible to produce in the laboratory any cell type for which 
the correct combination of transcription regulators can be identified. 
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Figure 8-20 A small number of 
transcription regulators can convert 

one differentiated cell type directly into 
another. In this experiment, liver cells grown 
in culture (A) were converted into neuronal 
cells (B) via the artificial introduction of three 
nerve-specific transcription regulators. The 
cells are labeled with a fluorescent dye. 
Such interconversion would never take place 
during normal development. The result 
shown here depends on an experimenter 
expressing several nerve-specific regulators 
in liver cells, where these regulators would, 
during normal development, be tightly shut 
off. (From S. Marro et al., Cell Stem Cell 
9:374-382, 2011. With permission 

from Elsevier.) 


Figure 8—21 A combination of 
transcription regulators can induce a 
differentiated cell to de-differentiate 

into a pluripotent iPS cell. The artificial 
expression of a set of three genes, each of 
which encodes a transcription regulator, can 
reprogram a fibroblast into a pluripotent 
cell with ES cell-like properties. Like ES cells, 
such iPS cells can proliferate indefinitely 

in culture and can be stimulated by 
appropriate extracellular signal molecules 
to differentiate into almost any cell type in 
the body. 
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Differentiated Cells Maintain Their Identity 


Once a cell has become differentiated into a particular cell type in the 
body, it will generally remain differentiated, and all its progeny cells 
will remain that same cell type. Some highly specialized cells, including 
skeletal muscle cells and neurons, never divide again once they have 
differentiated—that is, they are terminally differentiated (as discussed in 
Chapter 18). But many other differentiated cells—such as fibroblasts, 
smooth muscle cells, and liver cells—will divide many times in the life of 
an individual. When they do, these specialized cell types give rise only to 
cells like themselves: unless an experimenter intervenes, smooth muscle 
cells do not give rise to liver cells, nor liver cells to fibroblasts. 


For a proliferating cell to maintain its identity—a property called 
cell memory—the patterns of gene expression responsible for that iden- 
tity must be “remembered” and passed on to its daughter cells through all 
subsequent cell divisions. Thus, in the model illustrated in Figure 8-17, 
the production of each transcription regulator, once begun, has to be 
continued in the daughter cells of each cell division. How is such perpetu- 
ation accomplished? 


Cells have several ways of ensuring that their daughters remember what 
kind of cells they should be. One of the simplest and most important is 
through a positive feedback loop, where a master transcription regulator 
activates transcription ofits own gene, in addition to that of other cell-type- 
specific genes. Each time a cell divides, the regulator is distributed to both 
daughter cells, where it continues to stimulate the positive feedback loop 
(Figure 8-22). The continued stimulation ensures that the regulator will 
continue to be produced in subsequent cell generations. The Ey protein 
and the transcription regulators involved in the generation of ES cells 
and iPS cells take part in such positive feedback loops (see Figure 8-18B). 
Positive feedback is crucial for establishing the “self-sustaining” circuits 
of gene expression that allow a cell to commit to a particular fate—and 
then to transmit that decision to its progeny. 
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Figure 8-22 A positive feedback loop can generate cell memory. Protein A is a master transcription regulator that activates the 
transcription of its own gene—as well as other cell-type-specific genes (not shown). All of the descendants of the original cell will 
therefore “remember” that the progenitor cell had experienced a transient signal that initiated the production of protein A. As shown 
in Figure 8-18, each of the regulators needed to form iPS cells influences its own expression using this type of positive feedback loop. 


Although positive feedback loops are probably the most prevalent way of 
ensuring that daughter cells remember what kind of cells they are meant 
to be, there are other ways of reinforcing cell identity. One involves the 
methylation of DNA. In vertebrate cells, DNA methylation occurs on cer- 
tain cytosine bases (Figure 8-23). This covalent modification generally 
turns off the affected genes by attracting proteins that bind to methyl- 
ated cytosines and block gene transcription. DNA methylation patterns 
are passed on to progeny cells by the action of an enzyme that copies 
the methylation pattern on the parent DNA strand to the daughter DNA 
strand as it is synthesized (Figure 8—24). 


Another mechanism for inheriting gene expression patterns involves the 
modification of histones. When a cell replicates its DNA, each daughter 
double helix receives half of its parent's histone proteins, which contain 
the covalent modifications that were present on the parent chromosome. 
Enzymes responsible for these modifications may bind to the parental 
histones and confer the same modifications to the new histones nearby. 
It has been proposed that this cycle of modification helps reestablish 
the pattern of chromatin structure found in the parent chromosome 
(Figure 8-25). 


Because all of these cell-memory mechanisms transmit patterns of gene 
expression from parent to daughter cell without altering the actual nucle- 
otide sequence of the DNA, they are considered to be forms of epigenetic 
inheritance. These mechanisms, which work together, play an important 
part in maintaining patterns of gene expression, allowing transient sig- 
nals from the environment to be remembered by our cells—a fact that 
has important implications for understanding how cells operate and how 
they malfunction in disease. 


POST-TRANSCRIPTIONAL CONTROLS 


We have seen that transcription regulators control gene expression 
by promoting or hindering the transcription of specific genes. The vast 
majority of genes in all organisms are regulated in this way. But many 
additional points of control can come into play later in the pathway from 
DNA to protein, giving cells a further opportunity to regulate the amount 
or activity of the gene products that they make (see Figure 8-3). These 
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Figure 8-24 DNA methylation patterns can be faithfully inherited when a cell divides. An enzyme called a maintenance 
methyltransferase guarantees that once a pattern of DNA methylation has been established, it is inherited by newly made DNA. 
Immediately after DNA replication, each daughter double helix will contain one methylated DNA strand—inherited from the parent 
double helix—and one unmethylated, newly synthesized strand. The maintenance methyltransferase interacts with these hybrid double 
helices and methylates only those CG sequences that are base-paired with a CG sequence that is already methylated. 
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Figure 8—25 Histone modifications 
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the inheritance of the parental chromatin post-transcriptional controls, which operate after transcription has 
structure. begun, play a crucial part in further fine-tuning the expression of almost 
all genes. 


We have already encountered a few examples of such post-transcriptional 
control. For example, alternative RNA splicing allows different forms of a 
protein, encoded by the same gene, to be made in different tissues (Figure 
7-23). And we saw that various post-translational modifications of a pro- 
tein can regulate its concentration and activity (see Figure 4-47). In the 
remainder of this chapter, we consider several other examples—some 
only recently discovered—of the many ways in which cells can manipu- 
late the expression of a gene after transcription has commenced. 


mRNAs Contain Sequences That Control Their 
Translation 


We saw in Chapter 7 that an mRNA’s lifespan is dictated by specific nucle- 
otide sequences within the untranslated regions that lie both upstream 
and downstream of the protein-coding sequence. These sequences often 
contain binding sites for proteins that are involved in RNA degradation. 
But they also carry information specifying whether—and how often—the 
mRNA is to be translated into protein. 


Although the details differ between eukaryotes and bacteria, the general 
strategy is similar for both. Bacterial mRNAs contain a short ribosome- 
binding sequence located a few nucleotide pairs upstream of the AUG 
codon where translation begins (see Figure 7-40). This binding sequence 
forms base pairs with the rRNA in the small ribosomal subunit, correctly 
positioning the initiating AUG codon within the ribosome. Because this 
interaction is needed for efficient translation initiation, it provides an ideal 
target for translational control. By blocking—or exposing—the ribosome- 
binding sequence, the bacterium can either inhibit—or promote—the 
translation of an mRNA (Figure 8—26). 


In eukaryotes, specialized repressor proteins can similarly inhibit trans- 
lation initiation by binding to specific nucleotide sequences in the 5’ 
untranslated region of the mRNA, thereby preventing the ribosome from 
finding the first AUG. When conditions change, the cell can inactivate the 
repressor to initiate translation of the mRNA. 


Regulatory RNAs Control the Expression of Thousands 
of Genes 
AS we saw in Chapter 7, RNAs perform many critical biological tasks. In 


addition to the mRNAs, which code for proteins, noncoding RNAs have a 
variety of functions. Some, such as transfer RNAs (tRNAs) and ribosomal 
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Figure 8-26 A bacterial gene’s expression can be controlled by regulating translation of its mRNA. 

(A) Sequence-specific RNA-binding proteins can repress the translation of specific mRNAs by keeping the ribosome 
from binding to the ribosome-binding sequence (orange) in the mRNA. Some bacteria exploit this mechanism to 
inhibit the translation of ribosomal proteins. If a ribosomal protein is accidentally produced in excess over other 
ribosomal components, the free protein will inhibit translation of its own mRNA, thereby blocking its own synthesis. 
As new ribosomes are assembled, the levels of the free protein decrease, allowing the mRNA to again be translated 
and the ribosomal protein to be produced. (B) An mRNA from the pathogen Listeria monocytogenes contains a 
“thermosensor” RNA sequence that controls the translation of a set of mRNAs that code for proteins the bacterium 
needs to successfully infect its host. At the warmer temperatures inside a host, base pairs within the thermosensor 


come apart, exposing the ribosome-binding sequence, so the necessary protein is made. 


RNAs (rRNAs) play key structural and catalytic roles in the cell, particu- 
larly in protein synthesis (see pp. 252-253). And the RNA component of 
telomerase is crucial for the complete duplication of eukaryotic chro- 
mosomes (see Figure 6-23). But we now know that many organisms, 
particularly animals and plants, produce thousands of additional non- 
coding RNAS. 


Many of these noncoding RNAs have crucial roles in regulating gene 
expression and are therefore referred to as regulatory RNAs. These 
regulatory RNAs include microRNAs, small interfering RNAs, and long 
noncoding RNAs, and we discuss each in the remaining sections of the 
chapter. 


MicroRNAs Direct the Destruction of Target mRNAs 


MicroRNASs, or miRNAs, are tiny RNA molecules that control gene expres- 
sion by base-pairing with specific mRNAs and reducing both their stability 
and their translation into protein. Like other RNAs, miRNAs also undergo 
processing to produce the mature, functional miRNA molecule. The mature 
miRNA, about 22 nucleotides in length, is packaged with specialized pro- 
teins to form an RNA-induced silencing complex (RISC), which patrols the 
cytosol in search of mRNAs that are complementary in sequence to its 
bound miRNA (Figure 8-27). Once a target MRNA base-pairs with an 
miRNA, it is either destroyed immediately—by a nuclease that is part of 
the RISC—or its translation is blocked. In the latter case, the bound mRNA 
molecule is delivered to a region of the cytosol where other nucleases 
eventually degrade it. Destruction of the mRNA releases the miRNA- 
bearing RISC, allowing it to seek out additional mRNA targets. Thus, a 
single miRNA—as part of a RISC—can eliminate one mRNA molecule after 
another, thereby efficiently blocking production of the encoded protein. 


There are thought to be roughly 500 different miRNAs encoded by the 
human genome; these RNAs may regulate as many as one-third of our 
protein-coding genes. Although we are only beginning to understand the 
full impact of these miRNAs, it is clear that they play a critical part in 
regulating gene expression and thereby influence many cell functions. 
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Figure 8-27 An miRNA targets a 
complementary mRNA molecule for 
destruction. Each precursor miRNA 
transcript is processed to form a double- 
stranded intermediate, which is further 
processed to form a mature, single-stranded 
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a nucleotide sequence complementary 

to its bound miRNA. Depending on how 
extensive the region of complementarity is, 
the target mRNA is either rapidly degraded 
by a nuclease within the RISC (shown on 
the left) or transferred to an area of the 
cytoplasm where other nucleases destroy it 
(shown on the right). 
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Small Interfering RNAs Protect Cells From Infections 


Some of the same components that process and package miRNAs also 
play another crucial part in the life of a cell: they serve as a powerful 
cell defense mechanism. In this case, the system is used to eliminate 
“foreign” RNA molecules—in particular, long, double-stranded RNA mol- 
ecules. Such RNAs are rarely produced by normal genes, but they often 
serve as intermediates in the life cycles of viruses and in the movement of 
some transposable genetic elements (discussed in Chapter 9). This form 
of RNA targeting, called RNA interference (RNAI), keeps these poten- 
tially destructive elements in check. 
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FORMATION OF RISC In the first step of RNAI, double-stranded, foreign RNAs are cut into short 
fragments (approximately 22 nucleotide pairs in length) in the cytosol by 
a protein called Dicer—the same protein used to generate the double- 
stranded RNA intermediate in miRNA production (see Figure 8-27). The 
resulting double-stranded RNA fragments, called small interfering RNAs 
(siRNAs), are then taken up by the same RISC proteins that carry 


Se es miRNAs. The RISC discards one strand of the siRNA duplex and uses the 

RNA remaining single-stranded RNA to seek and destroy complementary RNA 

TNA sihglestratidëù molecules (Figure 8—28). In this way, the infected cell effectively turns the 
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Figure 8—28 siRNAs are produced from double-stranded, foreign 
RNAs during the process of RNA interference. Double-stranded 
RNAs from a virus or transposable genetic element are first cleaved 
by a nuclease called Dicer. The resulting double-stranded fragments 
(known as siRNAs) are incorporated into RISCs, which discard one 
strand of the duplex and use the other strand to locate and destroy 
. foreign RNAs that contain a complementary sequence. 
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At the same time, RNAi can also selectively shut off the synthesis of foreign 
RNAs by the host’s RNA polymerase. In this case, the siRNAs produced by 
Dicer are packaged into a protein complex called RITS (for RNA-induced 
transcriptional silencing). Using its single-stranded siRNA as a guide, the 
RITS complex attaches itself to complementary RNA sequences as they 
emerge from an actively transcribing RNA polymerase (Figure 8-29). 
Positioned along a gene in this way, the RITS complex then attracts pro- 
teins that covalently modify nearby histones in a way that promotes the 
localized formation of heterochromatin (see Figure 5-27). This hetero- 
chromatin then blocks further transcription initiation at that site. Such 
RNAi-directed heterochromatin formation helps limit the spread of trans- 
posable genetic elements throughout the host genome. 


RNAi operates in a wide variety of organisms, including single-celled 
fungi, plants, and worms, indicating that it is an evolutionarily ancient 
defense mechanism, particularly against viral infection. In some organ- 
isms, including many plants, the RNAi defense response can spread from 
tissue to tissue, allowing an entire organism to become resistant to a 
virus after only a few of its cells have been infected. In this sense, RNAi 
resembles certain aspects of the adaptive immune responses of ver- 
tebrates; in both cases, an invading pathogen elicits the production of 
molecules—either siRNAs or antibodies—that are custom-made to inac- 
tivate the specific invader and thereby protect the host. 


Thousands of Long Noncoding RNAs May Also Regulate 
Mammalian Gene Activity 


At the other end of the size spectrum are the long noncoding RNAs, a 
class of RNA molecules that are defined as being more than 200 nucleo- 
tides in length. There are thought to be upward of 5000 of these lengthy 
RNAs encoded in the human and mouse genomes. Yet, with few excep- 
tions, their roles in the biology of the organism, if any, are not entirely 
clear. 


One of the best understood of the long noncoding RNAs is Xist. This enor- 
mous RNA molecule, some 17,000 nucleotides long, is a key player in 
X-inactivation—the process by which one of the two X chromosomes in 
the cells of female mammals is permanently silenced (see Figure 5-28). 
Early in development, Xist is produced by only one of the X chromosomes 
in each female nucleus. The transcript then “sticks around,” coating the 
chromosome and attracting the enzymes and chromatin-remodeling 
complexes that promote the formation of highly condensed heterochro- 
matin. Other long noncoding RNAs may promote the silencing of specific 
genes in a similar manner. 


Some long noncoding RNAs fold into specific, three-dimensional struc- 
tures via complementary base pairing, as discussed in Chapter 7 (see for 
example Figure 7-5). These structures can serve as scaffolds, which bring 
together proteins that function together in a particular cell process (Figure 
8-30). For example, one of the roles of the RNA molecule in telomerase— 
the enzyme that duplicates the ends of eukaryotic chromosomes (see 
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Figure 8-29 RNAi can also trigger 
transcriptional silencing. In this case, a 
single-stranded siRNA is incorporated 

into a RITS complex, which uses the 
single-stranded siRNA to search for 
complementary RNA sequences as 

they emerge from a transcribing RNA 
polymerase. The binding of the RITS 
complex attracts proteins that promote the 
modification of histones and the formation 
of tightly packed heterochromatin. This 
change in chromatin structure, directed 

by complementary base-pairing, causes 
transcriptional repression. Such silencing is 
used in plants, animals, and fungi to hold 
transposable elements in check. 


Figure 8-30 Long noncoding RNAs can 
serve as scaffolds, bringing together 
proteins that function in the same cell 
process. As described in Chapter 7, RNAs 
can fold into three-dimensional structures 
that can be recognized by specific proteins. 
By engaging in complementary base-pairing 
with other RNA molecules, these long 
noncoding RNAs can, in principle, localize 
proteins to specific sequences in RNA or 
DNA molecules, as shown. 
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Figure 6-23)—is to hold its different protein subunits together. By bring- 
ing together protein subunits, long noncoding RNAs can play important 
roles in many cell activities. 


Regardless of how the various long noncoding RNAs operate—or what 
exactly each of them does—the discovery of this large class of RNAs 
reinforces the idea that a eukaryotic genome contains information that 
provides not only an inventory of the molecules and structures every cell 
must make, but also a set of instructions for how and when to assemble 
these parts to guide the growth and development of a complete organism. 


ESSENTIAL CONCEPTS 


A typical eukaryotic cell expresses only a fraction of its genes, and 
the distinct types of cells in multicellular organisms arise because 
different sets of genes are expressed as cells differentiate. 


In principle, gene expression can be controlled at any of the steps 
between a gene and its ultimate functional product. For the majority 
of genes, however, the initiation of transcription is the most impor- 
tant point of control. 


The transcription of individual genes is switched on and off in cells by 
transcription regulators, proteins that bind to short stretches of DNA 
called regulatory DNA sequences. 


In bacteria, transcription regulators usually bind to regulatory DNA 
sequences close to where RNA polymerase binds. This binding can 
either activate or repress transcription of the gene. In eukaryotes, 
regulatory DNA sequences are often separated from the promoter by 
many thousands of nucleotide pairs. 


Eukaryotic transcription regulators act in two main ways: (1) they can 
directly affect the assembly process that requires RNA polymerase 
and the general transcription factors at the promoter, and (2) they 
can locally modify the chromatin structure of promoter regions. 


In eukaryotes, the expression of a gene is generally controlled by a 
combination of different transcription regulators. 


In multicellular plants and animals, the production of different tran- 
scription regulators in different cell types ensures the expression of 
only those genes appropriate to the particular type of cell. 


A master transcription regulator, if expressed in the appropriate pre- 
cursor cell, can trigger the formation of a specialized cell type or 
even an entire organ. 


One differentiated cell type can be converted to another by artificially 
expressing an appropriate set of transcription regulators. A differen- 
tiated cell can also be reprogrammed into a stem cell by artificially 
expressing a different, specific set of such regulators. 


Cells in multicellular organisms have mechanisms that enable their 
progeny to “remember” what type of cell they should be. A promi- 
nent mechanism for propagating cell memory relies on transcription 
regulators that perpetuate transcription of their own gene—a form of 
positive feedback. 


The pattern of DNA methylation can be transmitted from one cell 
generation to the next, producing a form of epigenetic inheritance 
that helps a cell remember the state of gene expression in its parent 
cell. There is also evidence for a form of epigenetic inheritance based 
on transmitted chromatin structures. 


Cells can regulate gene expression by controlling events that occur 
after transcription has begun. Many of these post-transcriptional 
mechanisms rely on RNA molecules that can influence their own sta- 
bility or translation. 


MicroRNAs (miRNAs) control gene expression by base-pairing with 
specific mRNAs and inhibiting their stability and translation. 


Cells have a defense mechanism for destroying “foreign” double- 
stranded RNAs, many of which are produced by viruses. It makes use 
of small interfering RNAs (siRNAs) that are produced from the foreign 
RNAs in a process called RNA interference (RNAi). 


The recent discovery of thousands of long noncoding RNAs in 
mammals has revealed new roles for RNAs in assembling protein 
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QUESTIONS 


QUESTION 8-4 


A virus that grows in bacteria (bacterial viruses are called 
bacteriophages) can replicate in one of two ways. In the 
prophage state, the viral DNA is inserted into the bacterial 
chromosome and is copied along with the bacterial genome 
each time the cell divides. In the lytic state, the viral DNA 

is released from the bacterial chromosome and replicates 
many times in the cell. This viral DNA then produces viral 
coat proteins that together with the replicated viral DNA 
form many new virus particles that burst out of the bacterial 
cell. These two forms of growth are controlled by two 
transcription regulators, the repressor (product of the c/ 
gene) and Cro, both of which are encoded by the virus. In 
the prophage state, cI is expressed; in the lytic state, Cro is 
expressed. In addition to regulating the expression of other 
genes, cI represses the Cro gene, and Cro represses the c/ 
gene (Figure Q8-4). When bacteria containing a phage in 
the prophage state are briefly irradiated with UV light, cI 
protein is degraded. 


A. What will happen next? 


B. Will the change in (A) be reversed when the UV light is 
switched off? 


C. What advantage might this response to UV light provide 
to the virus? 
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QUESTION 8-5 


Which of the following statements are correct? Explain your 
answers. 


A. In bacteria, but not in eukaryotes, many mRNAs contain 
the coding region for more than one gene. 


B. Most DNA-binding proteins bind to the major groove of 
the DNA double helix. 


C. Of the major control points in gene expression 
(transcription, RNA processing, RNA transport, translation, 
and control of a protein’s activity), transcription initiation is 
one of the most common. 


QUESTION 8-6 


Your task in the laboratory of Professor Quasimodo is 

to determine how far an enhancer (a binding site for an 
activator protein) can be moved from the promoter of 
the straightspine gene and still activate transcription. You 
systematically vary the number of nucleotide pairs between 
these two sites and then determine the amount 

of transcription by measuring the production of 
Straightspine mRNA. At first glance, your data look 
confusing (Figure Q8-6). What would you have expected 
for the results of this experiment? Can you save your 
reputation and explain these results to Professor 
Quasimodo? 
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Figure O8-6 


QUESTION 8-7 


The A repressor binds as a dimer to critical sites on 

the à genome to repress the virus’s lytic genes. This is 
necessary to maintain the prophage (integrated) state. 
Each molecule of the repressor consists of an N-terminal 
DNA-binding domain and a C-terminal dimerization domain 
(Figure Q8-7). Upon viral induction (for example, by 
irradiation with UV light), the genes for lytic growth are 
expressed, À progeny are produced, and the bacterial cell is 
lysed (see Question 8-4). Induction is initiated by cleavage 
of the A repressor at a site between the DNA-binding 
domain and the dimerization domain, which causes the 


cleavage 
+ p —p» site 


repressor monomers DNA binding site 


Figure O8-7 


repressor dimer 


repressor to dissociate from the DNA. In the absence of 
bound repressor, RNA polymerase binds and initiates lytic 
growth. Given that the number (concentration) of DNA- 
binding domains is unchanged by cleavage of the repressor, 
why do you suppose its cleavage results in its dissociation 
from the DNA? 


QUESTION 8-8 


The Arg genes that encode the enzymes for arginine 
biosynthesis are located at several positions around the 
genome of E. coli, and they are regulated coordinately 

by a transcription regulator encoded by the ArgR gene. 
The activity of the ArgR protein is modulated by arginine. 
Upon binding arginine, ArgR alters its conformation, 
dramatically changing its affinity for the DNA sequences in 
the promoters of the genes for the arginine biosynthetic 
enzymes. Given that ArgR is a repressor protein, would you 
expect that ArgR would bind more tightly or less tightly 

to the DNA sequences when arginine is abundant? If ArgR 
functioned instead as an activator protein, would you expect 
the binding of arginine to increase or to decrease its affinity 
for its regulatory DNA sequences? Explain your answers. 


QUESTION 8-9 


When enhancers were initially found to influence 
transcription from many thousands of nucleotide pairs away 
from the promoters they control, two principal models were 
invoked to explain this action at a distance. In the “DNA 
looping” model, direct interactions between proteins bound 
at enhancers and promoters were proposed to stimulate 
transcription initiation. In the “scanning” or “entry-site” 
model, RNA polymerase (or another component of the 
transcription machinery) was proposed to bind at the 
enhancer and then scan along the DNA until it reached the 
promoter. These two models were tested using an enhancer 
on one piece of DNA and a B-globin gene and promoter 
on a separate piece of DNA (Figure O8-9). The B-globin 
gene was not expressed when these two separate pieces 
of DNA were introduced together. However, when the two 
segments of DNA were joined via a linker (made of a protein 
that binds to a small molecule called biotin), the B-globin 
gene was expressed. 

Does this experiment distinguish between the DNA 
looping model and the scanning model? Explain your 
answer. 
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Figure Q8-9 


QUESTION 8-10 


Differentiated cells of an organism contain the same genes. 
(Among the few exceptions to this rule are the cells of 
the mammalian immune system, in which the formation of 


specialized cells is based on limited rearrangements of the 
genome.) Describe an experiment that substantiates the 
first sentence of this question, and explain why it does. 


QUESTION 8-11 


Figure 8—17 shows a simple scheme by which three 
transcription regulators are used during development 

to create eight different cell types. How many cell types 
could you create, using the same rules, with four different 
transcription regulators? As described in the text, MyoD is 
a transcription regulator that by itself is sufficient to induce 
muscle-specific gene expression in fibroblasts. How does 
this observation fit the scheme in Figure 8-17? 


QUESTION 8-12 


Imagine the two situations shown in Figure Q8-12. In 

cell |, a transient signal induces the synthesis of protein 

A, which is a transcriptional activator that turns on many 
genes including its own. In cell Il, a transient signal induces 
the synthesis of protein R, which is a transcriptional 
repressor that turns off many genes including its own. In 
which, if either, of these situations will the descendants of 
the original cell “remember” that the progenitor cell had 
experienced the transient signal? Explain your reasoning. 


(A) CELL | 
Q 
OFF transient A F) 
signal Q 
E — — 
gene activator turns on transcription activator protein 
of activator mRNA turns on its own 
transcription 
(B) CELL II 
OFF transient 5 a = 
signal A 
E — — E 
gene repressor turns on transcription repressor protein 
, of repressor mRNA turns off its own 
Figure Q8-12 transcription 


QUESTION 8-13 


Discuss the following argument: “If the expression of every 
gene depends on a set of transcription regulators, then the 
expression of these regulators must also depend on the 
expression of other regulators, and their expression must 
depend on the expression of still other regulators, and so 
on. Cells would therefore need an infinite number of genes, 
most of which would code for transcription regulators.” 
How does the cell get by without having to achieve the 
impossible? 
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How Genes and 
Genomes Evolve 


For a given individual, the nucleotide sequence of the genome in every GENERATING GENETIC 
one of its cells is virtually the same. But compare the DNA of two individ- VARIATION 

uals—even parent and child—and that is no longer the case: the genomes 

of individuals within a species contain slightly different information. And 

between members of different species, the deviations are even more RECONSTRUCTING LIFE’S 
extensive. FAMILY TREE 


Such differences in DNA sequence are responsible for the diversity of 

life on Earth, from the subtle variations in hair color, eye color, and skin MOBILE GENETIC ELEMENTS 
color that characterize members of our own species ( ) to the AND VIRUSES 

dramatic differences in phenotype that distinguish a fish from a fungus or 

a robin from a rose. But if all life emerged from a common ancestor—a 

single-celled organism that existed some 3.5 billion years ago—where EXAMINING THE HUMAN 
did these genetic improvisations come from? How did they arise, why GENOME 

were they preserved, and how do they contribute to the breathtaking bio- 

logical diversity that surrounds us? 


Improvements in the methods used to sequence and analyze whole 
genomes—from pufferfish to people—are now allowing us to address 
some of these questions. In Chapter 10, we describe these revolutionary 
technologies, which continue to transform the modern era of genom- 
ics. In this chapter, we present some of the fruits of these technological 
innovations. We discuss how genes and genomes have been sculpted 
over billions of years to give rise to the spectacular menagerie of life- 
forms that crowd every corner of the planet. We examine the molecular 
mechanisms that generate genetic diversity, and we consider how the 
information in present-day genomes can be deciphered to yield a his- 
torical record of the evolutionary processes that have shaped these DNA 
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Figure 9-1 Small differences in DNA 
sequence account for differences in 
appearance between one individual 
and the next. A group of schoolchildren 
displays a sampling of the characteristics 
that define the unity and diversity of our 
own species. (joSon/Getty Images.) 


sequences. We also take a brief look at mobile genetic elements and 
consider how these elements, along with modern-day viruses, can carry 
genetic information from place to place and from organism to organism. 
Finally, we end the chapter by taking a closer look at the human genome 
to see what the DNA sequences from individuals all around the world tell 
us about who we are and where we come from. 


GENERATING GENETIC VARIATION 


There is no natural mechanism for making long stretches of entirely 
novel nucleotide sequences. Thus evolution is more a tinkerer than an 
inventor: it uses as its raw materials the DNA sequences that each organ- 
ism inherits from its ancestors. In this sense, no gene or genome is ever 
entirely new. Instead, the astonishing diversity in form and function in 
the living world is all the result of variations on preexisting themes. As 
genetic variations pile up over millions of generations, they can produce 
radical change. 


Several basic types of genetic change are especially crucial in evolution 
(Figure 9-2): 


e Mutation within a gene: An existing gene can be modified by a 
mutation that changes a single nucleotide or deletes or duplicates 
one or more nucleotides. These mutations can alter the splicing of 
a gene’s RNA transcript or change the stability, activity, location, or 
interactions of its encoded protein or RNA product. 


e Mutation within regulatory DNA sequences: When and where a gene 
is expressed can be affected by a mutation in the stretches of DNA 
sequence that regulate the gene's activity (described in Chapter 8). 
For example, humans and fish have a surprisingly large number of 
genes in common, but changes in the regulation of those shared 
genes underlie many of the most dramatic differences between 
those species. 


e Gene duplication and divergence: An existing gene, or even a whole 
genome, can be duplicated. As the cell containing this duplication, 
and its progeny, continue to divide, the original DNA sequence 
and the duplicate sequence can acquire different mutations and 
thereby assume new functions and patterns of expression. 


e Exon shuffling: Two or more existing genes can be broken and 
rejoined to make a hybrid gene containing DNA segments that 
originally belonged to separate genes. In eukaryotes, such break- 
ing and rejoining often occurs within the long intron sequences, 
which do not encode protein. Because these intron sequences are 
removed by RNA splicing, the breaking and joining do not have to 
be precise to produce a functional gene. 


e Transposition of mobile genetic elements: Specialized DNA sequences 
that can move from one chromosomal location to another can alter 
the activity or regulation of a gene; they can also promote gene 
duplication, exon shuffling, and other genome rearrangements. 


e Horizontal gene transfer: A piece of DNA can be passed from the 
genome of one cell to that of another—even to that of another spe- 
cies. This process, which is rare among eukaryotes but common 
among bacteria, differs from the usual “vertical” transfer of genetic 
information from parent to progeny. 


Each of these forms of genetic variation has played an important part in 
the evolution of modern organisms. And they still play that part today, 
as organisms continue to evolve. In this section, we discuss these basic 
mechanisms of genetic change, and we consider their consequences for 
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genome evolution. But first, we pause to consider the contribution of 
sex—the mechanism that many organisms use to pass genetic informa- 
tion on to future generations. 


In Sexually Reproducing Organisms, Only Changes to 
the Germ Line Are Passed On to Progeny 


For bacteria and unicellular organisms that reproduce asexually, the 
inheritance of genetic information is fairly straightforward. Each indi- 
vidual duplicates its genome and donates one copy to each daughter cell 
when the individual divides in two. The family tree of such unicellular 
organisms is simply a branching diagram of cell divisions that directly 
links each individual to its progeny and to its ancestors. 


For a multicellular organism that reproduces sexually, however, the fam- 
ily connections are considerably more complex. Although individual cells 
within that organism divide, only the specialized reproductive cells—the 
gametes—carry a copy of its genome to the next generation of organisms 
(discussed in Chapter 19). All the other cells of the body—the somatic 
cells—are doomed to die without leaving evolutionary descendants of 
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Figure 9-2 Genes and genomes 

can be altered by several different 
mechanisms. Small mutations, duplications, 
rearrangements, and even the infusion 

of fresh genetic material all contribute to 
genome evolution. 


QUESTION 9-1 


In this chapter, we argue that 
genetic variability is beneficial for 

a species because it enhances that 
species’ ability to adapt to changing 
conditions. Why, then, do you think 
that cells go to such great lengths 
to ensure the fidelity of DNA 
replication? 
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Figure 9-3 Germ-line cells and somatic D gamete D gamete 
cells have fundamentally different 
functions. In sexually reproducing 
organisms, genetic information is germ-line cells germ-line cells 
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to somatic cells (blue). Somatic cells form the 
body of the organism but do not contribute PARENT OFFSPRING 
their DNA to the next generation. 
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their own (Figure 9-3). In a sense, somatic cells exist only to support the 
germ-line cell lineage that gives rise to the gametes. 


A mutation that occurs in a somatic cell—although it might have unfortu- 
nate consequences for the individual in which it occurs (causing cancer, 
for example)—will not be transmitted to the organism’s offspring. For a 
mutation to be passed on to the next generation, it must alter the germ 
line (Figure 9-4). Thus, when we track the genetic changes that accu- 
mulate during the evolution of sexually reproducing organisms, we are 
looking at events that took place in a germ-line cell. It is through a series 
of germ-line cell divisions that sexually reproducing organisms trace 
their descent back to their ancestors and, ultimately, back to the ances- 
tors of us all—the first cells that existed, at the origin of life more than 3.5 
billion years ago. 


In addition to perpetuating a species, sex also introduces its own form of 
genetic change: when gametes from a male and female unite during fer- 
tilization, they generate offspring that are genetically distinct from either 
parent. We discuss this form of genetic diversification, which occurs only 
in sexually reproducing species, in detail in Chapter 19. The mechanisms 
for generating genetic change we discuss in this chapter, on the other 
hand, apply to all living things—and we return to them now. 


Point Mutations Are Caused by Failures of the Normal 
Mechanisms for Copying and Repairing DNA 


Despite the elaborate mechanisms that exist to faithfully copy and repair 
DNA sequences, every nucleotide pair in an organism’s genome runs a 
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Figure 9-4 Mutations in germ-line 

cells and somatic cells have different 
consequences. A mutation that occurs in 

a germ-line cell (A) can be passed on to 

the next generation (green). By contrast, 

a mutation that arises in a somatic cell (B) 
affects only the progeny of that cell (orange) 
and will not be passed on to the organism's 
offspring. As we discuss in Chapter 20, somatic cells somatic cells 
somatic mutations are responsible for most 
human cancers (see pp. 720-721). PARENT OFFSPRING 
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small risk of changing each time a cell divides. Changes that affect a sin- 
gle nucleotide pair are called point mutations. These typically arise from 
rare errors in DNA replication or repair (discussed in Chapter 6). 


The point mutation rate has been determined directly in experiments with 
bacteria such as E. coli. Under laboratory conditions, E. coli divides about 
once every 20-25 minutes; in less than a day, a single E. coli can produce 
more descendants than there are humans on Earth—enough to provide 
a good chance for almost any conceivable point mutation to occur. A 
culture containing 10? E. coli cells thus harbors millions of mutant cells 
whose genomes differ subtly from a single ancestor cell. A few of these 
mutations may confer a selective advantage on individual cells: resist- 
ance to a poison, for example, or the ability to survive when deprived of 
a standard nutrient. By exposing the culture to a selective condition— 
adding an antibiotic or removing an essential nutrient, for example—one 
can find these needles in the haystack; that is, the cells that have under- 
gone a specific mutation enabling them to survive in conditions where 
the original cells cannot (Figure 9-5). Such experiments have revealed 
that the overall point mutation frequency in E. coli is about 3 changes for 
each 10!° nucleotide pairs replicated. With a genome size of 4.6 million 
nucleotide pairs, this mutation rate means that approximately 99.99% 
of the time, the two daughter cells produced in a round of cell division 
will inherit exactly the same genome sequence of the parent E. coli cell; 
mutant cells are therefore produced only rarely. 


The overall mutation rate in humans, as determined by comparing the 
DNA sequences of children and their parents (and estimating how many 
times the parental germ cells divided before producing gametes), is 
about one-third that of E. coli—which suggests that the mechanisms that 


medium lacking rare colony of cells that 


mutant E. coli cell histidine contains a new mutation 
that requires histidine \) AS CELLS DIVIDE, yy SAMPLE OF CELLS enabling proliferation in 
to proliferate INNOCULATE RANDOM MUTATIONS SPREAD ON the absence of histidine 
CULTURE ARISE SPONTANEOUSLY PETRI DISH 
— —— — 
U Y U Y 


rich medium, 
which includes 


bacteria in which 
different mutations 


MUTATION IN His GENE histidine, allows have occured NEW MUTATION 
a : all bacteria IN His GENE 
LI His 
gene 


UGA mRNA 
premature stop codon 


enzyme 


Figure 9-5 Mutation rates can be measured in the laboratory. In this experiment, an E. coli strain that carries a deleterious point 
mutation in the His gene—which is needed to manufacture the amino acid histidine—is used. The mutation has converted a G-C 
nucleotide pair to an A-T, resulting in a premature stop signal in the mRNA produced from the mutant gene (left box). As long as 
histidine is supplied in the growth medium, this strain can grow and divide normally. If a large number of mutant cells (say 1010) is 
spread on an agar plate that lacks histidine, the great majority will die. The rare survivors will contain a new mutation in which the A-T is 
changed back to a G-C. This “reversion” corrects the original defect and allows the bacterium to make the enzyme it needs to survive 
in the absence of histidine. Such mutations happen by chance and only rarely, but the ability to work with very large numbers of E. coli 
cells makes it possible to detect this change and to accurately measure its frequency. 
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evolved to maintain genome integrity operate with an efficiency that does 
not greatly differ between even distantly related species. 


Point mutations can destroy a gene’s activity or—very rarely—improve it 
(as shown in Figure 9-5). More often, however, they do neither of these 
things. At many sites in the genome, a point mutation has absolutely 
no effect on the organism's appearance, viability, or ability to reproduce. 
Such neutral mutations often fall in regions of the gene where the DNA 
sequence is unimportant, including most of an intron’s sequence. In cases 
where they occur within an exon, neutral mutations can change the third 
position of a codon such that the amino acid it specifies is unchanged—or 
is so similar that the protein’s function is unaffected. 


Mutations Can Also Change the Regulation of a Gene 


Point mutations that lie outside the coding sequences of genes can some- 
times affect regulatory DNA sequences—elements that control the timing, 
location, and level of gene expression. Such mutations in regulatory DNA 
sequences can have a profound effect on the protein’s production and 
thereby on the organism. For example, a small number of people are 
resistant to malaria because of a point mutation that affects the expres- 
sion of a cell-surface receptor to which the malaria parasite Plasmodium 
vivax binds. The mutation prevents the receptor from being produced 
in red blood cells, rendering the individuals who carry this mutation 
immune to malarial infection. 


Point mutations in regulatory DNA sequences also have a role in our abil- 
ity to digest lactose, the main sugar in milk. Our earliest ancestors were 
lactose intolerant, because the enzyme that breaks down lactose—called 
lactase—was made only during infancy. Adults, who were no longer 
exposed to breast milk, did not need the enzyme. When humans began 
to get milk from domesticated cattle some 10,000 years ago, variant 
genes—the product of random mutation—enabled those who carried the 
variation to continue to express lactase as adults, and thus take advan- 
tage of nutrition provided by cow’s milk. We now know that people who 
retain the ability to digest milk as adults contain a point mutation in the 
regulatory DNA sequence of the lactase gene, allowing it to be efficiently 
transcribed throughout life. In a sense, these milk-drinking adults are 
“mutants” with respect to their ancestors. It is remarkable how quickly 
this adaptation spread through the human population, especially in soci- 
eties that depended heavily on milk for nutrition (Figure 9-6). 


These evolutionary changes in the regulatory DNA sequence of the lactase 
gene occurred relatively recently (10,000 years ago), well after humans 
became a distinct species. However, much more ancient changes in reg- 
ulatory DNA sequences have occurred in other genes, and some of these 
are thought to underlie many of the profound differences among species 
(Figure 9-7). 


DNA Duplications Give Rise to Families of Related 
Genes 


Point mutations can influence the activity of an existing gene, but how do 
new genes with new functions come into being? Gene duplication is per- 
haps the most important mechanism for generating new genes from old 
ones. Once a gene has been duplicated, each of the two copies is free to 
accumulate mutations—as long as whatever activities the original gene 
may have had are not lost. Over time, as mutations continue to accu- 
mulate in the descendants of the original cell in which gene duplication 
occurred, some of these genetic changes allow one of the gene copies to 
perform a different function. 
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Figure 9-6 The widespread ability of adult humans to digest milk followed the domestication of cattle. 
Approximately 10,000 years ago, humans in northern Europe and central Africa began to raise cattle. The 
subsequent availability of cow's milk—particularly during periods of starvation—gave a selective advantage to those 
humans able to digest lactose as adults. Two independent point mutations that allow the expression of lactase in 
adults arose in human populations—one in northern Europe and another in central Africa. These mutations have 


since spread through different regions of the world. 


By repeated rounds of this process of gene duplication and divergence 
over many millions of years, one gene can give rise to a whole family of 
genes, each with a specialized function, within a single genome. Analysis 
of genome sequences reveals many examples of such gene families: in 
Bacillus subtilis, for example, nearly half of the genes have one or more 
obvious relatives elsewhere in the genome. And in vertebrates, the globin 
family of genes, which encode oxygen-carrying proteins, clearly arose 
from a single primordial gene, as we see shortly. But how does gene 
duplication occur in the first place? 


Many gene duplications are believed to be generated by homologous 
recombination. As discussed in Chapter 6, homologous recombination 
provides an important mechanism for mending a broken double helix; 
it allows an intact chromosome to be used as a template to repair a 
damaged sequence on its homolog. But as we discuss in Chapter 19, 
homologous recombination can also catalyze crossovers in which two 
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Figure 9-7 Changes in regulatory 

DNA sequences can have dramatic 
consequences for the development of 
an organism. In this hypothetical example, 
the genomes of two closely related species 
A and B contain the same three genes 

(1, 2, and 3) and encode the same two 
transcription regulators (red oval, brown 
triangle). However, the regulatory DNA 
sequences controlling expression of genes 
2 and 3 are different in the two species. 
Although both express gene 1 during 
embryonic stage 1, the differences in their 
regulatory DNA sequences cause them 

to express different genes in stage 2. In 
principle, a collection of such regulatory 
changes can have profound effects on an 
organism's developmental program—and, 
ultimately, on the appearance of the adult. 
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Figure 9-8 Gene duplication can be 
caused by crossovers between short, 
repeated DNA sequences in adjacent 
homologous chromosomes. The two 
chromosomes shown here undergo 
homologous recombination at short 
repeated sequences (red), that bracket a 
gene (orange). For simplicity, only one gene 
is shown on each homolog. The repeated 
sequences can be remnants of mobile 
genetic elements, which are present in 
many copies in the human genome, as we 
discuss shortly. When crossing-over occurs 
unequally, as shown, one chromosome will 
get two copies of the gene, while the other 
will get none. The type of homologous 
recombination that produces gene 
duplications is called unequal crossing-over 
because the resulting products are unequal 
in size. If this process occurs in the germ 
line, some progeny will inherit the long 
chromosome, while others will inherit the 
short one. 
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chromosomes are broken and joined up to produce hybrid chromo- 
somes. Crossovers take place only between regions of chromosomes that 
have nearly identical DNA sequences; for this reason, they usually occur 
between homologous chromosomes and generate hybrid chromosomes 
in which the order of genes is exactly the same as on the original chro- 
mosomes. This process occurs extensively during meiosis, as we see in 
Chapter 19. 


On rare occasions, however, a crossover can occur between a pair of 
short DNA sequences—identical or very similar—that fall on either side of 
a gene. If these short sequences are not aligned properly during recom- 
bination, a lopsided exchange of genetic information can occur. Such 
unequal crossovers can generate one chromosome that has an extra 
copy of the gene and another with no copy (Figure 9-8); this shorter 
chromosome will eventually be lost. 


Once a gene has been duplicated in this way, extra copies of the gene 
can be added by the same mechanism. As a result, entire sets of closely 
related genes, arranged in series, are commonly found in genomes. 


Duplication and Divergence Produced the Globin 
Gene Family 


The evolutionary history of the globin gene family provides a striking 
example of how gene duplication and divergence has generated new 
proteins. The unmistakable similarities in amino acid sequence and 
structure among present-day globin proteins indicate that all the globin 
genes must derive from a single ancestral gene. 


The simplest globin protein has a single polypeptide chain of about 150 
amino acids, and is found in many marine worms, insects, and primi- 
tive fish. Like our hemoglobin, this protein transports oxygen molecules 
throughout the animal's body. The oxygen-carrying protein in the blood of 
adult mammals and most other vertebrates, however, is more complex; 
it is composed of four globin chains of two distinct types—a globin and 
B globin (Figure 9-9). The four oxygen-binding sites in the a2ß2 molecule 
interact, allowing an allosteric change in the molecule as it binds and 
releases oxygen. This structural shift enables the four-chain hemoglobin 
molecule to efficiently take up and release four oxygen molecules in an 
all-or-none fashion, a feat not possible for the single-chain version. Such 
efficiency is particularly important for large multicellular animals, which 
cannot rely on the simple diffusion of oxygen through the body to oxy- 
genate their tissues adequately. 


Figure 9-9 An ancestral globin gene encoding a single-chain 
globin molecule gave rise to the pair of genes that produce 
four-chain hemoglobin proteins of modern humans and other 
mammals. The mammalian hemoglobin molecule is a complex of two 
a-globin (green) and two B-globin (blue) chains. Each chain contains a 
tightly bound heme group (red) that is responsible for binding oxygen. 


The a- and B-globin genes are the result of a gene duplication that 
occurred early in vertebrate evolution. Genome analyses suggest that one 
of our distant ancestors had a single globin gene. But about 500 million 
years ago, a gene duplication followed by an accumulation of differ- 
ent mutations in each gene copy is thought to have given rise to two 
slightly different globin genes, one encoding a globin, the other encoding 
B globin. Still later, as the different mammals began diverging from their 
common ancestor, the B-globin gene underwent its own duplication and 
divergence to give rise to a second f-like globin gene that is expressed 
specifically in the fetus (Figure 9-10). The resulting fetal hemoglobin 
molecule has a higher affinity for oxygen compared with adult hemo- 
globin, a property that helps transfer oxygen from mother to fetus. 


Subsequent rounds of duplication and divergence in both the a- and 
B-globin genes gave rise to additional members of these families. Each of 
these duplicated genes has been modified by point mutations that affect 
the properties of the final hemoglobin molecule, and by changes in regu- 
latory DNA sequences that determine when—and how strongly—each 
gene is expressed. As a result, each globin differs slightly in its ability to 
bind and release oxygen and in the stage of development during which 
it is expressed. 


In addition to these specialized globin genes, there are several duplicated 
DNA sequences in the a- and B-globin gene clusters that are not func- 
tional genes. They are similar in DNA sequence to the functional globin 
genes, but they have been disabled by the accumulation of many inacti- 
vating mutations. The existence of such pseudogenes makes it clear that 
not every DNA duplication leads to a new functional gene. In fact, most 
gene duplication events are unsuccessful in that one copy is gradually 
inactivated by mutation. Although we have focused here on the evolution 
of the globin genes, similar rounds of gene duplication and divergence 
have clearly taken place in many other gene families present in the 
human genome. 


Figure 9—10 Repeated rounds of duplication and mutation 
generated the globin gene family in humans. About 500 million 
years ago, an ancestral globin gene duplicated and gave rise to both 
the B-globin gene family (including the five genes shown) and the 
a-globin gene family. In most vertebrates, a molecule of hemoglobin 
(see Figure 9-9) is formed from two chains of a globin and two chains 
of B globin—which can be any one of the five subtypes of the B family 
listed here. 

The evolutionary scheme shown was worked out by comparing 
globin genes from many different organisms. The nucleotide 
sequences of the yS and y^ genes—which produce the B-globin-like 
chains that form fetal hemoglobin—are much more similar to each 
other than either of them is to the adult B gene. The d-globin gene 
encodes a minor form of adult B-globin. In humans, the B-globin 
genes are located in a cluster on Chromosome 11. 

A subsequent chromosome breakage event, which occurred 
about 300 million years ago, is believed to have separated the 
a- and B-globin genes; the a-globin genes now reside on human 
Chromosome 16 (not shown). 
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Whole-Genome Duplications Have Shaped the 
Evolutionary History of Many Species 


Almost every gene in the genomes of vertebrates exists in multiple ver- 
sions, suggesting that, rather than single genes being duplicated in a 
piecemeal fashion, the whole vertebrate genome was long ago dupli- 
cated in one fell swoop. Early in vertebrate evolution, it appears that 
the entire genome actually underwent duplication twice in succession, 
giving rise to four copies of every gene. In some groups of vertebrates, 
such as the salmon and carp families (including the zebrafish; see Figure 
1-38), there may have been yet another duplication, creating an eightfold 
multiplicity of genes. 


The precise history of whole-genome duplications in vertebrate evolu- 
tion is difficult to chart because many other changes, including the loss 
of genes, have occurred since these ancient evolutionary events. In some 
organisms, however, full genome duplications are especially obvious, as 
they have occurred relatively recently, evolutionarily speaking. The frog 
genus Xenopus, for example, includes closely related species that differ 
dramatically in DNA content: some are diploid—containing two complete 
sets of chromosomes—whereas others are tetraploid or octoploid. Such 
large-scale duplications can happen if cell division fails to occur following 
a round of genome replication in the germ line of a particular individual. 
Once an accidental doubling of the genome occurs in a germ-line cell, it 
will be faithfully passed on to germ-line progeny cells in that individual 
and, ultimately, to any offspring these cells might produce. 


Whole-genome duplications are also common in plants, including many 
of those that we eat. These genome duplications generally make the plant 
easier to cultivate and its fruit more palatable. In some cases, genome 
duplication renders the plant sterile so that it cannot produce seeds; such 
is the case with seedless grapes. Apples, leeks, and potatoes are all tetra- 
ploid, whereas strawberries and sugarcane are octoploid (Figure 9—11). 


Novel Genes Can Be Created by Exon Shuffling 


As we discussed in Chapter 4, many proteins are composed of smaller 
functional domains. In eukaryotes, each of these protein domains is usu- 
ally encoded by a separate exon, which is surrounded by long stretches 
of noncoding introns (see Figures 7-18 and 7-19). This organization of 
eukaryotic genes can facilitate the evolution of new proteins by allow- 
ing exons from one gene to be added to another—a process called 
exon shuffling. 


Such duplication and movement of exons is promoted by the same type of 
recombination that gives rise to gene duplications (see Figure 9-8). In this 
case, recombination occurs within the introns that surround the exons. 


Figure 9-11 Many crop plants 

have undergone whole-genome 
duplication. Many of these duplications, 
which arose spontaneously, were 
propagated by plant breeders because 
they rendered the plants easier to 
cultivate or made their fruits larger, more 
flavorful, or devoid of indigestible seeds. 
N indicates the ploidy of each type of 
plant: for example, wheat and kiwi are 
hexaploid—possessing six complete 
sets of chromosomes (6N). 


apple, potato wheat, kiwi 8N sugarcane, strawberry 


If the introns in question are from two different genes, this recombina- 
tion can generate a hybrid gene that includes complete exons from both. 
The results of such exon shuffling are seen in many present-day pro- 
teins, which contain a patchwork of many different protein domains 
(Figure 9-12). 


It has been proposed that nearly all the proteins encoded by the human 
genome (approximately 19,000) arose from the duplication and shuffling 
of a few thousand distinct exons, each encoding a protein domain of 
approximately 30-50 amino acids. This remarkable idea suggests that the 
great diversity of protein structures is generated from a fairly small uni- 
versal “parts list,” pieced together in different combinations. 


The Evolution of Genomes Has Been Profoundly 
Influenced by Mobile Genetic Elements 


Mobile genetic elements—DNA sequences that can move from one chro- 
mosomal location to another—are an important source of genomic 
change and have profoundly affected the structure of modern genomes. 
These parasitic DNA sequences can colonize a genome and then spread 
within it. In the process, they often disrupt the function or alter the regu- 
lation of existing genes; sometimes they even create novel genes through 
fusions between mobile sequences and segments of existing genes. 


The insertion of a mobile genetic element into the coding sequence of a 
gene or into its regulatory DNA sequence can cause the “spontaneous” 
mutations that are observed in many of today’s organisms. Mobile genetic 
elements can severely disrupt a gene’s activity if they land directly within 
its coding sequence. Such an insertion mutation destroys the gene's 
capacity to encode a useful protein—as is the case for a number of muta- 
tions that cause hemophilia in humans, for example. 


The activity of mobile genetic elements can also change the way existing 
genes are regulated. An insertion of an element into a regulatory DNA 
sequence, for instance, will often have a striking effect on where or when 
genes are expressed (Figure 9-13). Many mobile genetic elements carry 
DNA sequences that are recognized by specific transcription regulators; if 
these elements insert themselves near a gene, that gene can be brought 
under the control of these transcription regulators, thereby changing the 
gene’s expression pattern. Thus, mobile genetic elements can be a major 
source of developmental changes: they have been particularly important 
in the evolution of domesticated plants. For example, the development 
of modern corn from a wild, grassy plant called teosinte required only 
a small number of genetic alterations. One of these changes was the 
insertion of a mobile genetic element upstream of a gene active in seed 
development, which transformed the small, hard seeds of teosinte into 
the plentiful soft kernels of modern corn (Figure 9—14). 
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Figure 9-12 Exon shuffling during 
evolution can generate proteins with 
new combinations of protein domains. 
Each type of colored symbol represents a 
different protein domain. These different 
domains were joined together by exon 
shuffling during evolution to create the 
modern-day human proteins shown here. 
EGF, epidermal growth factor. 


Figure 9-13 Mutation due to a mobile 
genetic element can induce dramatic 
alterations in the body plan of an 
organism. (A) A normal fruit fly (Drosophila 
melanogaster). (B) A mutant fly in which 
the antennae have been replaced by legs 
because of a mutation in a regulatory 
DNA sequence that causes genes for leg 
formation to be activated in the positions 
normally reserved for antennae. Although 
this particular change is not advantageous 
to the fly, it illustrates how the movement 
of a transposable element can produce 

a major change in the appearance of an 
organism. (A, Edward B. Lewis. Courtesy 
of the Archives, California Institute of 
Technology; B, courtesy of Matthew Scott.) 
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Figure 9-14 The insertion of a mobile 
genetic element helped produce modern 
corn. Today's corn plants were originally 
bred from a wild plant called teosinte (A). 
This wild ancestor produced numerous 
ears that contained small, hard seeds. 

(B) Modern corn, by contrast, produces 
fewer cobs—but they contain numerous 
plump, sweet kernels. The insertion of 

a mobile genetic element near a gene 
involved in seed development helped drive 
the change. Here, the two plants are drawn 
to the same scale; for simplicity, the leaves 
are not shown. 


QUESTION 9-2 


Why do you suppose that horizontal 
gene transfer is more prevalent 

in single-celled organisms than in 
multicellular organisms? 
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Finally, mobile genetic elements provide opportunities for genome re- 
arrangements by serving as targets of homologous recombination (see 
Figure 9-8). For example, the duplications that gave rise to the B-globin 
gene cluster are thought to have occurred by crossovers between the 
abundant mobile genetic elements sprinkled throughout the human 
genome. Later in the chapter, we describe these elements in more detail 
and discuss the mechanisms that have allowed them to establish a 
stronghold within our genome. 


Genes Can Be Exchanged Between Organisms by 
Horizontal Gene Transfer 


So far we have considered genetic changes that take place within the 
genome of an individual organism. However, genes and other portions of 
genomes can also be exchanged between individuals of different species. 
This mechanism of horizontal gene transfer is rare among eukaryotes 
but common among bacteria, which can exchange DNA by the process of 
conjugation (Figure 9-15 and Movie 9.1). 


E. coli, for example, has acquired about one-fifth of its genome from other 
bacterial species within the past 100 million years. And such genetic 
exchanges are currently responsible for the rise of new and potentially 
dangerous strains of drug-resistant bacteria. Genes that confer resistance 
to antibiotics are readily transferred from species to species, providing 
the recipient bacterium with an enormous selective advantage in evad- 
ing the antimicrobial compounds that constitute modern medicine's 
frontline attack against bacterial infection. As a result, many antibiot- 
ics are no longer effective against the common bacterial infections for 
which they were originally used; as an example, most strains of Neisseria 
gonorrhoeae, the bacterium that causes gonorrhea, are now resistant to 
penicillin, which is therefore no longer the primary drug used to treat this 
disease. 


Figure 9—15 Bacterial cells can exchange DNA through 
conjugation. Conjugation begins when a donor cell captures a 
recipient cell using a fine appendage called a sex pilus. Following 
capture, DNA moves from the donor cell, through the pilus, into the 
recipient cell. In this cryoelectron micrograph, the sex pilus is clearly 
distinguished from the flagellum. Conjugation is one of several ways 
in which bacteria carry out horizontal gene transfer. (From C.M. 
Oikonomou and G.J. Jensen, Nat. Rev. Microbiol. 14:205-220, 2016. 
With permission from Macmillan Publishers Ltd.) 


RECONSTRUCTING LIFE’S FAMILY TREE 


The nucleotide sequences of present-day genomes provide a record of 
those genetic changes that have survived the test of time. By comparing 
the genomes of a variety of living organisms, we can thus begin to deci- 
pher our evolutionary history and see how our ancestors veered off in 
adventurous new directions that led us to where we are today. 


The most astonishing revelation of such genome comparisons has been 
that homologous genes—those that are similar in nucleotide sequence 
because of their common ancestry—can be recognized across vast evo- 
lutionary distances. Unmistakable homologs of many human genes are 
easy to detect in organisms such as worms, fruit flies, yeasts, and even 
bacteria. Although the lineage that led to the evolution of vertebrates is 
thought to have diverged from the one that led to nematode worms and 
insects more than 600 million years ago, when we compare the genomes 
of the nematode Caenorhabditis elegans and the fruit fly Drosophila mela- 
nogaster with that of Homo sapiens, we find that about 50% of the genes in 
each of these species have clear homologs in one or both of the other two 
species. In other words, clearly recognizable versions of at least half of all 
human genes must have already been present in the common ancestor of 
worms, flies, and humans. 


By tracing such relationships among genes, we can begin to define the 
evolutionary relationships among different species, placing each bacte- 
rium, animal, plant, or fungus in a single vast family tree of life. In this 
section, we discuss how these relationships are determined and what 
they tell us about our genetic heritage. 


Genetic Changes That Provide a Selective Advantage 
Are Likely to Be Preserved 


Evolution is commonly thought of as progressive, but at the molecu- 
lar level the process is random. Consider the fate of a point mutation 
that occurs in a germ-line cell. On rare occasions, the mutation might 
cause a change for the better. But most often it will either have no con- 
sequence or cause serious damage. Mutations of the first type will tend 
to be perpetuated, because the organism that inherits them will have 
an increased likelihood of reproducing itself. Mutations that are deleteri- 
ous will usually be lost. And mutations that are selectively neutral may or 
may not persist, depending on factors such as the size of the population, 
or whether the individual carrying the neutral mutation also harbors a 
favorable mutation located nearby. Through endless repetition of such 
cycles of mutation and natural selection—a molecular form of trial and 
error—organisms gradually evolve. Their genomes change and they 
develop new ways to exploit the environment—to outcompete others and 
to reproduce successfully. 


Clearly, some parts of the genome can accumulate mutations more eas- 
ily than others in the course of evolution. A segment of DNA that does 
not code for protein or RNA and has no significant regulatory role is free 
to change at a rate limited only by the frequency of random mutation. 
In contrast, deleterious alterations in a gene that codes for an essen- 
tial protein or RNA molecule cannot be accommodated so easily: when 
mutations occur, the faulty organism will almost always be eliminated 
or fail to reproduce. Genes of this latter sort are therefore highly con- 
served; that is, the products they encode, whether RNA or protein, are 
very similar from organism to organism. Throughout the 3.5 billion years 
or more of evolutionary history, the most highly conserved genes remain 
perfectly recognizable in all living species. They encode crucial proteins 
such as DNA and RNA polymerases, and they are the ones we turn to 
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QUESTION 9-3 


Highly conserved genes such as 
those for ribosomal RNA are present 
as clearly recognizable relatives in all 
organisms on Earth; thus, they have 
evolved very slowly over time. Were 
such genes “born” perfect? 
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Figure 9-16 Phylogenetic trees display 
the relationships among modern life- 
forms. In this family tree of higher primates, 
humans fall closer to chimpanzees than to 
gorillas or orangutans, as there are fewer 
differences between human and chimp 
DNA sequences than there are between 
those of humans and gorillas, or of humans 
and orangutans. As indicated, the genome 
sequences of each of these four species are 
estimated to differ from the sequence of the 
last common ancestor of higher primates 
by about 1.5%. Because changes occur 
independently in each lineage after two 
species diverge from a common ancestor, 
the genetic differences between any two 
species will be twice as much as the amount 
of change between each of the species 

and the common ancestor. For example, 
although humans and orangutans each 
differ from their common ancestor by about 
1.5% in terms of nucleotide sequence, they 
typically differ from one another by slightly 
more than 3%; human and chimp genomes 
differ by about 1.2%. This phylogenetic tree 
is based solely on nucleotide sequences 

of species alive today, as indicated on the 
left side of the graph; the estimated dates 
of divergence, shown on the right side of 
the graph, are derived from analysis of 

the fossil record. (Modified from F.C. 

Chen and W.H. Li, Am. J. Hum. Genet. 
68:444-456, 2001.) 
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when we wish to trace family relationships among the most distantly 
related organisms in the tree of life. 


Closely Related Organisms Have Genomes That Are 
Similar in Organization as Well as Sequence 


For species that are closely related, it is often most informative to focus 
on selectively neutral mutations. Because they accumulate steadily at a 
rate that is unconstrained by selection pressures, these mutations pro- 
vide a metric for gauging how much modern species have diverged from 
their common ancestor. Such sequence comparisons allow the construc- 
tion of a phylogenetic tree, a diagram that depicts the evolutionary 
relationships among a group of organisms. As an example, Figure 9-16 
presents a phylogenetic tree that lays out the relationships among higher 
primates. 


As indicated in this figure, chimpanzees are our closest living relative 
among the higher primates. Not only do chimpanzees seem to have 
essentially the same set of genes as we do, but their genes are arranged 
in nearly the same way on their chromosomes. The only substantial 
exception is human Chromosome 2, which arose from a fusion of two 
chromosomes that remain separate in the chimpanzee, gorilla, and 
orangutan. Humans and chimpanzees are so closely related that it is pos- 
sible to use DNA sequence comparisons to reconstruct the amino acid 
sequences of proteins that must have been present in the now-extinct, 
common ancestor of the two species (Figure 9-17). 


Even the rearrangement of genomes by crossing over, which we described 
earlier, has produced only minor differences between the human and 
chimp genomes. For example, both the chimp and human genomes con- 
tain a million copies of a type of mobile genetic element called an Alu 
sequence. More than 99% of these elements are in corresponding posi- 
tions in both genomes, indicating that most of the Alu sequences in our 
genome were in place before humans and chimpanzees diverged. 


Functionally Important Genome Regions Show Up as 
Islands of Conserved DNA Sequence 


As we delve back further into our evolutionary history and compare 
our genomes with those of more distant relatives, the picture begins to 
change. The lineages of humans and mice, for example, diverged about 
75 million years ago. These genomes are about the same size, contain 
practically the same genes, and are both riddled with mobile genetic 
elements. However, the mobile genetic elements found in mouse and 
human DNA, although similar in nucleotide sequence, are distributed 
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gorilla AAG 


gorilla ccc 


12) 
human DNA TTICATTCCTGGGCTCCACCCCATCCTGACCTTATCCAAGATGGACCAGACACTGGCAGTC 


chimp DNA TTCATTCCTGGGCTCCACCCTATCCTGACCTTATCCAAGATGGACCAGACACTGGCAGTC 
protein F I P GLHPILTLS KMODOQOTLUA YV 


* 

V 
human DNA TACCAACAGATCCTCACCAGTATGCCTTCCAGAAACGTGATCCAAATATCCAACGACCTG 
chimp DNA TACCAACAGATCCTCACCAGTATGCCTTCCAGAAACATGATCCAAATATCCAACGACCTG 
protein Y Q Q I LTSMPSRNMIQOISNDL 

gorilla ATG 


D 
human DNA GAGAACCTCCGGGATCTTCTTCAGGTGCTGGCCTTCTCTAAGAGCTGCCACTTGCCCTGG 
chimp DNA GAGAACCTCCGGGACCTTCTTCAGGTGCTGGCCTTCTCTAAGAGCTGCCACTTGCCCTGG 
protein E N L RDLLHVLUA FS KS CHLP W 
gorilla GAC 


differently, as they have had more time to proliferate and move around 
the two genomes after these species diverged (Figure 9—18). 


In addition to the movement of mobile genetic elements, the large-scale 
organization of the human and mouse genomes has been scrambled by 
many episodes of chromosome breakage and recombination over the 
past 75 million years: it is estimated that about 180 such “break-and-join” 
events have dramatically altered chromosome organization. For exam- 
ple, in humans most centromeres lie near the middle of the chromosome, 
whereas those of mouse are located at the chromosome ends. 


Regardless of this significant degree of genetic shuffling, one can nev- 
ertheless still recognize many blocks of conserved synteny, regions 
in which corresponding genes are strung together in the same order in 
both species. These genes were neighbors in the ancestral species and, 
despite all the chromosomal upheavals, they remain neighbors in the two 
present-day species. More than 90% of the mouse and human genomes 
can be partitioned into such corresponding regions of conserved syn- 
teny. Within these regions, we can align the DNA of mouse with that 
of humans so that we can compare the nucleotide sequences in detail. 
Such genome-wide sequence comparisons reveal that, in the roughly 
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Figure 9-17 Ancestral gene sequences 
can be reconstructed by comparing 
closely related present-day species. 
Shown here, in five contiguous segments 
of DNA, are the nucleotide sequences 
that encode the mature leptin protein 
from humans and chimpanzees. Leptin 

is a hormone that regulates food intake 
and energy utilization. As indicated by 
the codons boxed in green, only five 
nucleotides differ between the chimp 
and human sequences. Only one of these 
changes (marked with an asterisk) results in 
a change in the amino acid sequence. 

The nucleotide sequence of the last 
common ancestor was probably the same 
as the human and chimp sequences where 
they agree; in the few places where they 
disagree, the gorilla sequence (red) can 
be used as a “tiebreaker,” as the gorilla 
sequence is evolutionarily more distant 
than those of chimp and human (see Figure 
9-16). Thus, the amino acid indicated by the 
asterisk was a methionine in the common 
ancestor of humans and chimpanzees 
and is changed to a valine in the human 
lineage. For convenience, only the first 
300 nucleotides of the coding sequences 
for the mature leptin protein are shown; 
the last 141 nucleotides of that sequence 
are identical between humans and 
chimpanzees. 


Figure 9—18 Differences in the positions 
of mobile genetic elements in the 
human and mouse genomes reflect 

the long evolutionary time separating 
the two species. This stretch of human 
Chromosome 11 (seen also in Figure 9—10) 
contains five functional B-globin-like genes 
(orange); the comparable region from the 
mouse genome contains only four. The 
positions of two types of mobile genetic 
element—Alu sequences (green) and 

L1 sequences (red)—are shown in each 
genome. Although the mobile genetic 
elements in human (circles) and mouse 
(triangles) are not identical, they are closely 
related. The absence of these elements 
within the globin genes can be attributed 
to purifying selection, which would have 
eliminated any insertion that compromised 
gene function. (The mobile genetic element 
that falls inside the human B-globin gene 
(far right) is located within an intron, not 

in a coding sequence.) (Courtesy of Ross 
Hardison and Webb Miller.) 
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GTGCCTATCC. GTCCAGGAT GACACCAAAACCCTCATCAAGACCAT TGTCACCAGGAT CAAT GACATT TCACACACGGTA-GGAGTCTCATGGGGGGACAAA GTAGGACT. 
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Figure 9-19 Accumulated mutations have 
resulted in considerable divergence in the 
nucleotide sequences of the human and 
the mouse genomes. Shown here in two 
contiguous segments of DNA are portions 
of the human and mouse leptin gene 
sequences. Positions where the sequences 
differ by a single nucleotide substitution 

are boxed in green, and positions where 
they differ by the addition or deletion of 
nucleotides are boxed in yellow. Note that 
the coding sequence of the exon is much 
more conserved than the adjacent intron 
sequence. 


75 million years since humans and mice diverged from their common 
ancestor, about 50% of the nucleotides have changed. However, these 
differences are not dispersed evenly across the genome. By observing 
where the human and mouse sequences have remained nearly the same, 
one can thus see very clearly the regions where genetic changes are 
not tolerated (Figure 9-19). These sequences have been conserved by 
purifying selection—that is, by the elimination of individuals carrying 
mutations that interfere with important functions. 


The power of comparative genomics can be further increased by stack- 
ing our genome up against the genomes of additional animals, including 
the rat, chicken, and dog. Such comparisons take advantage of the 
results of the “natural experiment” that has lasted for hundreds of mil- 
lions of years, and they highlight some of the most important regions 
of these genomes. These comparisons reveal that roughly 4.5% of the 
human genome consists of DNA sequences that are highly conserved in 
many other mammals (Figure 9-20). Surprisingly, only about one-third 


human gene: 190,000 nucleotide pairs 


HHHH H H 


intron 


chimpanzee 


exon 


aa 


mammalian conserved intron sequences 


intron 


100% identical 
50% identical 


a egy 
orangutan 
wrr Mania a aa a se 
baboon E 
ie heel ae Alta y 
marmoset j peel jaa 
lemur slat adil ie PTT yare ae 
rabbit 
-= 1 - percent 
horse " 7" Pir Baki ua maT identity 
cot” | E TEP i hit lan i 
dog ' 
mouse 
A ERRET TETT sadad baldaa aak 
chicken i ' i] | | 
; `100% identical 
fish (Fugu) i. al | -50% identical 


10,000 nucleotide pairs 


Figure 9-20 Comparison of nucleotide sequences from many different vertebrates reveals regions of high conservation. The 
nucleotide sequence examined in this diagram is a small segment of the human gene for a plasma membrane transporter protein. 
The upper part of the diagram shows the location of the exons (red) in both the complete gene (top) and in the expanded region of 
the gene. Three blocks of intron sequence that are conserved in mammals are shown in blue. In the lower part of the figure, the DNA 
sequence of the expanded segment of 10,000 nucleotide pairs is aligned with the corresponding sequences of different vertebrates; 
the percent identity with the human sequences for successive stretches of 100 nucleotide pairs is plotted in green, with only identities 
above 50% shown. Note that the sequence of the exon is highly conserved in all the species, including chicken and fish, but the 

three intron sequences that are conserved in mammals are not conserved in chickens or fish. The functions of most conserved intron 
sequences in the human genome (including these three) are not known. (Courtesy of Eric D. Green.) 


of these sequences code for proteins. Some of the conserved noncoding 
sequences correspond to regulatory DNA, whereas others are transcribed 
to produce RNA molecules that are not translated into protein but serve 
a variety of functions (see Chapter 8). The functions of many of these 
conserved noncoding sequences, however, remain unknown. The unex- 
pected discovery of these mysterious conserved DNA sequences suggests 
that we understand much less about the cell biology of mammals than 
we had previously imagined. With the plummeting cost and accelerating 
speed of whole-genome sequencing, we can expect many more surprises 
that will lead to an increased understanding in the years ahead. 


Genome Comparisons Show That Vertebrate Genomes 
Gain and Lose DNA Rapidly 


Going back even further in evolution, we can compare our genome 
with those of more distantly related vertebrates. The lineages of fish and 
mammals diverged about 400 million years ago. This stretch of time is 
long enough for random sequence changes and differing selection pres- 
sures to have obliterated almost every trace of similarity in nucleotide 
sequence—except where purifying selection has operated to prevent 
change. Regions of the genome conserved between humans and fishes 
thus stand out even more strikingly than those conserved between differ- 
ent mammals. In fishes, one can still recognize most of the same genes 
as in humans and even many of the same regulatory DNA sequences. On 
the other hand, the extent of duplication of any given gene is often dif- 
ferent, resulting in different numbers of members of gene families in the 
two species. 


Even more striking is the finding that although all vertebrate genomes 
contain roughly the same number of genes, their overall size varies con- 
siderably. Whereas human, dog, and mouse are all in the same size range 
(around 3 x 10? nucleotide pairs), the chicken genome is only one-third this 
size. An extreme example of genome compression is the pufferfish Fugu 
rubripes (Figure 9—21). The fish's tiny genome is about one-eighth the size 
of mammalian genomes, largely because of the small size of its intergenic 
regions, which are missing nearly all of the repetitive DNA that makes up 
a large portion of most mammalian genomes. The Fugu introns are also 
short in comparison to human introns. Nonetheless, the positions of most 
Fugu introns are perfectly conserved when compared with their positions in 
the genomes of mammals. Clearly, the intron structure of most vertebrate 
genes was already in place in the common ancestor of fish and mammals. 


What factors could be responsible for the size differences among mod- 
ern vertebrate genomes? Detailed comparisons of many genomes have 
led to the unexpected finding that small blocks of sequence are being 
lost from and added to genomes at a surprisingly rapid rate. It seems 
likely, for example, that the Fugu genome is so tiny because it lost DNA 
sequences faster than it gained them. Over long periods, this imbalance 
apparently cleared out those DNA sequences whose loss could be toler- 
ated. This “cleansing” process has been enormously helpful to biologists: 
by “trimming the fat” from the Fugu genome, evolution has provided a 
conveniently slimmed-down version of a vertebrate genome in which 
the only DNA sequences that remain are those that are very likely to have 
important functions. 


Sequence Conservation Allows Us to Trace Even the 
Most Distant Evolutionary Relationships 


As we go back further still to the genomes of our even more distant 
relatives—beyond apes, mice, fish, flies, worms, plants, and yeasts, all 
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Figure 9—21 The pufferfish, Fugu 
rubripes, has a remarkably compact 
genome. At 400 million nucleotide pairs, 
the Fugu genome is only one-quarter the 
size of the zebrafish genome, even though 
the two species have nearly the same genes. 
(From a woodcut by Hiroshige, courtesy of 
Arts and Designs of Japan.) 
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GTTCCGGGGGGAGTATGGT TGCAAAGCTGAAACTTAAAGGAAT TGACGGAAGGGCACCACCAGGAGTGGAGCCTGCGGCTTAATTTGACTCAACACGGGAAACCTCACCC human 


GCCGCCTGGGGAGTACGGTCGCAAGACTGAAACTTAAAGGAATTGGCGGGGGAGCACTACAACGGGTGGAGCCTGCGGTTTAATTGGATTCAACGCCGGGCATCTTACCA Methanococcus 


ACCGCCTGGGGAGTACGGCCGCAAGGT TAAAACTCAAATGAATTGACGGGGGCCCGC e ACAAGCGGTGGAGCATGTGGTTTAATTCGATGCAACGCGAAGAACCTTACCT £. coli 


GTTCCGGGGGGAGTATGGT TGCAAAGC TGAAACTTAAAGGAATTGACGGAAGGGCACCACCAGGAGTGGAGCCTGCGGCTTAATTTGACTCAACACGGGAAACCTCACCC human 


Figure 9-22 Some genetic information 
has been conserved since the beginnings 
of life. A part of the gene for the small 
ribosomal subunit rRNA (see Figure 

7-35) is shown. Corresponding segments 
of nucleotide sequence from this 

gene in three distantly related species 
(Methanococcus jannaschii and Escherichia 
coli, both prokaryotes, and Homo sapiens, 

a eukaryote) are aligned in parallel. Sites 
where the nucleotides are identical between 
any two species are indicated by green 
shading; the human sequence is repeated 
at the bottom of the alignment so that all 
three two-way comparisons can be seen. 
The red dot halfway along the E. coli 
sequence denotes a site where a nucleotide 
has been either deleted from the bacterial 
lineage in the course of evolution or 
inserted in the other two lineages. Note that 
the three sequences have all diverged from 
one another to a roughly similar extent, 
while still retaining unmistakable similarities. 
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the way to bacteria—we find fewer and fewer resemblances to our own 
genome. Yet even across this enormous evolutionary divide, purifying 
selection has maintained a few hundred fundamentally important genes. 
By comparing the sequences of these genes in different organisms and 
seeing how far they have diverged, we can attempt to construct a phy- 
logenetic tree that goes all the way back to the ultimate ancestors—the 
cells at the very origins of life, from which we all derive. 


To construct such a tree, biologists have focused on one particular gene 
that is conserved in all living species: the gene that codes for the ribo- 
somal RNA (rRNA) of the small ribosomal subunit (shown schematically 
in Figure 7-35). Because the process of translation is fundamental to all 
living cells, this component of the ribosome has been highly conserved 
since early in the history of life on Earth (Figure 9-22). 


By applying the same principles used to construct the primate family tree 
(see Figure 9-16), the small-subunit rRNA nucleotide sequences have 
been used to create a single, all-encompassing tree of life. Although many 
aspects of this phylogenetic tree were anticipated by classical taxonomy 
(which is based on the outward appearance of organisms), there were 
also many surprises. Perhaps the most important was the realization 
that some of the organisms that were traditionally classed as “bacteria” 
are as widely divergent in their evolutionary origins as is any prokary- 
ote from any eukaryote. As discussed in Chapter 1, it is now apparent 
that the prokaryotes comprise two distinct groups—the bacteria and the 
archaea—that diverged early in the history of life on Earth. The living 
world therefore has three major divisions or domains: bacteria, archaea, 
and eukaryotes (Figure 9-23). 


Although we humans have been classifying the visible world since antig- 
uity, we now realize that most of life’s genetic diversity lies in the world 
of microscopic organisms. These microbes have tended to go unnoticed, 
unless they cause disease or rot the timbers of our houses. Yet they make 
up most of the total mass of living matter on our planet. Many of these 
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Figure 9-23 The tree of life has three major divisions. Each branch on the tree is labeled with the name of a representative member 
of that group, and the length of each branch corresponds to the degree of difference in the DNA sequences that encode their small- 
subunit rRNAs (see Figure 9-22). Note that all the organisms we can see with the unaided eye—animals, plants, and some fungi 
(highlighted in yellowM—represent only a small subset of the diversity of life. 
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organisms cannot be grown under laboratory conditions. Thus it is only 
through the analysis of DNA sequences, obtained from around the globe, 
that we are beginning to obtain a more detailed understanding of all life 
on Earth—knowledge that is less distorted by our biased perspective as 
large animals living on dry land. 


MOBILE GENETIC ELEMENTS AND VIRUSES 


The tree of life depicted in Figure 9-23 includes representatives from life's 
most distant branches, from the cyanobacteria that release oxygen into 
Earth’s atmosphere to the animals, like us, that use that oxygen to boost 
their metabolism. What the diagram does not encompass, however, 
are the parasitic genetic elements that operate on the outskirts of life. 
Although these elements are built from the same nucleic acids contained 
in all life-forms and can multiply and move from place to place, they do 
not cross the threshold of actually being alive. Yet because of their preva- 
lence and their penchant for propagating themselves, these diminutive 
genetic parasites have major implications for the evolution of species 
and for human health. 


We briefly discussed these mobile genetic elements, earlier in the chap- 
ter, and here we consider them in greater detail. Known informally as 
jumping genes, mobile genetic elements are found in virtually all cells. 
Their DNA sequences make up almost half of the human genome. 
Although they can insert themselves into virtually any region of the 
genome, most mobile genetic elements lack the ability to leave the cell in 
which they reside. This is not the case for their relatives, the viruses. Not 
much more than strings of genes wrapped in a protective coat, viruses 
can escape from one cell and infect another. 


In this section, we discuss mobile genetic elements and viruses. We 
review their structure and outline how they operate—and we consider 
the effects they have on gene expression, genome evolution, and the 
transmission of disease. 


Mobile Genetic Elements Encode the Components They 
Need for Movement 


Mobile genetic elements, also called transposons, are typically classi- 
fied according to the mechanism by which they move or transpose. In 
bacteria, the most common mobile genetic elements are the DNA-only 
transposons. The name is derived from the fact that the element moves 
from one place to another as a piece of DNA, as opposed to being con- 
verted into an RNA intermediate—which is the case for another type 
of mobile element we discuss shortly. Bacteria contain many different 
DNA-only transposons. Some move to the target site using a simple cut- 
and-paste mechanism, whereby the element is simply excised from the 
genome and inserted into a different site. Other DNA-only transposons 
replicate before transposing; in this case, the new copy of the transposon 
inserts into a second chromosomal site, while the original copy remains 
intact at its previous location (Figure 9—24). 


Each mobile genetic element typically encodes a specialized enzyme, 
called a transposase, that mediates its movement. These enzymes rec- 
ognize and act on unique DNA sequences that are present on the mobile 
genetic elements that code for the transposase. Many mobile genetic 
elements also harbor additional genes: some mobile genetic elements, 
for example, carry antibiotic-resistance genes, which have contributed 
greatly to the widespread dissemination of antibiotic resistance in bacte- 
rial populations (Figure 9-25). 
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Figure 9-24 The most common mobile 
genetic elements in bacteria, DNA- 

only transposons, move by two types 

of mechanism. (A) In cut-and-paste 
transposition, the element is cut out of the 
donor DNA and inserted into the target 
DNA, leaving behind a broken donor 

DNA molecule, which is subsequently 
repaired. (B) In replicative transposition, the 
mobile genetic element is copied by DNA 
replication. The donor molecule remains 
unchanged, and the target molecule 
receives a copy of the mobile genetic 
element. In general, a particular type of 
transposon moves by only one of these 
mechanisms. However, the two mechanisms 
have many enzymatic similarities, and a few 
transposons can move by either mechanism. 
The donor and target DNAs can be part 

of the same DNA molecule or reside on 
different DNA molecules. 


QUESTION 9-4 


Many transposons move within a 
genome by replicative mechanisms 
(such as those shown in Figure 
9—24B). They therefore increase 

in copy number each time they 
transpose. Although individual 
transposition events are rare, many 
transposons are found in multiple 
copies in genomes. What do you 
suppose keeps the transposons 
from completely overrunning their 
hosts’ genomes? 
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In addition to relocating themselves, mobile genetic elements occa- 
sionally rearrange the DNA sequences of the genome in which they are 
embedded. For example, if two mobile genetic elements that are rec- 
ognized by the same transposase integrate into neighboring regions 
of the same chromosome, the DNA between them can be accidentally 
excised and inserted into a different gene or chromosome (Figure 9—26). 
In eukaryotic genomes, such accidental transposition provides a path- 
way for generating novel genes, both by altering gene expression and by 
duplicating existing genes. 


The Human Genome Contains Two Major Families of 
Transposable Sequences 


The sequencing of human genomes has revealed many surprises, as we 
describe in detail in the next section. But one of the most stunning was 
the finding that a large part of our DNA is not entirely our own. Nearly 
half of the human genome is made up of mobile genetic elements, which 
number in the millions. Some of these elements have moved from place 
to place within the human genome using the cut-and-paste mechanism 
discussed earlier (see Figure 9-24A). However, most have moved not as 
DNA, but via an RNA intermediate. These retrotransposons appear to be 
unique to eukaryotes. 


One abundant human retrotransposon, the L1 element (sometimes 
referred to as LINE-1, a long interspersed nuclear element), is transcribed 
into RNA by a host cell’s RNA polymerase. A double-stranded DNA copy 
of this RNA is then made using an enzyme called reverse transcriptase, 
an unusual DNA polymerase that can use RNA as a template. The reverse 
transcriptase is encoded by the L1 element itself. The DNA copy of 
the element is then free to reintegrate into another site in the genome 
(Figure 9-27). 


L1 elements constitute about 15% of the human genome. Although 
most copies have been immobilized by the accumulation of deleterious 


Figure 9—25 Transposons contain the components they need 

for transposition. Shown here are two types of bacterial DNA-only 
transposons. Each carries a gene that encodes a transposase (blue and 
red)—the enzyme that catalyzes the element's movement—as well as 
DNA sequences (red) that are recognized by each transposase. 

Some transposons carry additional genes (yellow) that encode 
enzymes that inactivate antibiotics such as ampicillin (AmpR). The 
spread of these transposons is a serious problem in medicine, as it 
has allowed many disease-causing bacteria to become resistant to 
antibiotics developed during the twentieth century. 
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mutations, a few still retain the ability to transpose. Their movement can 
sometimes precipitate disease: for example, movement in the germline of 
an L1 element into the gene that encodes Factor VIII—a protein essential 
for proper blood clotting—caused hemophilia in a child with no family 
history of the disease. 


Another type of retrotransposon, the Alu sequence, is present in about 
1 million copies, making up about 10% of our genome. Alu elements do 
not encode their own reverse transcriptase and thus depend on enzymes 
already present in the cell to help them move. 


Comparisons of the sequence and locations of the L1 and Alu elements 
in different mammals suggest that these sequences have proliferated 
in primates relatively recently in evolutionary history (see Figure 9-18). 
Given that the placement of mobile genetic elements can have profound 
effects on gene expression, it is humbling to contemplate how many 
of our uniquely human qualities we might owe to these prolific genetic 
parasites. 


Viruses Can Move Between Cells and Organisms 


Viruses are also mobile, but unlike the transposons we have discussed 
so far, they can actually escape from cells and move to other cells and 
organisms. Viruses were first categorized as disease-causing agents that, 
by virtue of their tiny size, passed through ultrafine filters that can hold 
back even the smallest bacterial cell. We now know that viruses are 
essentially small genomes enclosed by a protective protein coat, and that 
they must enter a cell and coopt its molecular machinery to express their 
genes, make their proteins, and reproduce. Although the first viruses that 
were discovered attack mammalian cells, it is now recognized that many 
types of viruses exist, and virtually all organisms—including plants, ani- 
mals, and bacteria—can serve as viral hosts. 


Viral reproduction is often lethal to the host cells; in many cases, the 
infected cell breaks open (lyses), releasing progeny viruses, which can 
then infect neighboring cells. Many of the symptoms of viral infections 
reflect this lytic effect of the virus. The cold sores formed by herpes sim- 
plex virus and the blisters caused by the chickenpox virus, for example, 
reflect the localized killing of human skin cells. 


Figure 9-26 Mobile genetic elements can 
move exons from one gene to another. 
When two mobile genetic elements of the 
same type (red) happen to insert near each 
other in a chromosome, the transposition 
mechanism occasionally recognizes the 
ends of two different elements (instead 

of the two ends of the same element). As 

a result, the chromosomal DNA that lies 
between the mobile genetic elements gets 
excised and moved to a new site. Such 
inadvertent transposition of chromosomal 
DNA can either generate novel genes, as 
shown, or alter gene regulation (not shown). 
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Figure 9-27 Retrotransposons move via 
an RNA intermediate. These transposable 
elements are first transcribed into an 

RNA intermediate (not shown). Next, a 
double-stranded DNA copy of this RNA 

is synthesized by the enzyme reverse 
transcriptase. This DNA copy is then 
inserted into the target location, which 

can be on either the same or a different 
DNA molecule. The donor retrotransposon 
remains at its original location, so each 
time it transposes, it duplicates itself. 
These mobile genetic elements are called 
retrotransposons because at one stage in 
their transposition their genetic information 
flows backward, from RNA to DNA. 
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QUESTION 9-5 


Discuss the following statement: 
“Viruses exist in the twilight zone 

of life: outside cells they are simply 
dead assemblies of molecules; inside 
cells, however, they are alive.” 
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TABLE 9-1 VIRUSES THAT CAUSE HUMAN DISEASE 


Herpes simplex virus 
Epstein-Barr virus (EBV) 
Varicella-zoster virus 
Smallpox virus 
Hepatitis B virus 
Human immunodeficiency 
virus (HIV) 

Influenza virus type A 
Poliovirus 

Rhinovirus 

Hepatitis A virus 


Hepatitis C virus 


Yellow fever virus 
Rabies virus 
Mumps virus 


Measles virus 


double-stranded DNA 
double-stranded DNA 
double-stranded DNA 
double-stranded DNA 


part single-, part 
double-stranded DNA 


single-stranded RNA 


single-stranded RNA 
single-stranded RNA 


single-stranded RNA 
single-stranded RNA 
single-stranded RNA 


single-stranded RNA 
single-stranded RNA 
single-stranded RNA 
single-stranded RNA 


Disease 


recurrent cold sores 
infectious mononucleosis 
chickenpox and shingles 
smallpox 

serum hepatitis 
acquired immune 
deficiency syndrome 
(AIDS) 

respiratory disease (flu) 
poliomyelitis 

common cold 

infectious hepatitis 


non-A, non-B type 
hepatitis 


yellow fever 
rabies encephalitis 
mumps 


measles 


Most viruses that cause human disease have genomes made of either 
double-stranded DNA or single-stranded RNA (Table 9-1). However, 
viral genomes composed of single-stranded DNA and of double-stranded 
RNA are also known. The simplest viruses found in nature have a small 
genome, composed of as few as three genes, enclosed by a protein coat 
built from many copies of a single polypeptide chain. More complex 
viruses have larger genomes of up to several hundred genes, surrounded 
by an elaborate shell composed of many different proteins (Figure 9-28). 
The amount of genetic material that can be packaged inside a viral pro- 
tein shell is limited. Because these shells are too small to encase the 
genes needed to encode the many enzymes and other proteins that are 
required to replicate even the simplest virus, viruses must hijack their 
host’s biochemical machinery to reproduce themselves (Figure 9-29). A 
viral genome will typically encode both viral coat proteins and proteins 
that help the virus to commandeer the host enzymes needed to replicate 
its genetic material. 


Retroviruses Reverse the Normal Flow of Genetic 
Information 


Although there are many similarities between bacterial and eukaryotic 
viruses, one important class of viruses—the retroviruses—is found only 
in eukaryotic cells. In many respects, retroviruses resemble the retro- 
transposons we just discussed. A key feature of the replication cycle of 
both is a step in which DNA is synthesized using RNA as a template— 
hence the prefix retro, which refers to the reversal of the usual flow of 
information from DNA to RNA. Retroviruses are thought to have derived 
from a retrotransposon that long ago acquired additional genes encoding 
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double-stranded DNA Figure 9-28 Viruses come in different 


shapes and sizes. Some of the viruses 
are shown in cross section (such as 

t poxvirus and HIV). For others, the outer 
structure is emphasized. Some viruses 
(such as papilloma and polio) contain an 
outer surface that is composed solely 
of viral-encoded proteins. Others (such 
as poxvirus and HIV) bear a lipid-bilayer 


envelope (gray) in which viral-encoded 
proteins are embedded. 
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the coat proteins and other proteins required to make a virus particle. 
The RNA stage of its replicative cycle could then be packaged into a viral 
particle that could leave the cell. 


Like retrotransposons, retroviruses use the enzyme reverse transcriptase virus Sr gn 
to convert RNA into DNA. The enzyme is encoded by the retroviral ee~ coat protein 
genome, and a few molecules of the enzyme are packaged along with ENTRY OF DNA INTO CELL 
the RNA genome in each virus particle. When the single-stranded RNA t 
genome of the retrovirus enters a cell, the reverse transcriptase brought fa is 
in with it makes a complementary DNA strand to form a DNA/RNA hybrid (ona 
double helix. The RNA strand is removed, and the reverse transcriptase 
(which can use either DNA or RNA as a template) now synthesizes a Veni econ 
complementary DNA strand to produce a DNA double helix. This DNA l 
is then inserted, or integrated, into a randomly selected site in the host 

DNA 


genome by a virally encoded integrase enzyme. In this integrated state, 
the virus is Jatent: each time the host cell divides, it passes on a copy of 
the integrated viral genome, which is known as a provirus, to its progeny 
cells. 


TRANSCRIPTION 


mRNA 


The next step in the replication of a retrovirus—which can take place 
long after its integration into the host genome—is the copying of the 
integrated viral DNA into RNA by a host-cell RNA polymerase, which pro- 
duces large numbers of single-stranded RNAs identical to the original 
infecting genome. These viral RNAs are then translated by the host-cell 
ribosomes to produce the viral shell proteins, the envelope proteins, and 
reverse transcriptase—all of which are assembled with the RNA genome 
into new virus particles. The steps involved in the integration and replica- 
tion of a retrovirus are shown in Figure 9-30. 
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Figure 9-29 Viruses commandeer the host cell's molecular 
machinery to reproduce. The hypothetical virus illustrated here 
consists of a small, double-stranded DNA molecule that encodes just a 


single type of viral coat protein. To reproduce, the viral genome must EXIT 

first enter a host cell, where it is replicated to produce multiple copies, FROM CELL 
which are transcribed and translated to produce the viral coat protein. e, 

The viral genomes can then assemble spontaneously with the coat iS) set 
protein to form new virus particles, which escape from the cell iS: iS: 


by lysing it. ia 
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Figure 9-30 Infection by a retrovirus includes reverse transcription and integration of the viral genome into 
the host cell's DNA. The retrovirus genome consists of an RNA molecule (blue) that is typically between 7000 and 
12,000 nucleotides in size. It is packaged inside a protein coat, which is surrounded by a lipid-bilayer envelope that 
contains virus-encoded envelope proteins (green). The enzyme reverse transcriptase (red circle), encoded by the viral 
genome and packaged with its RNA, first makes a single-stranded DNA copy of the viral RNA molecule and then a 
second DNA strand, generating a double-stranded DNA copy of the RNA genome. This DNA double helix is then 
integrated into a host chromosome, a step required for the synthesis of new viral RNA molecules by a host-cell RNA 
polymerase. 


The human immunodeficiency virus (HIV), which is the cause of AIDS, is 
a retrovirus. As with other retroviruses, the HIV genome can persist in a 
latent state as a provirus embedded in the chromosomes of an infected 
cell. This ability to hide in host cells complicates attempts to treat the 
infection with antiviral drugs. But because the HIV reverse transcriptase 
is not used by cells for any purpose of their own, it is one of the prime 
targets of drugs currently used to treat AIDS. 


EXAMINING THE HUMAN GENOME 


The human genome contains an enormous amount of information about 
who we are and where we came from (Figure 9-31). Its 3.2 x 10? nucleo- 
tide pairs, spread out over 23 sets of chromosomes—22 autosomes and 
a pair of sex chromosomes (X and Y)—provide the instructions needed 
to build a human being. Yet, 25 years ago, biologists actively debated the 
value of determining the human genome sequence—the complete list of 
nucleotides contained in our chromosomes. 


Figure 9-31 The 3 billion nucleotide pairs of the human genome 
contain a vast amount of information, including clues about our 
origins. If each nucleotide pair is drawn to span 1 mm, as shown in 
(A), the human genome would extend 3200 km (approximately 2000 
miles)—far enough to stretch across central Africa, where humans 
first arose (red line in B). At this scale, there would be, on average, a 
protein-coding gene every 150 m. An average gene would extend for 
about 30 m, but the coding sequences (exons) in this gene would add 
up to only just over a meter; the rest would be introns. 


The task was not simple. An international consortium of investiga- 
tors labored tirelessly for the better part of a decade—and spent nearly 
$3 billion—to give us our first glimpse of this genetic blueprint. But the 
effort turned out to be well worth the cost, as the data continue to shape 
our thinking about how our genome functions and how it has evolved. 


The first human genome sequence was just the beginning. The spec- 
tacular improvements in sequencing technologies (which we discuss 
in Chapter 10), coupled with powerful new tools for handling massive 
amounts of data, are taking genomics to a whole new level. The cost 
of DNA sequencing has dropped enormously since the Human Genome 
Project was launched in 1990, such that a whole human genome can now 
be sequenced in a few days for about $1000. Investigators around the 
world are collaborating to collect and compare the nucleotide sequences 
of thousands of human genomes. This resulting deluge of data offers tan- 
talizing clues as to what makes us human, and what makes each of us 
unique. 


Although it will take many years to analyze the rapidly accumulating 
genome data, the recent findings have already influenced the content of 
every chapter in this book. In this section, we describe some of the most 
striking features of the human genome—many of which were entirely 
unexpected. We review what genome comparisons can tell us about how 
we evolved, and we discuss some of the mysteries that still remain. 


The Nucleotide Sequences of Human Genomes Show 
How Our Genes Are Arranged 

When the DNA sequence of human Chromosome 22, one of the smallest 
human chromosomes, was completed in 1999, it became possible for 


the first time to see exactly how genes are arranged along an entire ver- 
tebrate chromosome (Figure 9-32). The subsequent publication of the 


(A) Human Chromosome 22 in its mitotic conformation, 
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QUESTION 9-6 


Mobile genetic elements, such as 
the Alu sequences, are found in 
many copies in human DNA. In what 
ways could the presence of an Alu 
sequence affect a nearby gene? 


composed of two double-stranded DNA molecules, each 48 x 10° nucleotide pairs long 
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Figure 9-32 The sequence of Chromosome 22 shows how human chromosomes are organized. (A) Chromosome 22, one of the 
smallest human chromosomes, contains 48 x 10° nucleotide pairs and makes up approximately 1.5% of the human genome. Most of 
the short arm of Chromosome 22 consists of short repeated sequences of DNA that are packaged in a particularly compact form of 
chromatin (heterochromatin), as discussed in Chapter 5. (B) A tenfold expansion of a portion of Chromosome 22 shows about 40 genes. 
Those in dark brown are known genes, and those in red are predicted genes. (C) An expanded portion of (B) shows the entire length 

of several genes. (D) The intron-exon arrangement of a typical gene is shown after a further tenfold expansion. Each exon (red) codes 
for a portion of the protein, while the DNA sequence of the introns (yellow) is relatively unimportant. (Adapted from The International 


Human Genome Sequencing Consortium, Nature 409:860-921, 2001.) 
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Figure 9-33 The bulk of the human 
genome is made of repetitive nucleotide 
sequences and other noncoding DNA. 
About half of our genome consists of 
repeated sequences. These include 
the LINEs (long interspersed nuclear 
elements, such as L1), SINEs (short 
interspersed nuclear elements, such as 
Alu), other retrotransposons, and DNA-only 
transposons—mobile genetic elements 
that have multiplied in our genome by 
replicating themselves and inserting the 
new copies in different positions. Most 
of these mobile genetic elements are 
fossils—remnants that are no longer 
capable of transposition. Simple repeats 
are short nucleotide sequences (less than 
14 nucleotide pairs) that are repeated again 
and again for long stretches. Segment 
duplications are large blocks of the genome 
(1000-200,000 nucleotide pairs) that are 
present at two or more locations in the 
genome. These, too, represent repeated 
DNA sequences. The most highly repeated 
blocks of DNA in heterochromatin have 
not yet been completely sequenced; 
these comprise about 10% of human DNA 
sequences and are not represented in this 
diagram. 

The unique sequences that are not part 
of any introns or exons (dark green) include 
regulatory DNA sequences, sequences that 
code for functional RNA, and sequences 
whose functions are not known. (Data 
courtesy of E.H. Margulies.) 
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TABLE 9-2 SOME VITAL STATISTICS FOR THE HUMAN GENOME 
DNA Length 3.2 x 10? Nucleotide Pairs* 


approximately 19,000 


Number of protein-coding genes 
Number of non-protein-coding genes** approximately 5000 
Largest gene 2.4 x 10° nucleotide pairs 


Mean gene size 27,000 nucleotide pairs 


Smallest number of exons per gene 1 
Largest number of exons per gene 178 
Mean number of exons per gene 10.4 


Largest exon size 17,106 nucleotide pairs 


Mean exon size 145 nucleotide pairs 


kkk 


Number of pseudogenes approximately 11,000 


Percentage of DNA sequence in exons 1.5% 
(protein-coding sequences) 


Percentage of DNA conserved with other 3.0% 
mammals that does not encode protein**** 


Percentage of DNA in high-copy repetitive 


approximately 50% 
elements 


*The sequence of 2.85 billion nucleotide pairs is known precisely (error rate of 
only about one in 100,000 nucleotides). The remaining DNA consists primarily 
of short, highly repeated sequences that are tandemly repeated, with repeat 
numbers differing from one individual to the next. 


**These include genes that encode structural, catalytic, and regulatory RNAs. 


***A pseudogene is a DNA sequence that closely resembles that of a functional 
gene but contains numerous mutations that prevent its proper expression. Most 
pseudogenes arise from the duplication of a functional gene, followed by the 
accumulation of damaging mutations in one copy. 


****This includes DNA encoding 5’ and 3’ UTRs (untranslated regions of mRNAs), 
regulatory DNA sequences, and conserved regions of unknown function. 


whole human genome sequence—a first draft in 2001 and a finished draft 
in 2004—provided a more panoramic view of the complete genetic land- 
scape, including how many genes we have, what those genes look like, 
and how they are distributed across the genome (Table 9-2). 


The first striking feature of the human genome is how little of it—less than 
2%—codes for proteins (Figure 9-33). In addition, almost half of our DNA 
is made up of mobile genetic elements that have colonized our genome 
over evolutionary time. Because these elements have accumulated 
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mutations, most can no longer move; rather, they are relics from an ear- 
lier evolutionary era when mobile genetic elements ran rampant through 
our genome. 


It was a surprise to discover how few protein-coding genes our genome 
actually contains. Earlier estimates had been in the neighborhood of 
100,000 (as discussed in How We Know, pp. 324-325). Although the exact 
count is still being refined, current estimates place the number of human 
protein-coding genes at about 19,000, with perhaps another 5000 genes 
encoding functional RNAs that are not translated into proteins. This esti- 
mate brings us much closer to the gene numbers for simpler multicellular 
animals—for example, 14,000 protein-coding genes for Drosophila, 22,000 
for C. elegans, and 28,000 for the small weed Arabidopsis (see Table 1-2). 


The number of protein-coding genes we have may be unexpectedly small, 
but their relative size is unusually large. Only about 1300 nucleotide pairs 
are needed to encode an average-sized human protein of about 430 
amino acids. Yet the average length of a human gene is 27,000 nucleotide 
pairs. Most of this DNA is in noncoding introns. In addition to the volumi- 
nous introns (see Figure 9-32D), each gene is associated with regulatory 
DNA sequences that ensure that the gene is expressed at the proper level, 
time, and place. In humans, these regulatory DNA sequences are typically 
interspersed along tens of thousands of nucleotide pairs, much of which 
seems to be “spacer” DNA. Indeed, compared to many other eukaryotic 
genomes, the human genome is much less densely packed (Figure 9-34). 


Although exons and their associated regulatory DNA sequences com- 
prise less than 2% of the human genome, comparative studies indicate 
that about 4.5% of the human genome is highly conserved when com- 
pared with other mammalian genomes (see Figure 9-20). An additional 
5% of the genome shows reduced variation in the human population, 
as determined by comparing the DNA sequence of thousands of indi- 
viduals. This reduced variation reflects the relative importance of these 
sequences compared with the majority of the genome. Taken together, 
such analyses suggest that only about 10% of the human genome con- 
tains sequences that truly matter—but we do not yet know the function 
of much of this DNA. 


Differences in Gene Regulation May Help Explain How 
Animals with Similar Genomes Can Be So Different 


We now have the complete genome sequences for many different mam- 
mals, including humans, chimpanzees, gorillas, orangutans, dogs, cats, 
and mice. All of these species contain essentially the same protein-coding 
genes, which raises a fundamental question: What makes these creatures 
so different from one another? And what makes humans different from 
other animals? 


10,000 nucleotide pairs 


Figure 9-34 Genes are sparsely 
distributed in the human genome. 
Compared to some other eukaryotic 
genomes, the human genome is less gene- 
dense. Shown here are DNA segments 
about 50,000 nucleotide pairs in length 
from bakers yeast, Drosophila, and human. 
The human segment contains only 4 genes, 
compared to 26 in the yeast and 11 in the 
fly. Exons are shown in orange, introns in 
yellow, repetitive elements in blue, and 
intergenic DNA in gray. The genes of yeast 
and flies are generally more compact, with 
fewer introns, than the genes of humans. 


= HOW WE KNOW 
COUNTING GENES 


How many genes does it take to make a human? It seems 
a natural thing to wonder. If about 6000 genes can pro- 
duce a yeast and 14,000 a fly, how many are needed to 
make a human being—a creature curious and clever 
enough to study its own genome? Until researchers com- 
pleted the first draft of the human genome sequence, the 
most frequently cited estimate was 100,000. But where 
did that figure come from? And how was the revised 
estimate of only 19,000 protein-coding genes derived? 


Walter Gilbert, a physicist-turned-biologist who won a 
Nobel Prize for developing techniques for sequencing 
DNA, was one of the first to throw out a ballpark esti- 
mate of the number of human genes. In the mid-1980s, 
Gilbert suggested that humans could have 100,000 
genes, an estimate based on the average size of the few 
human genes known at the time (about 3 x 104 nucleo- 
tide pairs) and the size of our genome (about 3 x 10? 
nucleotide pairs). This back-of-the-envelope calculation 
yielded a number with such a pleasing roundness that it 
wound up being quoted widely in articles and textbooks. 


The calculation provides an estimate of the number 
of genes a human could have in principle, but it does 
not address the question of how many genes we actu- 
ally have. As it turns out, that question is not so easy 
to answer, even with the complete human genome 
sequence in hand. The problem is, how does one identify 
a gene? Consider protein-coding genes, which comprise 
only 1.5% of the human genome. Looking at a given piece 
of raw DNA sequence—an apparently random string of 
As, Ts, Gs, and Cs—how can one tell which parts repre- 
sent protein-coding segments? Being able to accurately 


Rea pairs at 


non- ia methionine codons 
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and reliably distinguish the rare coding sequences from 
the more plentiful noncoding sequences in a genome is 
necessary before one can hope to locate and count its 
genes. 


Signals and chunks 


As always, the situation is simplest in bacteria and simple 
eukaryotes such as yeasts. In these genomes, genes that 
encode proteins are identified by searching through the 
entire DNA sequence looking for open reading frames 
(ORFs). These are long sequences—say, 100 codons or 
more—that lack stop codons. A random sequence of 
nucleotides will by chance encode a stop codon about 
once every 20 codons (as there are three stop codons 
in the set of 64 possible codons—see Figure 7-27). So 
finding an ORF—a continuous nucleotide sequence that 
encodes more than 100 amino acids—is the first step in 
identifying a good candidate for a protein-coding gene. 
Today, computer programs are used to search for such 
ORFs, which begin with an initiation codon, usually 
ATG, and end with a termination codon, TAA, TAG, or 
TGA (Figure 9-35). 


In animals and plants, the process of identifying ORFs is 
complicated by the presence of large intron sequences, 
which interrupt the protein-coding portions of genes. As 
we have seen, these introns are generally much larger 
than the exons, which might represent only a few percent 
of the gene. In human DNA, exons sometimes contain 
as few as 50 codons (150 nucleotide pairs), while introns 
may exceed 10,000 nucleotide pairs in length. Fifty 
codons is too short to generate a statistically significant 
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Figure 9-35 Computer programs are used to identify protein-coding genes. In this example, a DNA sequence of 
7500 nucleotide pairs from the pathogenic yeast Candida albicans was fed into a computer, which then calculated the proteins that 
could, in theory, be produced from each of its six possible reading frames—three on each of the two strands (see Figure 7-28). The 


output shows the location of start and stop codons for each reading frame. The reading frames are laid out in horizontal columns. Stop, 
or termination, codons (TGA, TAA, and TAG) are represented by tall, vertical black lines, and methionine codons (ATG) are represented 
by shorter black lines. Four open reading frames, or ORFs (shaded yellow), can be clearly identified by the statistically significant 
absence of stop codons. For each ORF, the presumptive initiation codon (ATG) is indicated in red. The additional ATG codons (black) in 
the ORFs code for methionine in the protein. 
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Figure 9-36 RNA sequencing can be used to characterize protein-coding genes. 
Presented here is a set of data corresponding to RNAs produced from a segment of the 
gene for B-actin, which is depicted schematically at the top. Millions of RNA “sequence 
reads,” each approximately 200 nucleotides long, were collected from a variety of cell types 
(right) and matched to DNA sequences within the B-actin gene. The height of each trace is 
proportional to how often each sequence appears in a read. Exon sequences are present at 
high levels, reflecting their presence in mature B-actin mRNAs. Intron sequences are present 
at low levels, most likely reflecting their presence in pre-mRNA molecules that have not yet 
been spliced or spliced introns that have not yet been degraded. 


“ORF signal,” as it is not all that unusual for 50 random 
codons to lack a stop signal. Moreover, introns are so 
long that they are likely to contain by chance quite a bit 
of “ORF noise,” numerous stretches of sequence lacking 
stop signals. Finding the true ORFs in this sea of infor- 
mation in which the noise often outweighs the signal 
can be difficult. To make the task more manageable, 
computers are used to search for other distinctive fea- 
tures that mark the presence of a protein-coding gene. 
These include the splicing sequences that signal an 
intron-exon boundary (see Figure 7-20), regulatory DNA 
sequences, or conservation with coding sequences from 
other organisms. 


In 1992, researchers used a computer program to pre- 
dict protein-coding regions in a preliminary human 
sequence. They found two genes in a 58,000-nucleo- 
tide-pair segment of Chromosome 4, and five genes in 
a 106,000-nucleotide-pair segment of Chromosome 19. 
That works out to an average of 1 gene every 23,000 
nucleotide pairs. Extrapolating from that density to 
the whole genome would give humans nearly 130,000 
genes. It turned out, however, that the chromosomes 
the researchers analyzed had been chosen for sequenc- 
ing precisely because they appeared to be gene-rich. 
When the estimate was adjusted to take into account 
the gene-poor regions of the human genome—guessing 
that half of the human genome had maybe one-tenth of 
that gene-rich density—the estimated number dropped 
to 71,000. 


Matching RNAs 


Of course, these estimates are based on what we think 
genes look like; to get around this bias, we must employ 


more direct, experiment-based methods for locating 
genes. Because genes are transcribed into RNA, the pre- 
ferred strategy for finding genes involves isolating all of 
the RNAs produced by a particular cell type and deter- 
mining their nucleotide sequence—a technique called 
RNA-Seq. These sequences are then mapped back to the 
genome to locate their genes. For protein-coding genes, 
exon segments are more highly represented among the 
sequenced transcripts, as intron sequences tend to be 
spliced out and destroyed. Because different cell types 
express different genes, and splice their RNA transcripts 
differently, a variety of cell types are used in the analysis 
(Figure 9-36). 


Thanks to RNA-Seq, the number of predicted protein- 
coding genes has dropped even further, because the 
technique detects only those genes that are actively 
transcribed. At the same time, the approach also 
allowed the detection of genes that do not code for pro- 
teins, but instead encode functional or regulatory RNAs. 
Many noncoding RNAs were first identified through 
RNA-Seq. 


Human gene countdown 


Based on a combination of all of these computational 
and experimental techniques, current estimates of 
the total number of human genes are now converg- 
ing around 24,000, of which approximately 19,000 are 
protein-coding. It could be many years, however, before 
we have the final answer to how many genes it takes to 
make a human. In the end, having an exact count will 
not be nearly as important as understanding the func- 
tions of each gene and how they interact to build the 
living organism. 
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CHAPTER 9 


How Genes and Genomes Evolve 


The instructions needed to produce a multicellular animal from a ferti- 
lized egg are provided, in large part, by the regulatory DNA sequences 
associated with each gene. These noncoding DNA sequences contain, 
scattered within them, dozens of separate regulatory elements, including 
short DNA segments that serve as binding sites for specific transcription 
regulators (discussed in Chapter 8). Regulatory DNA sequences ulti- 
mately dictate each organism’s developmental program—the rules its 
cells follow as they proliferate, assess their positions in the embryo, and 
specialize by switching on and off specific genes at the right time and 
place. The evolution of species is likely to have more to do with inno- 
vations in regulatory DNA sequences than in the proteins or functional 
RNAs the genes encode. 


Given the importance of regulatory DNA sequences in defining the char- 
acteristics of a species, one place to begin searching for clues to identity is 
in the regulatory DNA sequences that are highly conserved across mam- 
malian species, but are altered or absent in our own genome. One study 
identified more than 500 such sequences, providing some intriguing clues 
as to what makes us human. One of these regulatory DNA sequences, 
missing in humans, seems to suppress the proliferation of neurons in the 
brain. Although further investigation is required, it is possible that the 
loss of this sequence—or changes in other neural-specific regulatory DNA 
sequences—played an instrumental role in the evolution of the human 
brain. 


Another regulatory DNA sequence lost in the human lineage directs the 
formation of penile spines—structures present in a wide variety of mam- 
mals including chimpanzees, bonobos, gorillas, orangutans, gibbons, 
rhesus monkeys, and bushbabies. Whether the loss of these structures 
provides some advantage to humans is not known; it could be that the 
change is neutral—neither advantageous nor harmful. Regardless, it is a 
characteristic that makes us unique. 


Thanks to such genetic comparisons, we are beginning to unravel the 
secrets of how our genome evolved to produce the qualities that define us 
as a species. But these analyses can only provide information about our 
distant evolutionary past. To learn about the more recent events in the 
history of modern Homo sapiens, we are turning to the genomes of our 
closest extinct relations, as we see next. 


The Genome of Extinct Neanderthals Reveals Much 
about What Makes Us Human 


In 2010, investigators completed their analysis of the first Neanderthal 
genome. One of our closest evolutionary relatives, Neanderthals lived 
side-by-side with the ancestors of modern humans in Europe and Western 
Asia. By comparing the Neanderthal genome sequence—obtained from 
DNA that was extracted from a fossilized bone fragment found in a 
cave in Croatia—with those of people from different parts of the world, 
researchers identified a handful of genomic regions that have undergone 
a sudden spurt of changes in modern humans. These regions include 
genes involved in metabolism, brain development, the voice box, and 
the shape of the skeleton, particularly the rib cage and brow—all features 
thought to differ between modern humans and our extinct cousins. 


Remarkably, these studies also revealed that many modern humans— 
particularly those that hail from Europe and Asia—share about 2% of 
their genomes with Neanderthals. This genetic overlap indicates that our 
ancestors mated with Neanderthals—before outcompeting or actively 
exterminating them—on the way out of Africa (Figure 9-37). This ancient 
relationship left a permanent mark in the human genome. 
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Figure 9-37 Ancestral humans encountered Neanderthals on their way out of Africa. Modern humans descended 
from a relatively small population—perhaps as few as 10,000 individuals—that existed in Africa approximately 200,000 (200 
K) years ago. Among that small group of ancestors, some migrated northward, and their descendants spread across the 
globe. As ancestral humans left Africa, around 130,000 years ago (purple arrows), they encountered Neanderthals who 
inhabited the region indicated in light blue. As a result of interbreeding (in the region shown in dark blue), the humans that 
subsequently spread throughout Europe and Asia (red arrows) carried with them traces of Neanderthal DNA. Ultimately, 
ancestral humans continued their global spread to the New World, reaching North America approximately 25,000 years 
ago and the southern regions of South America 15,000 years later. This scenario is based on many types of data, including 
fossil records, anthropological studies, and the genome sequences of Neanderthals and of humans from around the world. 


(Adapted from M.A. Jobling et al., Human Evolutionary Genetics, 2nd ed. New York: Garland Science, 2014.) 


Genome Variation Contributes to Our Individuality—But 
How? 


With the possible exception of some identical twins, no two people have 
exactly the same genome sequence. When the same region of the genome 
from two different humans is compared, the nucleotide sequences typi- 
cally differ by about 0.1%. This degree of variation represents about 1 
difference in every 1000 nucleotide pairs—or some 3 million genetic 
differences between the genome of one person and the next. Detailed 
analyses of human genetic variation suggest that the bulk of this varia- 
tion was already present early in our evolution, perhaps 200,000 years 
ago, when the human population was still small. Yet much of this varia- 
tion has been reshuffled as more and more generations of humans have 
arisen. Thus, although a great deal of the genetic diversity in present-day 
humans was inherited from our early human ancestors, each individual 
inherits a unique combination of this ancient genetic variation. 


Sprinkled on top of this “tossed salad” of ancient variation are mutations 
that are much more recent. At birth, each human’s genome contains 
approximately 70 new mutations that were not present in the genomes of 
either parent. Combined with the jumbled collection of ancient variation 
we acquired from our ancestors, these recent mutations further distin- 
guish one individual from another. Most of the variation in the human 
genome takes the form of single base-pair changes. Although some of 
these base-pair changes are unique to individual humans, many more 
are preserved from our distant ancestors and are therefore widespread in 
the human population. Those single-base changes that are present in at 
least 1% of the population are called single-nucleotide polymorphisms 
(SNPs, pronounced “snips”). These polymorphisms are simply points in 
the genome that differ in nucleotide sequence between one portion of 
the population and another—positions where, for example, more than 1% 
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Figure 9-38 Single-nucleotide 
polymorphisms (SNPs) are points in 

the genome that differ by a single 
nucleotide pair between one portion 

of the population and another. Here, 

the differences are highlighted in green 

and blue. By convention, to count as a 
polymorphism, a genetic difference must be 
present in at least 1% of the total population 
of the species. Most, but not all, SNPs in the 
human genome occur in regions where they 
do not affect the function of a gene. 

As indicated by the bracket, when 
comparing any two humans one finds, 

on average, about one SNP per every 

1000 nucleotide pairs. 


~1000 nucleotide pairs 
EE .. - 


ere ATGTC CGACT TCGTA 
TACAG GCTGA AGCAT 
sc ctanks ATATC cGTcr TCATA 
TATAG GCAGA AGTAT 
ndtidiaie ATATC CGACT TCATA 
meviguUa TATAG GCTGA AGTAT 
ndviduaib ATATC CGACT TCATA 
mavIgua TATAG GCTGA AGTAT 
SNP1 SNP2 SNP3 


of the population has a G-C nucleotide pair, while the rest have an A-T 
(Figure 9-38). Two human genomes chosen at random from the world’s 
population will differ by approximately 2.5 x 10° SNPs that are scattered 
throughout the genome. 


Most of these SNPs are genetically silent, as they fall within noncritical 
regions of the genome. Such variations have no effect on how we look or 
how our cells function. This means that only a small subset of the vari- 
ation we observe in our DNA is responsible for the heritable differences 
from one human to the next. We discussed one such difference—that 
responsible for the ability of some adults to digest milk—earlier in the 
chapter. However, it remains a major challenge to identify the thousands 
of other genetic variations that are functionally important—a problem we 
return to in Chapter 19. 


Genome sequences hold the secrets to why humans look, think, and act 
the way we do—and why one human differs from another. Our genome 
contains the instructions that guide the countless decisions made by all 
of our cells as they interact with one another to build our tissues and 
organs. But we are only just beginning to learn the grammar and rules by 
which this genetic information orchestrates our biology and our behav- 
ior. Deciphering this code—which has been shaped by evolution and 
refined by individual variation—is one of the great challenges facing the 
next generation of cell biologists. 


ESSENTIAL CONCEPTS 


e Bycomparing the DNA and protein sequences of contemporary organ- 
isms, we are beginning to reconstruct how genomes have evolved in 
the billions of years that have elapsed since the appearance of the 
first cells. 


e Genetic variation—the raw material for evolutionary change—arises 
through a variety of mechanisms that alter the nucleotide sequence of 
genomes. These changes in sequence range from simple point muta- 
tions to larger-scale deletions, duplications, and rearrangements. 


e Genetic changes that give an organism a selective advantage are 
likely to be perpetuated. Changes that compromise an organism’s fit- 
ness or ability to reproduce are eliminated through natural selection. 


e Gene duplication is one of the most important sources of genetic 
diversity. Once duplicated, the two genes can accumulate different 
mutations and thereby diversify to perform different roles. 


e Repeated rounds of gene duplication and divergence during evolu- 
tion have produced many large gene families. 


e The evolution of new proteins is thought to have been greatly facili- 
tated by the swapping of exons between genes to create hybrid 
proteins with new functions. 


The human genome contains 3.2 x 10° nucleotide pairs distributed 
among 23 pairs of chromosomes—22 autosomes and a pair of sex 
chromosomes. Less than a tenth of this DNA is transcribed to pro- 
duce protein-coding or otherwise functional RNAs. 


Individual humans differ from one another by an average of 1 nucleo- 
tide pair in every 1000; this and other genetic variation underlies 
most of our individuality and provides the basis for identifying indi- 
viduals by DNA analysis. 


Nearly half of the human genome consists of mobile genetic elements 
that can move from one site to another within a genome. Two classes 
of these elements have multiplied to especially high copy numbers. 


Viruses are genes packaged in protective coats that can move from 
cell to cell and organism to organism, but they require host cells to 
reproduce. 


Some viruses have RNA instead of DNA as their genetic material. To 
reproduce, retroviruses copy their RNA genomes into DNA, and inte- 
grate into the host-cell genome. 


Comparing genome sequences of different species provides a power- 
ful way to identify conserved, functionally important DNA sequences. 


Related species, such as human and mouse, have many genes in 
common; evolutionary changes in the regulatory DNA sequences 
that affect how these genes are expressed are especially important in 
determining the differences between species. 


A comparison of genome sequences from people around the world 
has helped reveal how humans have evolved and spread across the 
globe. 


Questions 
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KEY TERMS 


horizontal gene transfer 
L1 element 
mobile genetic element 


Alu sequence 
conserved synteny 
exon shuffling 


gamete open reading frame (ORF) 
gene duplication and divergence phylogenetic tree 

gene family point mutation 

germ line purifying selection 


homologous gene 


retrotransposon 

retrovirus 

reverse transcriptase 
single-nucleotide polymorphism (SNP) 
somatic cell 

transposon 

virus 


QUESTIONS 


QUESTION 9-7 


Discuss the following statement: “Mobile genetic elements 
are parasites. They are always harmful to the host organism.” 


QUESTION 9-8 


Human Chromosome 22 (48 x 10° nucleotide pairs in length) 
has about 700 protein-coding genes, which average 19,000 
nucleotide pairs in length and contain an average of 

5.4 exons, each of which averages 266 nucleotide pairs. 
What fraction of the average protein-coding gene is 
converted into mRNA? What fraction of the chromosome do 
these genes occupy? 


QUESTION 9-9 


(True or False?) The DNA sequence of most of the human 
genome is unimportant. Explain your answer. 


QUESTION 9-10 


Mobile genetic elements make up nearly half of the 
human genome and are inserted more or less randomly 
throughout it. However, in some spots these elements 
are rare, as illustrated for a cluster of genes called HoxD, 
which lies on Chromosome 2 (Figure Q9-10). This cluster 
is about 100 kb in length and contains nine genes whose 
differential expression along the length of the developing 
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embryo helps establish the basic body plan for humans 

(and other animals). In Figure Q9-10, lines that project 
upward indicate exons of known genes. Lines that project 
downward indicate mobile genetic elements; they are so 
numerous they merge into nearly a solid block outside 

the HoxD cluster. For comparison, an equivalent region of 
Chromosome 22 is shown. Why do you suppose that mobile 
genetic elements are so rare in the HoxD cluster? 


QUESTION 9-11 


An early graphical method for comparing nucleotide 
sequences—the so-called diagon plot—still yields one 

of the best visual comparisons of sequence relatedness. 

An example is illustrated in Figure Q9-11, in which the 
human B-globin gene is compared with the human cDNA 
for B globin (which contains only the coding portion of the 
gene; Figure Q9-11A) and with the mouse -globin gene 
(Figure O9-11B). Diagon plots are generated by comparing 
blocks of sequence, in this case blocks of 11 nucleotides 

at a time. If 9 or more of the nucleotides match, a dot is 
placed on the diagram at the coordinates corresponding to 
the blocks being compared. A comparison of all possible 
blocks generates diagrams such as the ones shown in Figure 
Q9-11, in which sequence similarities show up as diagonal 
lines. 


A. From the comparison of the human B-globin gene with 
the human B-globin cDNA (Figure O9-11A), can you deduce 
the positions of exons and introns in the B-globin gene? 


B. Are the exons of the human B-globin gene (indicated 
by shading in Figure Q9-11B) similar to those of the mouse 
B-globin gene? Identify and explain any key differences. 


C. Is there any sequence similarity between the human 
and mouse -globin genes that lies outside the exons? 
If so, identify its location and offer an explanation for its 
preservation during evolution. 


D. Did the mouse or human gene undergo a change of 
intron length during their evolutionary divergence? How can 
you tell? 


QUESTION 9-12 


Your advisor suggests that you write a computer program 
that will identify the exons of protein-coding genes directly 
from the sequence of the human genome. In preparation 
for that task, you decide to write down a list of the features 
that might distinguish protein-coding sequences from 


(A) HUMAN B-GLOBIN cDNA 
COMPARED WITH HUMAN 


B-GLOBIN GENE 


intronic DNA and from other sequences in the genome. 
What features would you list? (You may wish to review basic 
aspects of gene expression in Chapter 7.) 


QUESTION 9-13 


You are interested in finding out the function of a particular 
gene in the mouse genome. You have determined the 
nucleotide sequence of the gene, defined the portion that 
codes for its protein product, and searched the relevant 
database for similar sequences; however, neither the gene 
nor the encoded protein resembles anything previously 
described. What types of additional information about the 
gene and the encoded protein would you like to know in 
order to narrow down its function, and why? Focus on the 
information you would want, rather than on the techniques 
you might use to get that information. 


QUESTION 9-14 


Why do you expect to encounter a stop codon about every 
20 codons or so in a random sequence of DNA? 


QUESTION 9-15 


Which of the processes listed below contribute significantly 
to the evolution of new protein-coding genes? 


A. Duplication of genes to create extra copies that can 
acquire new functions. 


B. Formation of new genes de novo from noncoding DNA 
in the genome. 


C. Horizontal transfer of DNA between cells of different 
species. 


D. Mutation of existing genes to create new functions. 


E. Shuffling of protein domains by gene rearrangement. 


QUESTION 9-16 


Some protein sequences evolve more rapidly than others. 
But how can this be demonstrated? One approach is to 
compare several genes from the same two species, as 
shown for rat and human in the table. Two measures of 
rates of nucleotide substitution are indicated in the table. 
Nonsynonymous changes refer to single-nucleotide 
changes in the DNA sequence that alter the encoded 
amino acid (for example, ATC — TTC, which gives 
isoleucine — phenylalanine). Synonymous changes refer 
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Rates of Change 


Histone H3 135 0.0 4.5 
Hemoglobin a 141 0.6 4.4 
Interferon y 136 3.1 55 


Rates were determined by comparing rat and human 
sequences and are expressed as nucleotide changes per site 
per 10? years. The average rate of nonsynonymous changes 
for several dozen rat and human genes is about 0.8. 


to those that do not alter the encoded amino acid (ATC 
— ATT, which gives isoleucine — isoleucine, for example). 
(As is apparent in the genetic code, Figure 7-27, there are 
many cases where several codons correspond to the same 
amino acid.) 


A. Why are there such large differences between the 
synonymous and nonsynonymous rates of nucleotide 
substitution? 


B. Considering that the rates of synonymous changes are 
about the same for all three genes, how is it possible for 
the histone H3 gene to resist so effectively those nucleotide 
changes that alter its amino acid sequence? 


C. In principle, a protein might be highly conserved 
because its gene exists in a “privileged” site in the genome 
that is subject to very low mutation rates. What feature of 
the data in the table argues against this possibility for the 
histone H3 protein? 


QUESTION 9-17 


Hemoglobin-like proteins were discovered in legumes, 
where they function in root nodules to lower the oxygen 
concentration, allowing the resident bacteria to fix nitrogen. 
These plant “hemoglobins” impart a characteristic pink 
color to the root nodules. The discovery of hemoglobin in 
plants was initially surprising because scientists regarded 
hemoglobin as a distinctive feature of animal blood. It 

was hypothesized that the plant hemoglobin gene was 
acquired by horizontal transfer from an animal. Many more 
hemoglobin-like genes have now been discovered and 
sequenced from a variety of organisms, and a phylogenetic 
tree of hemoglobins is shown in Figure Q9-17. 


A. Does the evidence in the tree support or refute the 
hypothesis that the plant hemoglobins arose by horizontal 
gene transfer from animals? 


B. Supposing that the plant hemoglobin genes were 
originally derived by horizontal transfer (from a parasitic 
nematode, for example), what would you expect the 
phylogenetic tree to look like? 


QUESTION 9-18 


The accuracy of DNA replication in the human germ-cell 
line is such that on average only about 0.6 out of the 6 
billion nucleotides is altered at each cell division. Because 
most of our DNA is not subject to any precise constraint 
on its sequence, most of these changes are selectively 
neutral. Any two modern humans chosen at random will 
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differ by about 1 nucleotide pair per 1000. Suppose we are 
all descended from a single pair of ancestors (an “Adam 
and Eve”) who were genetically identical and homozygous 
(each chromosome was identical to its homolog). Assuming 
that all germ-line mutations that arise are preserved 

in descendants, how many cell generations must have 
elapsed since the days of our original ancestor parents for 
1 difference per 1000 nucleotides to have accumulated in 
modern humans? Assuming that each human generation 
corresponds on average to 200 cell-division cycles in 

the germ-cell lineage and allowing 30 years per human 
generation, how many years ago would this ancestral couple 
have lived? 


QUESTION 9-19 


Reverse transcriptases do not proofread as they synthesize 
DNA using an RNA template. What do you think the 
consequences of this are for the treatment of AIDS? 
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Analyzing the Structure 
and Function of Genes 


Since the turn of the century, biologists have amassed an unprecedented 
wealth of information about the genes that direct the development and 
behavior of living things. Thanks to advances in our ability to rapidly 
determine the nucleotide sequence of entire genomes, we now have 
access to the complete molecular blueprints for thousands of different 
organisms, from the platypus to the plague bacterium, and for thousands 
of different people from all over the world. 


This information explosion was ignited by technological advances that 
allowed the isolation and manipulation of a selected piece of DNA 
from among the many millions of nucleotide pairs in a typical chromo- 
some. Investigators then developed powerful techniques for replicating, 
sequencing, and modifying this DNA—and even introducing it into other 
organisms that can then be studied in the laboratory. 


These technical breakthroughs have had a dramatic impact on all aspects 
of cell biology. They have advanced our understanding of the organization 
and evolutionary history of complex eukaryotic genomes (as discussed in 
Chapter 9) and have led to the discovery of whole new classes of genes, 
RNAs, and proteins. They continue to generate new ways of determin- 
ing the functions of genes and proteins in living organisms, and they 
provide an important set of tools for unraveling the mechanisms—still 
poorly understood—by which a complex organism can develop from a 
single fertilized egg. 


At the same time, our ability to manipulate DNA has had a profound 
influence on our understanding and treatment of disease: using these 
techniques, we can now detect the mutations in human genes that are 
responsible for inherited disorders or that predispose us to a variety of 


ISOLATING AND CLONING 
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DNA CLONING BY PCR 


SEQUENCING DNA 


EXPLORING GENE FUNCTION 
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QUESTION 10-1 


DNA sequencing of your own two 
B-globin genes (one from each of 
your two Chromosome 11s) reveals 
a mutation in one of the genes. 
Given this information alone, should 
you worry about being a carrier of 
an inherited disease that could be 
passed on to your children? What 
other information would you like to 
have to assess your risk? 


Figure 10-1 Selective breeding is, in 
essence, a form of genetic manipulation. 
(A) The oldest known depiction of a rose in 
Western art, from the palace of Knossos in 
Crete, around 2000 BC. Modern roses are 
the result of centuries of breeding between 
such wild roses. (B) Dogs have been bred 
to exhibit a wide variety of characteristics, 
including different head shapes, coat colors, 
and of course size. All dogs, regardless 

of breed, belong to a single species that 
was domesticated from the gray wolf 

some 10,000 to 15,000 years ago. (B, from 
A.L. Shearin & E.A. Ostrander, PLoS Biol. 
8:e1000310, 2010.) 


common diseases, including cancer. We can also produce an increasing 
number of pharmaceuticals, such as insulin for diabetics and blood- 
clotting proteins for hemophiliacs. 


In this chapter, we present a brief overview of how we can manipulate 
DNA, identify genes, and produce many copies of any given nucleotide 
sequence in the laboratory. We discuss several ways to explore gene func- 
tion, including recent approaches to DNA sequencing and to modifying or 
inactivating genes in cells, animals, and plants. These methods—which 
are continuously being improved and made more powerful—are not only 
revolutionizing the way we do science, but are transforming our under- 
standing of cell biology and human disease. 


ISOLATING AND CLONING DNA MOLECULES 


Humans have been experimenting with DNA, albeit without realizing it, 
for millennia. The roses in our gardens, the corn on our plate, and the 
dogs in our yards are all the product of selective breeding that has taken 
place over many, many generations (Figure 10-1). But it wasn’t until the 
1970s that we could begin to engineer organisms with desired properties 
by directly tinkering with their genes. 


Isolating and manipulating individual genes is not a trivial matter. Unlike 
a protein, a gene does not exist as a discrete entity in cells; it is a small 
part of a much larger DNA molecule. Even bacterial genomes, which are 
much less expansive and complex than the chromosomes of eukaryotes, 
are still enormously long. The E. coli genome, for example, contains 4.6 
million nucleotide pairs. 


How, then, can we go about separating a single gene from a eukaryotic 
genome—which is considerably larger than that of a bacteri'um—so that it 
can be handled in the laboratory? The solution to this problem emerged, 
in large part, with the discovery of a class of bacterial enzymes that cut 
double-stranded DNA at particular sequences. These enzymes can be 
used to produce a reproducible set of specific DNA fragments from any 
genome—including fragments that harbor genes. The desired fragment is 
then amplified, producing many identical copies, by a process called DNA 
cloning. It is this amplification that makes it possible to separate a gene 
of interest from the rest of the genome. 


In this section, we describe how specific DNA fragments can be gener- 
ated, isolated, and produced in large quantities in bacteria—the classical 
approach to DNA cloning. In the next section of the chapter, we present 


(A) 
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an alternative approach to cloning DNA: this method, which is carried out 
in a test tube, uses a special form of DNA polymerase to make copies of 
the desired nucleotide sequence. 


Restriction Enzymes Cut DNA Molecules at Specific Sites 


Like many of the tools of DNA technology, the enzymes used to prepare 
DNA fragments for cloning were discovered by researchers trying to 
understand an intriguing biological phenomenon. It had been observed 
that certain bacteria always degraded “foreign” DNA that was intro- 
duced into them experimentally. A search for the underlying mechanism 
revealed a novel class of enzymes that cleave DNA at specific nucleotide 
sequences. Because these enzymes function to restrict the transfer of 
DNA between strains of bacteria, they were called restriction enzymes, 
or restriction nucleases. The pursuit of this seemingly arcane biological 
puzzle set off the development of technologies that have forever changed 
the way cell and molecular biologists study living things. 


Different bacterial species produce different restriction enzymes, each 
cutting at a different, specific nucleotide sequence (Figure 10-2). The 
bacteria’s own DNA is protected from cleavage by chemical modification 
of these specific sequences. Because these target sequences are short— 
generally four to eight nucleotide pairs—many sites of cleavage will 
occur, purely by chance, in any long DNA molecule. The reason restric- 
tion enzymes are so useful in the laboratory is that each enzyme will cut 
a particular DNA molecule at the same sites. Thus for a given sample of 
DNA, a particular restriction enzyme will reliably generate the same set 
of DNA fragments. 


The size of the resulting fragments depends on the target sequences of 
the restriction enzymes. As shown in Figure 10-2, the enzyme Haelll cuts 
at a sequence of four nucleotide pairs; a sequence this long would be 
expected to occur purely by chance approximately once every 256 nucle- 
otide pairs (1 in 44). In comparison, a restriction enzyme with a target 
sequence that is eight nucleotides long would be expected to cleave DNA 
on average once every 65,536 nucleotide pairs (1 in 48). This difference 
in sequence selectivity makes it possible to cleave a long DNA molecule 
into the fragment sizes that are most suitable for a given application. 


Gel Electrophoresis Separates DNA Fragments of 
Different Sizes 


After a large DNA molecule is cleaved into smaller pieces with a restric- 
tion enzyme, the DNA fragments can be separated from one another on 
the basis of their length by gel electrophoresis—the same method used to 
separate mixtures of proteins (see Panel 4-5, p. 167). A mixture of DNA 
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Figure 10-2 Restriction enzymes cleave 
both strands of the DNA double helix 

at specific nucleotide sequences. Target 
sequences (orange) are often palindromic— 
that is, the nucleotide sequence is 
symmetrical around a central point. Some 
enzymes, such as Haelll, cut straight across 
the double helix and leave two blunt-ended 
DNA molecules; with others, such as EcoRI 
and Hindlll, the cuts on each strand are 
staggered. These staggered cuts generate 
“sticky ends”—short, single-stranded 
overhangs that help the cut DNA molecules 
join back together through complementary 
base-pairing. This rejoining of DNA 
molecules becomes important for DNA 
cloning, as we discuss shortly. Restriction 
enzymes are usually obtained from 

bacteria, and their names reflect their 
origins: for example, the enzyme EcoRI 
comes from E. coli. 
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Figure 10-3 DNA molecules can 

be separated by size using gel 
electrophoresis. (A) Schematic illustration 
compares the results of cutting the same 
DNA molecule (in this case, the genome 
of a virus that infects parasitic wasps) with 
two different restriction enzymes, EcoRI 
(middle) and Hindlll (right). The fragments 
are then separated by gel electrophoresis. 
Because larger fragments migrate more 
slowly than smaller ones, the lowermost 
bands on the gel contain the smallest DNA 
fragments. The sizes of the fragments can 
be estimated by comparing them to a set 
of DNA fragments of known sizes (left). 

(B) Photograph of an actual gel shows the 
positions of DNA bands that have been 
labeled with a fluorescent dye. (B, from 

U. Albrecht et al., J. Gen. Virol. 75: 
3353-3363, 1994. With permission from the 
Microbiology Society.) 
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fragments is loaded at one end of a slab of agarose or polyacrylamide 
gel, which contains a microscopic network of pores. When a voltage is 
applied across the gel, the negatively charged DNA fragments migrate 
toward the positive electrode; larger fragments migrate more slowly 
because their progress is impeded to a greater extent by the gel matrix. 
Over several hours, the DNA fragments become spread out across the gel 
according to size, forming a ladder of discrete bands, each composed of 
a collection of DNA molecules of identical length (Figure 10-3). 


The separated DNA bands on an agarose or polyacrylamide gel are not, 
by themselves, visible. To see these bands, the DNA must be labeled or 
stained in some way. One sensitive method involves exposing the gel to 
a dye that fluoresces under ultraviolet (UV) light when it is bound to DNA. 
When the gel is placed on a UV light box, the individual bands glow bright 
orange—or bright white when the gel is photographed in black and white 


QUESTION 10-2 


Which products result when the double-stranded DNA molecule below 
is digested with (A) EcoRI, (B) Haelll, (C) Hindlll, or (D) all three of these 
enzymes together? (See Figure 10-2 for the target sequences of these 
enzymes.) 
5/-AAGAATTGCGGAATTCGGGCCTTAAGCGCCGCGTCGAGGCCTTAAA-3' 
3’-TTCTTAACGCCTTAAGCCCGGAATTCGCGGCGCAGCTCCGGAATTT-5’ 
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(A) JOINING OF TWO FRAGMENTS CUT (B) JOINING OF TWO FRAGMENTS CUT 
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Figure 10-4 DNA ligase can join together any two DNA fragments in vitro to produce recombinant DNA 
molecules. The fragments joined by DNA ligase can be from different cells, tissues, or even different organisms. ATP 
provides the energy necessary to reseal the sugar-phosphate backbone of the DNA. (A) DNA ligase can readily join two 
DNA fragments produced by the same restriction enzyme, in this case EcoRI. Note that the staggered ends produced 
by this enzyme enable the ends of the two fragments to base-pair correctly with each other, greatly facilitating their 
rejoining. (B) DNA ligase can also be used to join DNA fragments produced by different restriction enzymes—for 
example, EcoRI and Haelll. In this case, before the fragments undergo ligation, DNA polymerase plus a mixture of 
deoxyribonucleoside triphosphates (dNTPs) are used to fill in the staggered cut produced by EcoRI prior to ligation. 


(see Figure 10-3B). To isolate a desired DNA fragment, the small section 
of the gel that contains the band is excised with a scalpel, and the DNA 
is then extracted. 


DNA Cloning Begins with the Production of 
Recombinant DNA 


Once a genome has been broken into smaller, more manageable pieces, 
the resulting fragments must then be prepared for cloning. This process 
involves inserting the DNA fragments into a carrier, or vector—another 
piece of DNA that can be copied inside cells. Because this union involves 
“recombining” DNA from different sources, the resulting molecules are 
called recombinant DNA. The production of recombinant DNA mol- 
ecules in this way is a key step in the classical approach to DNA cloning. 


Like the cutting of DNA by restriction enzymes, the joining together of 
DNA fragments to produce recombinant DNA molecules is made possible 
by an enzyme produced by cells. In this case, the enzyme is DNA ligase. 
In cells, DNA ligase reseals the nicks that arise in the DNA backbone dur- 
ing DNA replication and DNA repair (see Figure 6-19). In the laboratory, 
DNA ligase can be used to link together any two pieces of DNA in a test 
tube, producing recombinant DNA molecules that are not found in nature 
(Figure 10-4). 


Recombinant DNA Can Be Copied Inside Bacterial Cells 


The vectors used to carry the DNA that is to be cloned are small, circular 
DNA molecules called plasmids (Figure 10-5). Each plasmid contains 
its own replication origin, which enables it to replicate in a bacterial cell 
independently of the bacterial chromosome. This feature allows the DNA 
of interest to be produced in large amounts, even within a single bac- 
terial cell. The plasmid also has cleavage sites for common restriction 
enzymes, so that it can be conveniently opened and a foreign DNA frag- 
ment inserted. 


0.5 um 


Figure 10-5 Bacterial plasmids are 
commonly used as cloning vectors. This 
circular, double-stranded DNA molecule 
was the first plasmid for DNA cloning; it 
contains about nine thousand nucleotide 
pairs. The staining procedure used to make 
the DNA visible in this electron micrograph 
causes the DNA to appear much thicker 
than it actually is. (Courtesy of Stanley N. 
Cohen, Stanford University.) 
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Figure 10-6 A DNA fragment is inserted circular, DNA fragment 


into a bacterial plasmid using the enzyme double-stranded 


plasmid DNA to be cloned 


DNA ligase. The plasmid is first cut open (cloning vector) xr RENA 
at a single site with a restriction enzyme 

(in this case, one that produces staggered 

ends). It is then mixed with the DNA 

fragment to be cloned, which has been ‘ 

cut with the same restriction enzyme. The CLEAVAGE WITH COVALENT 

staggered ends base-pair, and when DNA RESTRICTION LINKAGE 

ligase and ATP are added, the nicks in the ENZYME BY DNA LIGASE 


DNA backbone are sealed to produce a 
complete recombinant DNA molecule. In 
the accompanying micrographs, we have 
colored the DNA fragment red to make 
it easier to see. (Micrographs courtesy of 
Huntington Potter and David Dressler.) 
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The vectors used for cloning are streamlined versions of plasmids that 
occur naturally in many bacteria. Bacterial plasmids were first recognized 
by physicians and scientists because they often carry genes that render 
their microbial host resistant to one or more antibiotics. Indeed, histori- 
cally potent antibiotics—penicillin, for example—are no longer effective 
against many of today’s bacterial infections because plasmids that confer 
resistance to the antibiotic have spread among bacterial species by hori- 
zontal gene transfer (see Figure 9-15). 


To insert a piece of DNA into a plasmid vector, the purified plasmid DNA 
is opened up by a restriction enzyme that cleaves it at a single site, and 
the DNA fragment to be cloned is then spliced into that site using DNA 
ligase (Figure 10-6). This recombinant DNA molecule is now ready to be 
introduced into a bacterium, where it will be copied and amplified. 


To accomplish this feat, investigators take advantage of the fact that 
some bacteria naturally take up DNA molecules present in their surround- 
ings. The mechanism that controls this uptake is called transformation, 
because early observations suggested it could “transform” one bacterial 
strain into another. Indeed, the first proof that genes are made of DNA 
came from an experiment in which DNA purified from a pathogenic strain 
of pneumococcus was used to transform a harmless bacterium into a 
deadly one (see How We Know, pp. 192-194). 


In a natural bacterial population, a source of DNA for transformation is 
provided by bacteria that have died and released their contents, includ- 
ing DNA, into the environment. In a test tube, however, bacteria such as 
E. coli can be coaxed to take up recombinant DNA that has been created 
in the laboratory. These bacteria are then suspended in a nutrient-rich 
broth and allowed to proliferate. 


Each time the bacterial population doubles—every 30 minutes or so—the 
number of copies of the recombinant DNA molecule also doubles. Thus, 
in 24 hours, the engineered cells will produce hundreds of millions of 
copies of the plasmid, along with the DNA fragment it contains. The bac- 
teria can then be split open (lysed) and the plasmid DNA purified from 
the rest of the cell contents, including the large bacterial chromosome 
(Figure 10-7). 


The DNA fragment can be readily recovered by cutting it out of the plas- 
mid DNA with the same restriction enzyme that was used to insert it, 
and then separating it from the plasmid DNA by gel electrophoresis (see 
Figure 10-3). Together, these steps allow the amplification and purifica- 
tion of any segment of DNA from the genome of any organism. 


Isolating and Cloning DNA Molecules 339 


DOUBLE-STRANDED 
RECOMBINANT 
PLASMID DNA 
INTRODUCED INTO \) 
BACTERIAL CELL 


bacterial Y O 
cell -a culture produces 
hundreds of millions of 
new bacteria 


An Entire Genome Can Be Represented in a DNA Library 


When a whole genome is cut by a restriction enzyme, a large number 
of different DNA fragments is generated. This collection of DNA frag- 
ments can be ligated into plasmid vectors, under conditions that favor 
the insertion of a single DNA fragment into each plasmid molecule. These 
recombinant plasmids are then introduced into E. coli at a concentra- 
tion that ensures that no more than one plasmid molecule is taken up 
by each bacterium. The resulting collection of cloned DNA fragments, 
present in the bacterial culture, is known as a DNA library. Because the 
DNA fragments were derived by digesting chromosomal DNA directly 
from an organism, the resulting collection—called a genomic library— 
should represent the entire genome of that organism (Figure 10-8). Such 
genomic libraries often provide the starting material for determining the 
complete nucleotide sequence of an organism's genome. 


For other applications, however, it can be advantageous to work with a 
different type of library—one that includes only the coding sequences of 
genes; that is, a library that lacks intronic and other noncoding sequences 
that make up most eukaryotic DNA. For some genes, the complete 
genomic clone—including introns and exons—is too large and unwieldy 
to handle conveniently in the laboratory (see, for example, Figure 7-19B). 
What's more, the bacterial cells typically used to amplify cloned DNA are 
unable to remove introns from mammalian RNA transcripts. So if the 
goal is to use a cloned mammalian gene to produce a large amount of 
the protein it encodes, for example, it is essential to use only the coding 
sequence of the gene. 


In this case, investigators generate a cDNA library. A cDNA library is 
similar to a genomic library in that it also contains numerous clones con- 
taining many different DNA sequences. But it differs in one important 
respect. The DNA that goes into a cDNA library is not genomic DNA; 
it is DNA copied from the mRNAs present in a particular type of cell. 
To prepare a cDNA library, all of the mRNAs are extracted, and double- 
stranded DNA copies of these mRNAs are produced by the enzymes 
reverse transcriptase and DNA polymerase (Figure 10-9). The resulting 
complementary DNA—or cDNA—molecules are then introduced into 
bacteria and amplified, as described for genomic DNA fragments (see 
Figure 10-8). 


Figure 10-8 Human genomic libraries containing DNA fragments 
representing the whole human genome can be constructed using 
restriction enzymes and DNA ligase. Such a genomic library consists 
of a set of bacteria, each carrying a different small fragment of human 
DNA. For simplicity, only the colored DNA fragments are shown in 

the library; in reality, all of the different gray fragments will also be 
represented. 


Figure 10-7 A DNA fragment can be 
replicated inside a bacterial cell. To clone 
a particular fragment of DNA, it is first 
inserted into a plasmid vector, as shown 

in Figure 10-6. The resulting recombinant 
plasmid DNA is then introduced into a 
bacterium, where it is replicated many 
millions of times as the bacterium multiplies. 
For simplicity, the genome of the bacterial 
cell is not shown. 
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Figure 10-9 Complementary DNA (cDNA) 

is prepared from mRNA. Total mRNA a O 
is extracted from a selected type of cell, 

and double-stranded complementary 
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There are several important differences between genomic DNA clones 
and cDNA clones. Genomic clones represent a random sample of all of 
the DNA sequences found in an organism's genome and, with very rare 
exceptions, will contain the same sequences regardless of the cell type 
from which the DNA came. Also, genomic clones from eukaryotes con- 
tain large amounts of noncoding DNA, repetitive DNA sequences, introns, 
regulatory DNA, and spacer DNA; sequences that code for proteins will 
make up only a few percent of the library (see Figure 9-33). By contrast, 
cDNA clones contain predominantly protein-coding sequences, and only 
those sequences that have been transcribed into MRNA in the cells from 
which the cDNA was made. 


As different types of cells produce distinct sets of mRNA molecules, each 
yields a different cDNA library. Furthermore, patterns of gene expression 
change during development, so cells at different stages in their develop- 
ment will also yield different cDNA libraries. Thus, cDNAs can be used to 
assess which genes are expressed in specific cells, at particular times in 
development, or under a particular set of conditions. 


Hybridization Provides a Sensitive Way to Detect 
Specific Nucleotide Sequences 


Thus far, we have been talking about large collections of DNA frag- 
ments. For many studies, however, investigators wish to identify or 
examine an individual gene or RNA. Fortunately, an intrinsic property of 
nucleic acids—their ability to form complementary base pairs—provides 
a convenient and powerful technique for detecting a specific nucleotide 
sequence. 


To see how, let’s look at a molecule of double-stranded DNA. Under nor- 
mal conditions, the two strands of a DNA double helix are held together by 
hydrogen bonds between the complementary base pairs (see Figure 5-4). 
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But these relatively weak, noncovalent bonds can be fairly easily bro- 
ken—for example, by heating the DNA to around 90°C. Such treatment 
will cause DNA denaturation, releasing the two strands from each other. 
When the conditions are reversed—by slowly lowering the temperature— 
the complementary strands will readily come back together to re-form a 
double helix. This DNA renaturation, or hybridization, is driven by the 
re-formation of the hydrogen bonds between complementary base pairs 
(Figure 10-10). 


Hybridization can be employed for detecting any nucleotide sequence 
of interest, whether DNA or RNA. One simply designs a short, single- 
stranded DNA probe that is complementary to that sequence. Because the 
nucleotide sequences of so many genomes are known—and are stored in 
publicly accessible databases—designing such a probe is straightforward. 
The desired probe can then be synthesized in the laboratory—usually by 
a commercial organization or a centralized academic facility. 


Hybridization with DNA probes has many uses in cell and molecular biol- 
ogy. As we will see later in this chapter, for example, DNA probes that 
carry a fluorescent or radioactive label can be used to detect complemen- 
tary RNA molecules in tissue preparations. But one of the most powerful 
applications of hybridization is in the cloning of DNA by the polymerase 
chain reaction, as we discuss next. 


DNA CLONING BY PCR 


Genomic and cDNA libraries were once the only route to gene cloning, 
and they are still used for cloning very large genes and for sequencing 
whole genomes. However, a powerful and versatile method for ampli- 
fying DNA, known as the polymerase chain reaction (PCR), provides 
a more rapid and straightforward approach, particularly in organisms 
whose complete genome sequence is known. Today, most genes are 
cloned via PCR. 


Invented in the 1980s, PCR revolutionized the way that DNA and RNA 
are analyzed. The technique can amplify any nucleotide sequence 
quickly and selectively. Unlike the traditional approach of cloning using 
vectors—which relies on bacteria to make copies of the desired DNA 
sequences—PCR is performed entirely in a test tube. Eliminating the 
need for bacteria makes PCR convenient and fast—billions of copies of a 
nucleotide sequence can be generated in a matter of hours. At the same 
time, PCR is remarkably sensitive: the method can be used to amplify and 
detect the trace amounts of DNA in a drop of blood left at a crime scene 
or in a few copies of a viral genome in a patient’s blood sample. Because 
of its sensitivity, speed, and ease of use, PCR has many applications in 
addition to DNA cloning, including forensics and diagnostics. 


In this section, we provide a brief overview of how PCR works and how 
it is used for a range of purposes that require the amplification of specific 
DNA sequences. 
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Figure 10-10 A molecule of DNA can 
undergo denaturation and renaturation 
(hybridization). For two single-stranded 
molecules to hybridize, they must have 
complementary nucleotide sequences that 
allow base-pairing. In this example, the red 
and orange strands are complementary to 
each other, and the blue and green strands 
are complementary to each other. Although 
denaturation by heating is shown, DNA 

can also be denatured by alkali treatment. 
The 1961 discovery that single strands of 
DNA could readily re-form a double helix 

in this way was a big surprise to scientists. 
Hybridization can also occur between 
complementary strands of DNA and RNA or 
between two RNAs. 


QUESTION 10-3 


Discuss the following statement: 
“From the nucleotide sequence of 

a cDNA clone, the complete amino 
acid sequence of a protein can be 
deduced by applying the genetic 
code. Thus, protein biochemistry has 
become superfluous because there 
is nothing more that can be learned 
by studying the protein.” 


342 CHAPTER 10 


| 
region of 

double-stranded 
DNA to be 
amplified 


N 3" 
m E5 


E E 3' 5'| 


Analyzing the Structure and Function of Genes 


PCR Uses DNA Polymerase and Specific DNA Primers to 
Amplify DNA Sequences in a Test Tube 


The success of PCR depends on the exquisite selectivity of DNA hybridi- 
zation, along with the ability of DNA polymerase to copy a DNA template 
reliably, through repeated rounds of replication in vitro. The enzyme 
works by adding nucleotides to the 3' end of a growing strand of DNA (see 
Figure 6-11). To initiate the reaction, the polymerase requires a primer— 
a short nucleotide sequence that provides a 3' end from which synthesis 
can begin. The beauty of PCR is that the primers that are added to the 
reaction mixture not only serve as starting points, but they also direct the 
polymerase to the specific DNA sequence to be amplified. These primers 
are designed by the experimenter based on the DNA sequence of interest 
and then synthesized chemically. Thus, PCR can only be used to clone a 
DNA segment for which the sequence is known in advance. However, 
with the large and growing number of genome sequences available in 
public databases, this requirement is rarely a drawback. 


The power of PCR comes from repetition: the cycle of amplification is 
carried out dozens of times over the course of a few hours. At the start 
of each cycle, the two strands of the double-stranded DNA template 
are separated and a unique primer is hybridized, or annealed, to each. 
DNA polymerase is then allowed to replicate each strand independently 
(Figure 10-11). In subsequent cycles, all the newly synthesized DNA mol- 
ecules produced by the polymerase serve as templates for the next round 
of replication (Figure 10-12). Through this iterative process of amplifica- 
tion, many copies of the original sequence can be made—billions after 
about 20 to 30 cycles. 


PCR is the method of choice for cloning relatively short DNA fragments 
(say, under 10,000 nucleotide pairs). Because the original template for 
PCR can be either DNA or RNA, the method can be used to obtain either 
a full genomic clone (complete with introns and exons) or a cDNA copy 
of an MRNA (Figure 10-13). A major benefit of PCR is that genes can be 
cloned directly from any piece of DNA or RNA without the time and effort 
needed to first construct a DNA library. 
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Figure 10-11 A pair of PCR primers directs the amplification of a desired segment of DNA in a test tube. Each cycle of PCR 
includes three steps: (1) The double-stranded DNA is heated briefly to separate the two strands. (2) The DNA is exposed to a large 
excess of a pair of specific primers—designed to bracket the region of DNA to be amplified—and the sample is cooled to allow the 
primers to hybridize to complementary sequences in the two DNA strands. (3) This mixture is incubated with DNA polymerase and 
the four deoxyribonucleoside triphosphates so that DNA can be synthesized, starting from the two primers. The process can then be 
repeated by reheating the sample to separate the double-stranded products of the previous cycle (see Figure 10—12). 

The technique depends on the use of a special DNA polymerase isolated from a thermophilic bacterium; this polymerase is stable at 
much higher temperatures than eukaryotic DNA polymerases, so it is not denatured by the heat treatment shown in step 1. The enzyme 
therefore does not have to be added again after each cycle. 
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Figure 10-12 PCR uses repeated rounds of strand separation, hybridization, and synthesis to amplify DNA. As 
the procedure outlined in Figure 10-11 is repeated, all the newly synthesized fragments serve as templates in their 
turn. Because the polymerase and the primers remain in the sample after the first cycle, PCR involves simply heating 
and then cooling the same sample, in the same test tube, again and again. Each cycle doubles the amount of DNA 
synthesized in the previous cycle, so that within a few cycles, the predominant DNA is identical to the sequence 
bracketed by and including the two primers in the original template. In the example illustrated here, three cycles 
of reaction produce 16 DNA chains, 8 of which (boxed in yellow) correspond exactly to one or the other strand of 
the original bracketed sequence. After four more cycles, 240 of the 256 DNA chains will correspond exactly to the 
original sequence, and after several more cycles, essentially all of the DNA strands will be this length. The whole 


procedure is shown in Movie 10.1. 


PCR Can Be Used for Diagnostic and Forensic 
Applications 


In addition to its use in cloning, PCR is frequently employed to amplify 
DNA for other, more practical purposes. Because of its extraordinary sen- 
sitivity, PCR can be used to detect an infection at its earliest stages. In 
this case, short sequences complementary to the suspected pathogen’s 
genome are used as primers, and following many cycles of amplifica- 
tion, even a few copies of an invading bacterial or viral genome in a 
patient sample can be detected (Figure 10-14). PCR can also be used to 
track epidemics, detect bioterrorist attacks, and test food products for 
the presence of potentially harmful microbes. It is also used to verify the 
authenticity of a food source—for example, whether a sample of beef 
actually came from a cow. 


Finally, PCR is widely used in forensic medicine. The method's extreme 
sensitivity allows forensic investigators to isolate DNA from even the 
smallest traces of human blood or other tissue to obtain a DNA fingerprint 
of the person who left the sample behind. With the possible exception 
of identical twins, the genome of each human differs in DNA sequence 
from that of every other person on Earth. Using primer pairs targeted at 
genome sequences that are known to be highly variable in the human 


A. Ifthe PCR shown in Figure 
10-12 is carried through an 
additional two rounds of 
amplification, how many of the 
DNA fragments (gray, green, 

red, or outlined in yellow) will be 
produced? If many additional cycles 
are carried out, which fragments will 
predominate? 

B. Assume you start with one 
double-stranded DNA molecule 
and amplify a 500-nucleotide-pair 
sequence contained within it. 
Approximately how many cycles 
of PCR amplification will you need 
to produce 100 ng of this DNA? 
100 ng is an amount that can be 
easily detected after staining with 
a fluorescent dye. (Hint: for this 
calculation, you need to know that 
each nucleotide has an average 
molecular mass of 330 g/mole.) 
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Figure 10-13 PCR can be used to obtain 
either genomic or cDNA clones. (A) To use 
PCR to clone a segment of chromosomal 
DNA, total DNA is first purified from cells. 
PCR primers that flank the stretch of DNA 

to be cloned are added, and many cycles 

of PCR are completed (see Figure 10-12). 
Because only the DNA between (and 
including) the primers is amplified, PCR 
provides a way to obtain selectively any short 
stretch of chromosomal DNA in an effectively 
pure form. (B) To use PCR to obtain a cDNA 
clone of a gene, total mRNA is first purified 
from cells. The first primer is added to a 
population of single-stranded mRNAs, and 
reverse transcriptase is used to make a 

DNA strand complementary to the specific 
RNA sequence of interest. A second primer 
is then added, and the DNA molecule is 
amplified through many cycles of PCR. 


Analyzing the Structure and Function of Genes 


chromosomal 
DNA 


| isolate total mRNA 


ss mRNA sequence 


to be cloned 


eT 
SSeS Sey, 
DNA segment to 
be cloned | 


ADD FIRST PRIMER, 
REVERSE TRANSCRIPTASE, 
AND DEOXYRIBONUCLEOSIDE 
TRIPHOSPHATES 


l 
SEPARATE STRANDS 
AND ADD PRIMERS 


= 


l 
PCR AMPLIFICATION 


iw] 


NA 


mRNA 


SEPARATE STRANDS AND 
ADD SECOND PRIMER 


n 


PCR AMPLIFICATION 
WITH BOTH PRIMERS PRESENT 


J a 
—— 
== 
— = æ e 
— = — = Ul 
genomic — è —_ cDNA a 
e 
clones Á clones 
— = æ i 
— = æ = 
— ç æ i 
(A) — = = (B) Á 


population, PCR makes it possible to generate a distinctive DNA finger- 
print for any individual (Figure 10-15). Such forensic analyses can be 
used not only to point the finger at those who have done wrong, but— 
equally important—to help exonerate those who have been wrongfully 
convicted. 
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Figure 10-14 PCR can be used to detect the presence of a viral genome in 

a sample of blood. Because of its ability to amplify enormously the signal from 
every single molecule of nucleic acid, PCR is an extraordinarily sensitive method for 
detecting trace amounts of virus in a sample of blood or tissue without the need to 
purify the virus. For HIV, the virus that causes AIDS, the genome is a single-stranded 
molecule of RNA, as illustrated here. In addition to HIV, many other viruses that infect 
humans are now detected in this way. 
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Figure 10-15 PCR is used in forensic science to distinguish one individual from another. The DNA sequences typically analyzed 
are short tandem repeats (STRs). These sequences, composed of stretches of CACA... or GTGT..., for example, are found in various 
positions (loci) in the human genome. The number of repeats in each STR locus is highly variable in the population, ranging from 4 to 40 
in different individuals. Because of the variability in these sequences, individuals will usually inherit a different number of repeats at each 
STR locus from their mother and from their father; two unrelated individuals, therefore, rarely contain the same pair of repeat sequences 
at a given STR locus. (A) PCR using primers that recognize unique sequences on either side of one particular STR locus produces a 
pair of bands of amplified DNA from each individual, one band representing the maternal STR variant and the other representing the 
paternal STR variant. The length of the amplified DNA, and thus its position after gel electrophoresis, will depend on the exact number 
of repeats at the locus. (B) In the schematic example shown here, the same three STR loci are analyzed in samples from three suspects 
(individuals A, B, and C), producing six bands for each individual. Although different people can have several bands in common, the 
overall pattern is quite distinctive for each person. The band pattern can therefore serve as a DNA fingerprint to identify an individual 
nearly uniquely. The fourth lane in the gel (lane F) contains the products of the same PCR amplifications carried out on a hypothetical 
forensic DNA sample, which could have been obtained from a single hair or a tiny spot of blood left at a crime scene. 

The more loci that are examined, the more confidence we can have about the results. When examining the variability at 5-10 different 
STR loci, the odds that two random individuals would share the same fingerprint by chance are approximately one in 10 billion. In 
the case shown here, individuals A and C can be eliminated from inquiries, while B is a clear suspect. A similar approach is now used 
routinely in paternity testing. 
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Figure 10-16 The dideoxy method 

of sequencing DNA relies on chain- 
terminating dideoxynucleoside 
triphosphates (ddNTPs). These 

ddNTPs are derivatives of the normal 
deoxyribonucleoside triphosphates that lack 
the 3’ hydroxyl group. When incorporated 
into a growing DNA strand, they block 
further elongation of that strand. 


SEQUENCING DNA 


Because information is encoded in the linear sequence of nucleotides in 
an organism's genome, the key to understanding the function and regu- 
lation of genes and genomes lies in the sequence of the DNA. Nucleotide 
sequences can reveal clues to the evolutionary relationships among dif- 
ferent organisms, and provide insights into the causes of human disease. 
Knowing the sequence of a gene is a prerequisite for cloning that gene 
by PCR, and it allows large-scale production of any protein a gene might 
encode. 


Because sequence information is so valuable, a great deal of effort has 
been dedicated over the past few decades to the development of DNA 
sequencing technologies with greater speed and sensitivity. As a result, 
we now have a variety of sophisticated and powerful methods that make 
it possible to obtain the complete nucleotide sequence of a genome in a 
fraction of the time, and at a fraction of the cost, required even 10 years 
ago. 


In this section, we briefly describe the principles underlying the major 
DNA sequencing methods used today, and we provide a glimpse of some 
new sequencing technologies that are just around the corner. 


Dideoxy Sequencing Depends on the Analysis of DNA 
Chains Terminated at Every Position 


In the late 1970s, researchers developed several schemes for determin- 
ing, simply and quickly, the nucleotide sequence of any purified DNA 
fragment. The method that became the most widely used—and continues 
to be employed in some applications today—is called dideoxy sequenc- 
ing or Sanger sequencing (after the scientist who invented it). This 
technique uses DNA polymerase, along with special chain-terminating 
nucleotides called dideoxyribonucleoside triphosphates (Figure 10-16), 
to make partial copies of the DNA fragment to be sequenced. Dideoxy 
sequencing reactions ultimately produce a collection of different DNA 
copies that terminate at every position in the original DNA sequence. 


Although the original method could be quite laborious—particularly read- 
ing the nucleotide sequences from the bands on a sequencing gel—the 
procedure is now fully automated: robotic devices mix the reagents— 
including the four different chain-terminating dideoxynucleotides, each 
tagged with a different-colored fluorescent dye—and load the reaction 
samples onto long, thin capillary gels, which separate the reaction prod- 
ucts into a series of distinct bands. A detector then records the color of 
each band, and a computer translates the information into a nucleotide 
sequence (Figure 10-17). 


The automated dideoxy method made it possible to sequence the first 
genomes of humans and of many other organisms, including most of 
those discussed in this book. How such sequence information was ana- 
lyzed to assemble a complete genome sequence—for example, the initial 
draft of the human genome—is described in How We Know, pp. 348-349. 
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Next-Generation Sequencing Techniques Make 
Genome Sequencing Faster and Cheaper 


Newer methods for the determination of nucleotide sequence, devel- 
oped over the past decade or so, have made genome sequencing much 
more rapid—and much cheaper. As the cost of sequencing DNA has 
plummeted, the number of genomes that have been sequenced has sky- 
rocketed. These rapid methods allow multiple genomes to be sequenced 
in parallel in a matter of weeks. With these techniques—collectively 
referred to as second-generation sequencing methods—investigators have 
been able to examine thousands of human genomes, catalog the varia- 
tion in nucleotide sequences from people around the world, and uncover 
the mutations that increase the risk of various diseases—from cancer to 
autism—as we discuss in Chapter 19. 


Although each method differs in detail, many rely on the sequencing of 
libraries of DNA fragments that, taken together, represent the DNA of the 
entire genome. Instead of using bacterial cells to generate these libraries 
(as seen in Figure 10-8), however, the libraries are synthesized by PCR 
amplification of a collection of DNA fragments, each attached to a solid 
support such as a glass slide or bead. The resulting PCR-generated cop- 
ies, instead of drifting away in solution, remain bound in proximity to 
their original “parent” DNA fragment. The process thus generates DNA 
clusters, each containing about 1000 identical copies of a single DNA 
fragment. All of these clusters are then sequenced at the same time. One 
of the most common methods for doing so is called Illumina sequenc- 
ing. Like automated dideoxy sequencing, Illumina sequencing is based 
on the use of chain-terminating nucleotides with uniquely colored fluo- 
rescent tags. In the Illumina method, however, the fluorescent tags and 
the chemical group that blocks elongation are removable. Once DNA 
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Figure 10-17 Automated dideoxy 
sequencing relies on a set of four 
ddNTPs, each bearing a uniquely colored 
fluorescent tag. (A) To determine the 
complete sequence of a single-stranded 
fragment of DNA (gray), the DNA is 

first hybridized with a short DNA primer 
(orange). The DNA is then mixed with DNA 
polymerase (not shown), an excess amount 
of normal dNTPs, and a mixture containing 
small amounts of all four chain-terminating 
ddNTPs, each of which is labeled with 

a fluorescent tag of a different color. 
Because the chain-terminating ddNTPs will 
be incorporated only occasionally, each 
reaction produces a diverse set of DNA 
copies that terminate at different points 

in the sequence. The reaction products 

are loaded onto a long, thin capillary 

gel and separated by electrophoresis. A 
camera reads the color of each band on 
the gel and feeds the data to a computer 
that assembles the sequence (not shown). 
The sequence read from the gel will be 
complementary to the sequence of the 
original DNA molecule. (B) A tiny part of the 
data from such an automated sequencing 
run. Each colored peak represents a 
nucleotide in the DNA sequence. 


“ HOW WE KNOW 
SEQUENCING THE HUMAN GENOME 


When DNA sequencing techniques became fully auto- and then trying to put one whole copy of the book back 
mated, determining the order of the nucleotides in a together again by matching up the words or phrases or 
piece of DNA went from being an elaborate Ph.D. thesis sentences that appear on each piece. (Several copies 
project to a routine laboratory chore. Feed DNA into the would be needed to generate enough overlap for reas- 
sequencing machine, add the necessary reagents, and sembly.) It could be done, but it would be much easier if 
out comes the sought-after result: the order of As, Ts, the book were, say, only two pages long. 


Ge adie orig could pe inpia For this reason, a straight-out shotgun approach is the 


So why was sequencing the human genome such a strategy of choice only for sequencing small genomes. 
formidable task? Largely because of its size. The DNA The method proved its worth in 1995, when it was used 
sequencing methods employed at the time were lim- to sequence the genome of the infectious bacterium 
ited by the physical size of the gel used to separate the Haemophilus influenzae, the first organism to have its 
labeled fragments (see, for example, Figure Q10-9). At complete genome sequence determined. The trouble 
most, only a few hundred nucleotides could be read with shotgun sequencing is that the reassembly pro- 
from a single gel. How, then, do you handle a genome cess can be derailed by repetitive nucleotide sequences. 
that contains billions of nucleotide pairs? Although rare in bacteria, these sequences make up a 
large fraction of vertebrate genomes (see Figure 9-33). 
Highly repetitive DNA segments make it difficult to 
piece DNA sequences back together accurately (Figure 
10-19). Returning to the ECB analogy, this chapter alone 
contains more than a few instances of the phrase “the 
human genome.” Imagine that one slip of paper from the 
shredded ECBs contains the information: “So why was 
sequencing the human genome” (which appears at the 
start of this section); another contains the information: 
“the human genome sequence consortium combined 
shotgun sequencing with a clone-by-clone approach” 
The most straightforward approach to sequencing a (which appears below). You might be tempted to join 
genome is to break it into random fragments, separate these two segments together based on the overlapping 
and sequence each of the single-stranded fragments, phrase “the human genome.” But you would wind up 
and then use a powerful computer to order these pieces with the nonsensical statement: “So why was sequenc- 
using sequence overlaps to guide the assembly (Figure ing the human genome sequence consortium combined 
10-18). This approach is called the shotgun sequencing shotgun sequencing with a clone-by-clone approach.” 
strategy. As an analogy, imagine shredding several cop- You would also lose the several paragraphs of impor- 
ies of Essential Cell Biology (ECB), mixing up the pieces, tant text that originally appeared between these two 
instances of “the human genome.” 


The solution is to break the genome into fragments and 
sequence these smaller pieces. The main challenge then 
comes in piecing the short fragments together in the 
correct order to yield a comprehensive sequence of a 
whole chromosome, and ultimately a whole genome. 
There are two main strategies for accomplishing this 
genomic breakage and reassembly: the shotgun method 
and the clone-by-clone approach. 


Shotgun sequencing 


And that’s just in this section. The phrase “the human 


TEE genome” appears in many chapters of this book. Such 

repetition compounds the problem of placing each 

RANDOM fragment in its correct context. To circumvent these 
| FRAGMENTATION assembly problems, researchers in the human genome 


sequence consortium combined shotgun sequencing 
with a clone-by-clone approach. 
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Figure 10-19 Repetitive DNA sequences in a genome make it 
difficult to accurately assemble its fragments. In this example, 
the DNA contains two segments of repetitive DNA, each made 
of many copies of the sequence GATTACA. When the resulting 
sequences are examined, two fragments from different parts 

of the DNA appear to overlap. Assembling these sequences 
incorrectly would result in a loss of the information (in brackets) 
that lies between the original repeats. 


The researchers then determined where each of these 
DNA fragments fit into the existing map of the human 
genome. To do this, different restriction enzymes were 
used to cut each clone to generate a unique restriction- 
site “signature.” The locations of the restriction sites in 
each fragment allowed researchers to map each BAC 
clone onto a restriction map of a whole human genome 
that had been generated previously using the same set of 
restriction enzymes (Figure 10-20). 


Knowing the relative positions of the cloned fragments, 
the researchers then selected some 30,000 BACs, sheared 
each into smaller fragments, and determined the nucleo- 
tide sequence of each BAC separately using the shotgun 
method. They could then assemble the whole genome 
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sequence by stitching together the sequences of thou- 
sands of individual BACs that span the length of the 
genome. 


The beauty of this approach was that it was relatively 
easy to accurately determine where the BAC fragments 
belong in the genome. This mapping step reduced the 
likelihood that regions containing repetitive sequences 
were assembled incorrectly, and it virtually eliminated 
the possibility that sequences from different chromo- 
somes were mistakenly joined together. Returning to 
the textbook analogy, the BAC-based approach is akin to 
first separating your copies of ECB into individual pages 
and then shredding each page into its own separate pile. 
It should be much easier to put the book back together 
when one pile of fragments contains words from page 1, 
a second pile from page 2, and so on. And there's virtu- 
ally no chance of mistakenly sticking a sentence from 
page 40 into the middle of a paragraph on page 412. 


All together now 


The clone-by-clone approach produced the first draft 
of the human genome sequence in 2000 and the com- 
pleted sequence in 2004. As the set of instructions that 
specify all of the RNA and protein molecules needed to 
build a human being, this string of genetic bits holds the 
secrets to human development and physiology. But the 
sequence was also of great value to researchers inter- 
ested in comparative genomics or in the physiology of 
other organisms: it eased the assembly of nucleotide 
sequences from other mammalian genomes—mice, rats, 
dogs, and other primates. It also made it much easier to 
determine the nucleotide sequences of the genomes of 
individual humans by providing a framework on which 
the new sequences could be simply superimposed. 


The first human sequence was the only mammalian 
genome completed in this methodical way. But the 
Human Genome Project was an unqualified success in 
that it provided the techniques, confidence, and momen- 
tum that drove the development of the next generation 
of DNA sequencing methods, which are now rapidly 
transforming all areas of biology. 
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and the sites at which the different enzymes cut 
each clone are recorded. The distinctive pattern 
of restriction sites allows investigators to order 
the fragments and place them on a restriction 
map of a human genome that had been 
previously generated using the same nucleases. 
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CHAPTER 10 


Analyzing the Structure and Function of Genes 


polymerase has added the labeled, chain-terminating nucleotide, a photo 
of the slide is taken and the identity of the nucleotide added at each clus- 
ter is recorded; the label and the chain-terminator are then stripped away, 
allowing DNA polymerase to add the next nucleotide (Figure 10-21). 


More recent technological advances have led to the development of third- 
generation sequencing methods that permit the sequencing of just a single 
molecule of DNA. One of these techniques, called Single Molecule Real 
Time sequencing, employs a special apparatus in which a single DNA 
polymerase and a DNA template with an attached primer are anchored 
together in a tiny compartment with differently colored fluorescent 
dNTPs. As DNA synthesis proceeds, the attachment of each nucleotide 
to the growing DNA strand is determined one base at a time, reveal- 
ing the sequence of the template; as in other sequencing methods, large 
numbers of reactions are measured in parallel in separate compartments. 
In another method, still under development, a single DNA molecule is 
pulled slowly through a tiny channel, like thread through the eye of a 
needle. Because each of the four nucleotides has different, characteristic 
chemical properties, the way a nucleotide obstructs the pore as it passes 
through reveals its identity—information that is then used to compile the 
sequence of the DNA molecule. Further refinement of these and other 
technologies will continue to drive down the amount of time and money 
required to sequence a human genome. 


Comparative Genome Analyses Can Identify Genes and 
Predict Their Function 


Strings of nucleotides, at first glance, reveal nothing about how that 
genetic information directs the development of a living organism—or 
even what type of organism it might encode. One way to learn something 
about the function of a particular nucleotide sequence is to compare it 
with the multitude of sequences available in public databases. Using a 
computer program to search for sequence similarity, one can determine 
whether a nucleotide sequence contains a gene and what that gene is 
likely to do—based on the gene's known activity in other organisms. 


Comparative analyses have revealed that the coding regions of genes 
from a wide variety of organisms show a large degree of sequence conser- 
vation (see Figure 9-20). The sequences of noncoding regions, however, 
tend to diverge rapidly over evolutionary time (see Figure 9-19). Thus, a 
search for sequence similarity can often indicate from which organism a 
particular piece of DNA was derived, and which species are most closely 
related. Such information is particularly useful when the origin of a DNA 
sample is unknown—because it was extracted, for example, from a sam- 
ple of soil or seawater or the blood of a patient with an undiagnosed 
infection. 


EXPLORING GENE FUNCTION 


Knowing where a nucleotide sequence comes from—or even what activ- 
ity it might have—is only the first step toward determining what role it has 
in the development or physiology of an organism. The knowledge that a 
particular DNA sequence encodes a transcription regulator, for exam- 
ple, does not reveal when and where that protein is produced, or which 
genes it might regulate. To learn that, investigators must head back to 
the laboratory. 


This is where creativity comes in. There are as many ways to study how 
genes function as there are scientists with an interest in studying the 
question. The techniques an investigator chooses often depend on his or 
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her background and training: a geneticist might, for example, engineer 
mutant organisms in which the activity of the gene has been disrupted, 
whereas a biochemist might take the same gene and produce large 
amounts of its protein to determine its three-dimensional structure. 


In this section, we present a few of the approaches that investigators 
currently use to study gene function. We explore a variety of techniques 
for investigating when and where a gene is expressed. We then describe 
how disrupting the activity of a gene in a cell, tissue, or whole plant or 
animal can provide insights into what that gene normally does. Finally, 
we explain how proteins can be produced in large amounts for biochemi- 
cal and structural studies. 


Analysis of mRNAs Provides a Snapshot of Gene 
Expression 


As we discuss in Chapter 8, a cell expresses only a subset of the thou- 
sands of genes available in its genome. This subset of genes differs from 
one cell type to another, and under different conditions in the same cell 
type. One way to determine which genes are being expressed in a popula- 
tion of cells or in a tissue is to analyze which mRNAs are being produced. 


To sequence all the RNAs produced by a cell, investigators make use of 
the next-generation sequencing technologies described earlier. In most 
cases, a collection of RNAs is converted into complementary DNA (cDNA) 
by reverse transcriptase, and these cDNAs are then sequenced. This 
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Figure 10-21 Illumina sequencing 

is based on the basic principles of 
automated dideoxy sequencing. 

(A) A genome or other large DNA sample 
of interest is broken into millions of short 
fragments. These fragments are attached 
to a glass surface and amplified by PCR to 
generate DNA clusters, each containing 
about a thousand copies of a single DNA 
fragment. The large number of clusters 
provides complete coverage of the 
genome. In the first step, the anchored DNA 
clusters are incubated with DNA polymerase 
and a special set of four nucleoside 
triphosphates (NTPs) with two reversible 
chemical modifications: a uniquely colored 
fluorescent marker and a 3’ chemical group 
that terminates DNA synthesis. No normal 
dNTPs are present in the reaction. After a 
nucleotide is added by DNA polymerase, 

a high-resolution digital camera records 
the color of the fluorescence at each 

DNA cluster. In the second step, the 

DNA is chemically treated to remove the 
fluorescent markers and chemical blockers. 
A new batch of fluorescent, reversible 
terminator NTPs is then added to initiate 
another round of DNA synthesis. These 
steps are repeated until the sequence is 
complete. The snapshots of each round 

of synthesis are compiled by computer 

to yield the sequence of each DNA 
fragment. The sequence of the millions of 
overlapping DNA fragments can then be 
used to reconstruct the complete genome 
sequence. (B) An image of a glass slide 
showing individual DNA clusters after a 
round of DNA synthesis with colored NTPs. 
(B, courtesy of Illumina, Inc.) 
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Figure 10-22 In situ hybridization can be 
used to detect the presence of a virus 

in cells. In this micrograph, the nuclei of 
cultured epithelial cells infected with the 
human papillomavirus (HPV) are stained 
pink by a fluorescent probe that recognizes 
a viral DNA sequence. The cytoplasm of all 
cells is stained green. (Courtesy of Hogne 
Røed Nilsen.) 


method, called RNA-Seq or deep RNA sequencing, provides a quantitative 
analysis of the transcriptome—the complete collection of RNAs produced 
by a cell under a certain set of conditions. It also reveals the number 
of times a particular sequence appears in a sample and can detect rare 
mRNAs, RNA transcripts that are alternatively spliced, mRNAs that har- 
bor sequence variations, and noncoding RNAs. This remarkably powerful 
technology has led to dramatic new insights into the genes expressed in 
a variety of cells and tissues at different times in development, during 
different stages of the cell-division cycle, in response to treatment with 
different drugs, or as a result of different mutations. 


In Situ Hybridization Can Reveal When and Where a 
Gene Is Expressed 


Although RNA-Seg can provide a list of genes that are being expressed 
by a particular tissue at a particular time, it does not reveal exactly where 
in the tissue those RNAs are produced. To do that, investigators use a 
technique called in situ hybridization (from the Latin in situ, “in place”), 
which allows a specific nucleic acid sequence—either DNA or RNA—to be 
visualized in its normal location. 


In situ hybridization uses single-stranded DNA or RNA probes, labeled 
with either fluorescent dyes or radioactive isotopes, to detect comple- 
mentary nucleic acid sequences within a tissue (Figure 10-22) or even 
on an isolated chromosome (Figure 10-23). The latter application is used 
in the clinic to determine, for example, whether fetuses carry abnormal 
chromosomes. In situ hybridization is also used to study the expression 
patterns of a particular gene or collection of genes in an adult or develop- 
ing tissue, providing important clues about when and where these genes 
carry out their functions. 


Reporter Genes Allow Specific Proteins to Be Tracked in 
Living Cells 


For a gene that encodes a protein, the location of the protein within the 
cell, tissue, or organism yields clues to the gene's function. Traditionally, 
the most effective way to visualize a protein within a cell or tissue 
involved using a labeled antibody. That approach requires the genera- 
tion of an antibody that specifically recognizes the protein of interest—a 
process that can be time-consuming and offers no guarantee of success. 


An alternative approach is to use the regulatory DNA sequences of the 
protein-coding gene to drive the expression of some type of reporter 
gene, which encodes a protein that can be easily monitored by its fluo- 
rescence or enzymatic activity. A recombinant gene of this type usually 
mimics the expression of the gene of interest, producing the reporter 


Figure 10-23 In situ hybridization can be used to locate genes on 
isolated chromosomes. Here, six different DNA probes have been 
used to mark the locations of their respective nucleotide sequences 

on human Chromosome 5 isolated from a mitotic cell in metaphase 
(see Figure 5-15 and Panel 18-1, pp. 628-629). The DNA probes have 
been labeled with different chemical groups and are detected using 
fluorescent antibodies specific for those groups. Both the maternal and 
paternal copies of Chromosome 5 are shown, aligned side-by-side. Each 
probe produces two dots on each chromosome because chromosomes 
undergoing mitosis have already replicated their DNA; therefore, 

each chromosome contains two identical DNA helices. The technique 
employed here is nicknamed FISH, for fluorescence in situ hybridization. 
(Courtesy of David C. Ward.) 
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protein when, where, and in the same amounts as the normal protein 
would be made (Figure 10-24A). This approach can also be used to study 
the regulatory DNA sequences that control the gene's expression (Figure 
10-24B). 


One of the most popular reporter proteins is green fluorescent protein 
(GFP), the molecule that gives luminescent jellyfish their greenish glow. If 
the gene that encodes GFP is fused to the regulatory sequences of a gene 
of interest, the expression of the resulting reporter gene can be moni- 
tored by fluorescence microscopy (Figure 10-25). The use of multiple GFP 
variants that fluoresce at different wavelengths can provide insights into 
how different cells interact in a living tissue (Figure 10-26). 


In some cases, the DNA encoding GFP is attached directly to the protein- 
coding region of the gene of interest, resulting in a GFP fusion protein 


Exploring Gene Function 353 


Figure 10-24 Reporter genes can be used 
to determine the pattern of a gene’s 
expression. (A) Suppose the goal is to find 
out which cell types (A-F) express protein 
X, but it is difficult to detect the protein 
directly—with antibodies, for example. 
Using recombinant DNA techniques, 

the coding sequence for protein X can 

be replaced with the coding sequence 

for reporter protein Y, which can be 

easily monitored visually; two commonly 
used reporter proteins are the enzyme 
B-galactosidase (see Figure 8—-14C) and 
green fluorescent protein (GFP, see Figure 
10-25). The expression of the reporter 
protein Y will now be controlled by the 
regulatory sequences (here labeled 1, 2, 
and 3) that control the expression of the 
normal protein X. (B) To determine which 
regulatory sequences normally control 
expression of gene X in particular cell types, 
reporters with various combinations of the 
regulatory regions associated with gene 

X can be constructed. These recombinant 
DNA molecules are then tested for 
expression after their introduction into the 
different cell types. 


Figure 10-25 Green fluorescent protein 
(GFP) can be used to identify specific 
cells in a living animal. For this experiment, 
carried out in the fruit fly, recombinant 

DNA techniques were used to join the 

gene encoding GFP to the regulatory DNA 
sequences that direct the production of a 
particular Drosophila protein. Both the GFP 
and the normal fly protein are made only in 
a specialized set of neurons. This image of a 
live fly larva was captured by a fluorescence 
microscope and shows approximately 

20 neurons, each with long extensions 
(axons and dendrites) that communicate 
with other (nonfluorescent) cells. These 
neurons, located just under the body 
surface, allow the organism to sense its 
immediate environment. (Courtesy of 
Samantha Galindo/Grueber Lab/Columbia 
University's Zuckerman Institute.) 
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Figure 10-26 GFPs that fluoresce at 
different wavelengths help reveal the 
connections that individual neurons 
make within the brain. This image shows 
differently colored neurons in one region 
of a mouse brain. The neurons express 
different combinations of differently colored 
GFPs, making it possible to distinguish 
and trace many individual neurons within 
a population. The stunning appearance of 
these labeled neurons earned the animals 
that bear them the colorful nickname 
“brainbow mice.” (From J. Livet et al., 
Nature 450:56-62, 2007. With permission 
from Macmillan Publishers Ltd.) 
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that often behaves in the same way as the normal protein produced by 
the gene. GFP fusion has become a standard strategy for tracking not 
only the location but also the movement of specific proteins in living cells 
(see How We Know, pp. 520-521). 


The Study of Mutants Can Help Reveal the Function 
of a Gene 


Although it may seem counterintuitive, one of the best ways to determine 
a gene's function is to see what happens to an organism when the gene 
is inactivated by a mutation. Before the advent of gene cloning, geneti- 
cists would often study the mutant organisms that arise at random in a 
population. The mutants of most interest were often selected because 
of their unusual phenotype—fruit flies with white eyes or curly wings, for 
example. The gene responsible for the mutant phenotype could then be 
studied by breeding experiments, as Gregor Mendel did with peas in the 
nineteenth century (discussed in Chapter 19). 


Although mutant organisms can arise spontaneously, they do so infre- 
quently. The process can be accelerated by treating organisms with 
radiation or chemical mutagens, which randomly disrupt gene activity. 
Such random mutagenesis generates large numbers of mutant organisms, 
each of which can then be studied individually. This “classical genetic 
approach,” which we discuss in detail in Chapter 19, is most applicable to 
organisms that reproduce rapidly and can be analyzed genetically in the 
laboratory—such as bacteria, yeasts, nematode worms, and fruit flies— 
although it has also been used to study zebrafish and mice, which require 
more time to reproduce and develop. 


RNA Interference (RNAi) Inhibits the Activity of Specific 
Genes 


DNA technology has made possible more targeted genetic approaches 
to studying gene function. Instead of beginning with a randomly gener- 
ated mutant and then identifying the responsible gene, a gene of known 
sequence can be inactivated deliberately, and the effects on the cell or 
organism's phenotype can be observed. Because this strategy is essen- 
tially the reverse of that used in classical genetics—which goes from 
mutants to genes—it is often referred to as reverse genetics. 
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One of the fastest and easiest ways to silence genes in cells and organ- 
isms is via RNA interference (RNAi). Discovered in 1998, RNAi exploits a 
natural mechanism used in a wide variety of plants and animals to protect 
themselves against infection with certain viruses and the proliferation of 
mobile genetic elements (discussed in Chapter 9). The technique involves 
introducing into a cell or organism double-stranded RNA molecules with 
a nucleotide sequence that matches the gene to be inactivated. The 
double-stranded RNA is cleaved and processed by special RNAi machin- 
ery to produce shorter, double-stranded fragments called small interfering 
RNAs (siRNAS). These siRNAs are separated to form single-stranded RNA 
fragments that hybridize with the target gene’s mRNAs and direct their 
degradation (see Figure 8-28). In some organisms, the same fragments 
can direct the production of more siRNAs, allowing continued inactiva- 
tion of the target mRNAs. 


(A) 


RNAi is frequently used to inactivate genes in cultured mammalian cell 
lines, Drosophila, and the nematode C. elegans. Introducing double- 
stranded RNAs into C. elegans is particularly easy: the worm can be 
fed with E. coli that have been genetically engineered to produce the 
double-stranded RNAs that trigger RNAi (Figure 10-27). These RNAs are 
converted into siRNAs, which are then distributed throughout the ani- 
mal’s body to inhibit expression of the target gene in various tissues. For 
the many organisms whose genomes have been completely sequenced, 
RNAi can, in principle, be used to explore the function of any gene, and 
large collections of DNA vectors that produce these double-stranded 
RNAs are available for several species. 


A Known Gene Can Be Deleted or Replaced with an 
Altered Version 


Despite its usefulness, RNAi has some limitations. Non-target genes are 
sometimes inhibited along with the gene of interest, and certain cell types 
are resistant to RNAi entirely. Even for cell types in which the mechanism 
functions effectively, gene inactivation by RNAi is often temporary, earn- 
ing the description “gene knockdown.” 


Fortunately, there are other, more specific and effective means of elimi- 
nating gene activity in cells and organisms. The coding sequence of a 
cloned gene can be mutated in vitro to change the functional properties 
of its protein product. Alternatively, the coding region can be left intact 
and the regulatory region of the gene changed, so that the amount of 
protein made will be altered or the gene will be expressed in a different 
type of cell or at a different time during development. By re-introducing 
this altered gene back into the organism from which it originally came, 
one can produce a mutant organism that can be studied to determine 
the gene’s function. Often the altered gene is inserted into the genome 
of reproductive cells so that it can be stably inherited by subsequent 
generations. Organisms whose genomes have been altered in this way 
are known as transgenic organisms, or genetically modified organisms 
(GMOs); the introduced gene is called a transgene. 


To study the function of a gene that has been altered in vitro, ideally one 
would like to generate an organism in which the normal gene has been 
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Figure 10-27 Gene function can be 
tested by RNA interference. (A) Double- 
stranded RNA (dsRNA) can be introduced 
into C. elegans by feeding the worms 

E. coli that express the dsRNA. Gene 
function is reduced in all tissues, including 
the reproductive tissues where embryos 
are produced by self-fertilization. (B) In a 
wild-type worm embryo, the egg and sperm 
pronuclei (red arrowheads) come together 
in the posterior half of the embryo shortly 
after fertilization. (C) In an embryo in which 
a particular gene has been silenced by 
RNAi, the pronuclei fail to migrate. This 
experiment revealed an important but 
previously unknown function of this gene 
in embryonic development. (B and C, 
from P. Gönczy et al., Nature 408:331-336, 
2000. With permission from Macmillan 
Publishers Ltd.) 
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Figure 10-28 Targeted gene replacement 
in mice utilizes embryonic stem (ES) 

cells. (A) First, an altered version of the 
gene is introduced into cultured ES cells. 

In a few rare ES cells, the altered gene will 
replace the corresponding normal gene 
through homologous recombination (as 
described in Chapter 6, pp. 220-222 and 
Figure 6-31). Although the procedure is 
often laborious, these rare cells can be 
identified and cultured to produce many 
descendants, each of which carries an 
altered gene in place of one of its two 
normal corresponding genes. (B) Next, the 
altered ES cells are injected into a very early 
mouse embryo; the cells are incorporated 
into the growing embryo, which then 
develops into a mouse that contains some 
somatic cells (colored orange) that carry the 
altered gene. Some of these mice may also 
have germ-line cells that contain the altered 
gene; when bred with a normal mouse, 
some of the progeny of these mice will 
contain the altered gene in all of their cells. 
Such a mouse is called a “knock-in” mouse. 
If two such mice are bred, one can obtain 
progeny that contain two copies of the 
altered gene—one on each chromosome— 
in all of their cells. 
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replaced by the altered one. In this way, the function of the mutant protein 
can be analyzed in the absence of the normal protein. A common way 
of doing this in mice makes use of cultured mouse embryonic stem (ES) 
cells (discussed in Chapter 20). These cells are first subjected to targeted 
gene replacement before being transplanted into a developing embryo to 
produce a mutant mouse, as illustrated in Figure 10-28. 


Using a similar strategy, the activities of both copies of a gene can be 
eliminated entirely, creating a “gene knockout.” To do this, one can 
either introduce an inactive, mutant version of the gene into cultured ES 
cells or delete the gene altogether. The ability to use ES cells to produce 
such “knockout mice” revolutionized the study of gene function, and the 
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Figure 10-29 Transgenic mice with a mutant DNA helicase show premature 
aging. The helicase, encoded by the Xpd gene, is involved in both transcription 

and DNA repair. Compared with a wild-type mouse (A), a transgenic mouse that 
expresses a defective version of Xpd (B) exhibits many of the symptoms of premature 
aging, including osteoporosis, emaciation, early graying, infertility, and reduced life- 
span. The mutation in Xpd used here impairs the activity of the helicase and mimics 
a human mutation that causes trichothiodystrophy, a disorder characterized by brittle 
hair, skeletal abnormalities, and a greatly reduced life expectancy. These results 
support the hypothesis that an accumulation of DNA damage contributes to the 
aging process in both humans and mice. (From J. de Boer et al., Science 296:1276- 
1279, 2002. With permission from AAAS.) 


technique is now being used to systematically determine the function of 
every mouse gene (Figure 10-29). 


A variation of this technique can be used to produce conditional knockout 
mice, in which a known gene can be disrupted more selectively—only in 
a particular cell type or at a certain time in development. The strategy 
involves the introduction of an enzyme, called a recombinase, that can 
be directed to selectively excise—and thus disable—a gene of interest 
(Figure 10-30). Such conditional knockouts are useful for studying genes 
with a critical function during development, because mice missing these 
crucial genes often die before birth. 
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Figure 10-31 The CRISPR system can be 
used to study gene function in a variety 
of species. (A) The Cas? protein, along with 
a guide RNA designed by the experimenter, 
are both artificially expressed in the cell or 
species of interest. One portion of the guide 
RNA (light blue) associates with Cas9, and 
another segment (dark blue) is designed to 
match a particular target sequence in the 
genome. (B) Once Cas? has made a double- 
strand break in the target gene, that gene 
can be replaced with an experimentally 
altered gene by the enzymes that repair 
double-strand breaks through homologous 
recombination (see Figure 6—31). In this way, 
the CRISPR system promotes the precise 
and rapid replacement of a target gene. 

(C and D) By using a mutant form of Cas? 
that can no longer cleave DNA, Cas? can 
be used to activate a normally dormant 
gene (C) or turn off an actively expressed 
gene (D). (Adapted from P. Mali et al., Nat. 
Methods 10:957-963, 2013.) 
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Genes Can Be Edited with Great Precision Using the 
Bacterial CRISPR System 


Bacteria employ several mechanisms to protect themselves from foreign 
DNA. One line of defense is provided by the restriction enzymes, as pre- 
viously discussed. Recently, the discovery of another bacterial defense 
system led to the development of a powerful new method for editing 
genes in a variety of cells, tissues, and organisms. This system, called 
CRISPR, relies on a bacterial enzyme called Cas9, which produces a dou- 
ble-strand break in a molecule of DNA. Unlike restriction enzymes, Cas9 
is not sequence-specific; to direct Cas9 to its target sequence, investiga- 
tors provide the enzyme with a guide RNA molecule. This guide RNA, 
carried by Cas9, allows the enzyme to search the genome and bind to a 
segment of DNA with a complementary sequence (Figure 10—-31A). The 
gene coding for Cas9 has been genetically engineered into a variety of 
organisms; thus, to use the CRISPR system to target a gene—or multi- 
ple genes—researchers need only introduce the appropriate guide RNAs 
(Movie 10.2). 


AS we Saw in Chapter 6, double-strand breaks, like the one induced by 
Cas9, are often repaired by homologous recombination—a process that 
uses the information on an undamaged segment of DNA to repair the 
break. Thus, to replace a target gene using CRISPR, investigators sim- 
ply provide an altered version of the gene to serve as a template for the 
homologous repair. In this way, a target gene can be selectively cut by 
the CRISPR system and replaced at high efficiency by an experimentally 
altered version of the gene (Figure 10—31B).The CRISPR system therefore 
provides another means of generating transgenic organisms. 


Researchers are also adapting the CRISPR system for turning selected 
genes on or off. In this case, a catalytically inactive Cas9 protein can be 
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fused to a transcription activator or repressor; this hybrid transcription 
regulator can then be directed to a target gene by the appropriate guide 
RNA (Figure 10-31C and D). 


The transfer of the CRISPR system from bacteria to virtually all other 
experimental organisms—including mice, zebrafish, worms, flies, rice, 
and wheat—has revolutionized the study of gene function. Like the earlier 
discoveries of restriction enzymes and RNAi, this incredible breakthrough 
came from the work of scientists who were studying a fascinating biolog- 
ical phenomenon without—at first—realizing the enormous impact these 
discoveries would have on all aspects of biology, including human health. 
Such unintentional application highlights the fundamental importance of 
basic research. 


Mutant Organisms Provide Useful Models of Human 
Disease 


Technically speaking, transgenic approaches—including CRISPR—could 
be used to alter genes in the human germ line. Such manipulations would 
be unethical. However, transgenic technologies are currently being used 
to generate animal models of human diseases in which mutant genes 
play a major part. 


With the explosion of DNA sequencing technologies, investigators can 
rapidly search the genomes of patients for mutations that cause or greatly 
increase the risk of their disease (discussed in Chapter 19). These muta- 
tions can then be introduced into animals, such as mice, that can be 
studied in the laboratory. The resulting transgenic animals, which often 
mimic some of the phenotypic abnormalities associated with the condi- 
tion in patients, can be used to explore the cellular and molecular basis of 
the disease and to screen for drugs that could potentially be used thera- 
peutically in humans. 


An encouraging example is provided by fragile X syndrome, a neuropsy- 
chiatric disorder associated with intellectual impairment, neurological 
abnormalities, and often autism. The disease is caused by a mutation 
in the fragile X mental retardation gene (FMR1), which encodes a protein 
that inhibits the translation of mRNAs into proteins at synapses—the 
junctions where nerve cells communicate with one another (see Figure 
12-39). Transgenic mice in which the FMRI gene has been disabled show 
many of the same neurological and behavioral abnormalities seen in 
patients with the disorder, and drugs that return synaptic protein synthe- 
sis to near-normal levels also reverse many of the problems seen in these 
mutant mice. Preliminary studies suggest that at least one of these drugs 
may benefit patients with the disease. 


Transgenic Plants Are Important for both Cell Biology 
and Agriculture 


Although we tend to think of DNA technology in terms of animal biology, 
these techniques have also had a profound impact on the study of plants. 
In fact, certain features of plants make them especially amenable to these 
methods. 


When a piece of plant tissue is cultured in a sterile medium containing 
nutrients and appropriate growth regulators, some of the cells are stimu- 
lated to proliferate indefinitely in a disorganized manner, producing a 
mass of relatively undifferentiated cells called a callus. If the nutrients and 
growth regulators are carefully manipulated, one can induce the forma- 
tion of a shoot within the callus, and in many species a whole new plant 
can be regenerated from such shoots. In a number of plants—including 
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Figure 10-32 Transgenic plants can be 
made using recombinant DNA techniques 
optimized for plants. A disc is cut out 

of a leaf and incubated in a culture of 
Agrobacterium that carries a recombinant 
plasmid with both a selectable marker and 
a desired genetically engineered gene. 
The wounded plant cells at the edge of 
the disc release substances that attract 

the bacteria, which inject their DNA into 
the plant cells. Only those plant cells 

that take up the appropriate DNA and 
express the selectable marker gene survive 
and proliferate and form a callus. The 
manipulation of growth factors supplied to 
the callus induces it to form shoots, which 
subsequently root and grow into adult 
plants carrying the engineered gene. 


Figure 10-33 DNA technology allows the 
production of rice grains with high levels 
of 3-carotene. To help reduce vitamin A 
deficiency in the developing world, a strain 
of rice, called “golden rice,” was developed 
in which the edible part of the grain (called 
the endosperm) contains large amounts of 
B-carotene, which is converted in the human 
gut to vitamin A. (A) Rice plants, like most 
other plants, can synthesize B-carotene in 
their leaves from an abundant precursor 
(geranylgeranyl pyrophosphate) found in all 
plant tissues. However, the genes that code 
for two of the enzymes that act early in this 
biosynthetic pathway are turned off in the 
endosperm, preventing the production of 
B-carotene in rice grains. To produce golden 
rice, the genes for these two enzymes in the 
pathway were obtained from organisms that 
produce large amounts of B-carotene: one 
from maize and the other from a bacterium. 
Using DNA technology, these genes were 
connected to a promoter that drives gene 
expression in rice endosperm. Using the 
method outlined in Figure 10-32, this 
engineered DNA was then used to generate 
a transgenic rice plant that expresses these 
enzymes in endosperm, resulting in rice 
grains that contain high levels of B-carotene. 
Compared to the milled grains of wild-type 
rice (B), the grains of the transgenic rice are 
a deep yellow/orange due to the presence 
of B-carotene (C). (B and C, from J.A. Paine 
et al., Nature Biotechnology, Letters 23: 
482-487, 2005. With permission from 
Macmillan Publishers Ltd.) 
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tobacco, petunia, carrot, potato, and Arabidopsis—a single cell from such 
a callus can be grown into a small clump of cells from which a whole 
plant can be regenerated (see Figure 8-2B). Just as mutant mice can be 
derived by the genetic manipulation of embryonic stem cells in culture, 
transgenic plants can be created from plant cells transfected with DNA in 
culture (Figure 10-32). 


The ability to produce transgenic plants has greatly accelerated progress 
in many areas of plant cell biology. It has played an important part, for 
example, in isolating receptors for growth regulators and in analyzing 
the mechanisms of morphogenesis and of gene expression in plants. 
These techniques have also opened up many new possibilities in agri- 
culture that could benefit both the farmer and the consumer. They have 
made it possible, for example, to modify the ratio of lipid, starch, and 
protein in seeds, to impart pest and virus resistance to plants, and to cre- 
ate modified plants that tolerate extreme habitats such as salt marshes or 
water-stressed soil. One variety of rice has been genetically engineered 
to produce ß-carotene, the precursor of vitamin A (Figure 10-33). If it 
replaced conventional rice, this “golden rice”—so called because of its 
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Figure 10-34 Large amounts of a protein can be produced from expression vector 
a protein-coding DNA sequence inserted into an expression 
vector and introduced into cells. Here, a plasmid vector has 

been engineered to contain a highly active promoter, which causes 
unusually large amounts of mRNA to be produced from the inserted 


: eh promoter 
protein-coding gene. Depending on the characteristics of the cloning sequence Cur DNA WITH 
vector, the plasmid is introduced into bacterial, yeast, insect, or RESTRICTION NUCLEASE 


mammalian cells, where the inserted gene is efficiently transcribed and 


translated into protein. C — ) 
yellow/orange color—could help to alleviate severe vitamin A deficiency, 


INSERT PROTEIN- 


which causes blindness in hundreds of thousands of children in the CODING DNA SEQUENCE 


developing world each year. —————s 
Even Rare Proteins Can Be Made in Large Amounts 

One of the most important contributions of DNA cloning and genetic INTRODUCE 
engineering to cell biology is that they make it possible to produce any RECOMBINANT ENE | 


protein, including the rare ones, in large amounts. Such high-level pro- 
duction is usually accomplished by using specially designed vectors 
known as expression vectors. These vectors include transcription and 
translation signals that direct an inserted gene to be expressed at high 
levels. Different expression vectors are designed for use in bacterial, 
yeast, insect, or mammalian cells, each containing the appropriate regu- 
latory sequences for transcription and translation in these cells (Figure 
10-34). The expression vector is replicated at each round of cell division, 
so that the transfected cells in the culture are able to synthesize large 
amounts of the protein of interest—sometimes comprising 1-10% of the 
total cell protein. It is usually a simple matter to purify this protein away 
from the other proteins made by the host cell. 


This technology is now used to make large amounts of many medically 
useful proteins, including hormones (such as insulin), growth factors, ther- 
apeutic antibodies, and viral coat proteins for use in vaccines. Expression 
vectors also allow scientists to produce many proteins of biological inter- 
est in large enough amounts for detailed structural and functional studies 
that were once impossible—especially for proteins that are normally 
present in very small amounts, such as some receptors and transcription 
regulators. Recombinant DNA techniques thus allow scientists to move 
with ease from protein to gene, and vice versa, so that the functions of 
both can be explored from multiple directions (Figure 10-35). 
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Figure 10-35 Recombinant DNA techniques make it possible to move experimentally from gene to protein or from protein 

to gene. A small quantity of a purified protein or peptide fragment is used to obtain a partial amino acid sequence, which is used to 
search a DNA database for the corresponding nucleotide sequence. This sequence is used to synthesize DNA primers, which can be 
used to clone the gene by PCR from a sequenced genome (see Figure 10-13). Once the gene has been isolated and sequenced, its 
protein-coding sequence can be inserted into an expression vector to produce large quantities of the protein (see Figure 10—34), which 
can then be studied biochemically or structurally. In addition to producing protein, the gene or DNA can also be manipulated and 
introduced into cells or organisms to study its function. 


362 


CHAPTER 10 


Analyzing the Structure and Function of Genes 


KEY TERMS 


complementary DNA (cDNA) hybridization 


cDNA library in situ hybridization 
CRISPR plasmid 
dideoxy (Sanger) sequencing polymerase chain reaction (PCR) 
DNA cloning recombinant DNA 
DNA library reporter gene 
DNA ligase restriction enzyme 
gene knockout RNA interference (RNAi) 
genomic library RNA-Seq 
green fluorescent protein transformation 
(GFP) transgenic organism 


ESSENTIAL CONCEPTS 


e DNA technology has revolutionized the study of cells, making it pos- 
sible to pick out any gene at will from the thousands of genes in a cell 
and to determine its nucleotide sequence. 


e A crucial element in this technology is the ability to cut a large DNA 
molecule into a specific and reproducible set of DNA fragments using 
restriction enzymes, each of which cuts the DNA double helix only at 
a particular nucleotide sequence. 


e DNA fragments can be separated from one another on the basis of 
size by gel electrophoresis. 


e DNA cloning techniques enable any DNA sequence to be selected 
from millions of other sequences and produced in unlimited amounts 
in pure form. 


e DNA fragments can be joined together in vitro by using DNA ligase to 
form recombinant DNA molecules that are not found in nature. 


e DNA fragments can be maintained and amplified by inserting them 
into a larger DNA molecule capable of replication, such as a plasmid. 
This recombinant DNA molecule is then introduced into a rapidly 
dividing host cell, usually a bacterium, so that the DNA is replicated 
at each cell division. 


e A collection of cloned fragments of chromosomal DNA representing 
the complete genome of an organism is known as a genomic library. 
The library is often maintained as millions of clones of bacteria, 
each different clone carrying a different fragment of the organism’s 
genome. 


e cDNA libraries contain cloned DNA copies of the total mRNA of a 
particular type of cell or tissue. Unlike genomic DNA clones, CDNA 
clones contain predominantly protein-coding sequences; they lack 
introns, regulatory DNA sequences, and promoters. Thus they are 
useful when the cloned gene is needed to make a protein. 


e Nucleic acid hybridization can detect any given DNA or RNA sequence 
in a mixture of nucleic acid fragments. This technique depends on 
highly specific base-pairing between a labeled, single-stranded 
DNA or RNA probe and another nucleic acid with a complementary 
sequence. 


e The polymerase chain reaction (PCR) is a powerful form of DNA 
amplification that is carried out in vitro using a purified DNA poly- 
merase. Cloning via PCR requires prior knowledge of the sequence to 
be amplified, because two synthetic oligonucleotide primers must be 
synthesized that bracket the portion of DNA to be replicated. 


DNA sequencing techniques have become increasingly fast and 
cheap, so that the entire genome sequences of thousands of differ- 
ent organisms are now known, including thousands of individual 
humans. 


Using DNA technology, a protein can be joined to a molecular tag, 
such as green fluorescent protein (GFP), which allows its move- 
ment to be tracked inside a cell and, in some cases, inside a living 
organism. 


In situ nucleic acid hybridization can be used to detect the precise 
location of genes on chromosomes and of RNAs in cells and tissues. 


RNA-Seq can be used to monitor the expression of all of the genes in 
a cell or tissue. 


Cloned genes can be altered in vitro and stably inserted into the 
genome of a cell or an organism to study their function. Such mutants 
are called transgenic organisms. 


The expression of particular genes can be inhibited in cells or organ- 
isms by the technique of RNA interference (RNAi), which prevents an 
MRNA from being translated into protein. 


Genes can be deleted or modified with high specificity by the CRISPR 
system, which uses guide mRNAs to promote DNA cleavage at a spe- 
cific nucleotide sequence in the genome. 


Bacteria, yeasts, and mammalian cells can be engineered to synthe- 
size large quantities of any protein whose gene has been cloned, 
making it possible to study proteins that are otherwise rare or dif- 
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QUESTION 10-5 


What are the consequences for a dideoxy DNA sequencing 
reaction if the ratio of dideoxyribonucleoside triphosphates 
to deoxyribonucleoside triphosphates is increased? What 
happens if this ratio is decreased? 


QUESTION 10-6 


Almost all the cells in an individual animal contain identical 
genomes. In an experiment, a tissue composed of several 
different cell types is fixed and subjected to in situ 
hybridization with a DNA probe to a particular gene. To 
your surprise, the hybridization signal is much stronger in 
some cells than in others. How might you explain this result? 


QUESTION 10-7 


After decades of work, Dr. Ricky M. isolated a small amount 
of attractase—an enzyme that produces a powerful human 
pheromone—from hair samples of Hollywood celebrities. 
To take advantage of attractase for his personal use, he 
obtained a complete genomic clone of the attractase 

gene, connected it to a strong bacterial promoter on an 
expression plasmid, and introduced the plasmid into E. 

coli cells. He was devastated to find that no attractase was 
produced in the cells. What is a likely explanation for his 
failure? 


QUESTION 10-8 

Which of the following statements are correct? Explain your 
answers. 

A. Restriction enzymes cut DNA at specific sites that are 
always located between genes. 


B. DNA migrates toward the positive electrode during 
electrophoresis. 


C. Clones isolated from cDNA libraries contain promoter 
sequences. 


D. PCR utilizes a heat-stable DNA polymerase because 
for each amplification step, double-stranded DNA must be 
heat-denatured. 


E. Digestion of genomic DNA with Alul, a restriction 
enzyme that recognizes a four-nucleotide sequence, 
produces fragments that are all exactly 256 nucleotides in 
length. 


F. To make a cDNA library, both a DNA polymerase and a 
reverse transcriptase must be used. 


G. DNA fingerprinting by PCR relies on the fact that 
different individuals have different numbers of repeats in 
STR regions in their genome. 


H. It is possible for a coding region of a gene to be present 
in a genomic library prepared from a particular tissue but 

to be absent from a cDNA library prepared from the same 
tissue. 


QUESTION 10-9 


A. What is the sequence of the DNA that was used in the 
sequencing reaction shown in Figure Q10-9? The four lanes 
show the products of sequencing reactions that contained 
ddG (lane 1), ddA (lane 2), ddT (lane 3), and ddC (lane 4). 
The numbers to the right of the autoradiograph represent 
the positions of marker DNA fragments of 50 and 116 
nucleotides. 
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lanes B. This DNA was derived from 
the middle of a cDNA clone of 
= = a mammalian protein. Using the 
~ genetic code table (see Figure 
=~ 7-27), can you determine the 
amino acid sequence of this 
a portion of the protein? 


P QUESTION 10-10 


A. How many different DNA 
fragments would you expect 

to obtain if you cleaved human 

P genomic DNA with Haelll? (Recall 
that there are 3 x 10? nucleotide 
pairs per haploid genome.) How 
many fragments would you expect 
with EcoRI? 


- B. Human genomic libraries 
used for DNA sequencing are 

-> a often made from fragments 
- obtained by cleaving human DNA 
— with Haelll in such a way that the 
DNA is only partially digested; 
~- that is, not all the possible Haelll 
— 50 sites have been cleaved. What is a 
9 possible reason for doing this? 


(Courtesy of Leander Lauffer and Peter Walter.) 
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(Figure Q10-11). 5 ia! 
. v - œe a— 
You do not know if 335) =— 


the molecule is linear L pE 
DNA or a DNA circle. 
DNA fragments of 

known sizes were 1>- ë =—— — 
electrophoresed on 


the same gel for use 
as size markers (left 
lane). The size of the 
DNA markers is given in kilobase pairs (kb), where 

1 kb = 1000 nucleotide pairs. Using the size markers as 
a guide, estimate the length of each restriction fragment 
obtained. From this information, construct a map of the 
original DNA molecule indicating the relative positions 
of all the restriction enzyme cleavage sites. 


Figure Q10-11 


QUESTION 10-12 


There has been a colossal snafu in the maternity ward of 
your local hospital. Four sets of male twins, born within 
an hour of each other, were inadvertently shuffled in 

the excitement occasioned by that unlikely event. You 
have been called in to set things straight. As a first step, 
you would like to match each baby with his twin. (Many 


Analyzing the Structure and Function of Genes 


1 2 3 4 5 6 7 8 newborns look alike so 
S A a fl you don't want to rely 
_ pay on appearance alone.) 
= To that end you analyze 
-_ =. d æ a small blood sample 
— a a= ™ æ | from each infant using a 
= 2. hybridization probe that 
detects short tandem 
~» “™® | repeats (STRs) located 
=> ææ» 
a = in widely scattered 
= “e~ _@ = regions of the genome. 
a> The results are shown in 
=> 
_— ææ | A. Which infants 
= sp >~ & are twins? Which are 
@ ww | identical twins? 
= -æ = æa | B. How could you 
3 ap . . 
-D 2 aS match a pair of twins to 
the correct parents? 


Figure Q10-12 


QUESTION 10-13 


One of the first organisms that was genetically modified 
using recombinant DNA technology was a bacterium 
that normally lives on the surface of strawberry plants. 
This bacterium makes a protein, called ice-protein, that 
causes the efficient formation of ice crystals around it 
when the temperature drops to just below freezing. Thus, 
strawberries harboring this bacterium are particularly 
susceptible to frost damage because their cells are 
destroyed by the ice crystals. Consequently, strawberry 
farmers have a considerable interest in preventing ice 
crystallization. 

A genetically engineered version of this bacterium was 
constructed in which the ice-protein gene was knocked 
out. The mutant bacteria were then introduced in large 
numbers into strawberry fields, where they displaced the 
normal bacteria by competition for their ecological niche. 
This approach has been successful: strawberries bearing the 
mutant bacteria show a much reduced susceptibility to frost 
damage. 

At the time they were first carried out, the initial 
open-field trials triggered an intense debate because 
they represented the first release into the environment of 
an organism that had been genetically engineered using 
recombinant DNA technology. Indeed, all preliminary 
experiments were carried out with extreme caution and in 
strict containment. 

Do you think that bacteria lacking the ice-protein could 
be isolated without the use of modern DNA technology? Is 
it likely that such mutations have already occurred in nature? 
Would the use of a mutant bacterial strain isolated from 
nature be of lesser concern? Should we be concerned about 
the risks posed by the application of recombinant DNA 
techniques in agriculture and medicine? Do the potential 
benefits outweigh the risks? Explain your answers. 


Membrane Structure 


A living cell is a self-reproducing system of molecules held inside a con- 
tainer. That container is the plasma membrane—a protein-studded, fatty 
film so thin that it cannot be seen directly in the light microscope. Every 
cell on Earth uses such a membrane to separate and protect its chemical 
components from the outside environment. Without membranes, there 
would be no cells, and thus no life. 


The structure of the plasma membrane is simple: it consists of a two-ply 
sheet of lipid molecules about 5 nm—or 50 atoms—thick, into which pro- 
teins have been inserted. Its properties, however, are unlike those of any 
sheet of material we are familiar with in the everyday world. Although 
it serves as a barrier to prevent the contents of the cell from escaping 
and mixing with molecules in the surrounding environment ( 

), the plasma membrane does much more than that. If a cell is to 
survive and grow, nutrients must pass inward across the plasma mem- 
brane, and waste products must make their way out. To facilitate this 
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(A) BACTERIAL CELL 


(B) EUKARYOTIC CELL 


THE LIPID BILAYER 


MEMBRANE PROTEINS 


Cell membranes act as 
selective barriers. The plasma membrane 
separates a cell from its surroundings, 
enabling the molecular composition of a 
cell to differ from that of its environment. 
(A) In some bacteria, the plasma membrane 
is the only membrane. (B) In addition to a 
plasma membrane, eukaryotic cells also 
have internal membranes that enclose 
individual organelles. All cell membranes 
prevent molecules on one side from freely 
mixing with those on the other, as indicated 
schematically by the colored dots. 
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Figure 11-3 Internal membranes 

form many different compartments 

in a eukaryotic cell. Some of the main 
membrane-enclosed organelles in a typical 
animal cell are shown here. Note that 

the nucleus and mitochondria are each 
enclosed by two membranes. 
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Figure 11-2 The plasma membrane is involved in cell 
communication, import and export of molecules, and cell growth 
and motility. (1) Receptor proteins in the plasma membrane enable 
the cell to receive signals from the environment; (2) channels and 
transporters in the membrane enable the import and export of small 
molecules; (3) the flexibility of the membrane and its capacity for 
expansion allow the cell to grow, change shape, and move. 


exchange, the membrane is penetrated by highly selective channels and 
transporters—proteins that allow specific, small molecules and ions to be 
imported and exported. Other proteins in the membrane act as sensors, 
or receptors, that enable the cell to receive information about changes in 
its environment and respond to them in appropriate ways. The mechani- 
cal properties of the plasma membrane are equally impressive. When a 
cell grows, so does its membrane: this remarkable structure enlarges in 
area by adding new membrane without ever losing its continuity, and it 
can deform without tearing, allowing the cell to move or change shape 
(Figure 11-2). The membrane is also self-healing: if it is pierced, it nei- 
ther collapses like a balloon nor remains torn; instead, the membrane 
quickly reseals. 


As shown in Figure 11-1, many bacteria have only a single membrane— 
the plasma membrane—whereas eukaryotic cells also contain internal 
membranes that enclose intracellular compartments. The internal mem- 
branes form various organelles, including the endoplasmic reticulum, 
Golgi apparatus, endosomes, and mitochondria (Figure 11-3). Although 
these internal membranes are constructed on the same principles as the 
plasma membrane, they differ subtly in composition, especially in their 
resident proteins. 


Regardless of their location, all cell membranes are composed of lipids 
and proteins and share a common general structure (Figure 11-4). The 
lipids are arranged in two closely apposed sheets, forming a lipid bilayer 
(see Figure 11-4B). This lipid bilayer serves as a permeability barrier to 
most water-soluble molecules, while the proteins embedded within it 
carry out the other functions of the membrane and give different mem- 
branes their individual characteristics. 


In this chapter, we consider the structure of biological membranes and 
the organization of their two main constituents: lipids and proteins. 
Although we focus mainly on the plasma membrane, most of the con- 
cepts we discuss also apply to internal membranes. The functions of cell 
membranes, including their role in cell communication, the transport of 
small molecules, and energy generation, are considered in later chapters. 
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THE LIPID BILAYER 


Because cells are filled with—and surrounded by—water, the structure of 
cell membranes is determined by the way membrane lipids behave in a 
watery (aqueous) environment. Lipid molecules are not very soluble in 
water, although they do dissolve readily in organic solvents such as ben- 
zene. It was this property that scientists exploited in 1925, when they set 
out to investigate how lipids are arranged in cell membranes. 


Using benzene, investigators extracted all the lipids from the plasma 
membranes of purified red blood cells. These lipids were then spread out 
in a film on the surface of a trough filled with water, like an oil slick on a 
puddle. Using a movable barrier, the researchers then pushed the floating 
lipids together until they formed a continuous sheet only one molecule 
thick. When the investigators measured the surface area of this mon- 
olayer, they found that it occupied twice the area of the original, intact 
cells. Based on this observation, they deduced that, in an intact cell mem- 
brane, lipid molecules must double up to form a bilayer—a finding that 
had a profound influence on cell biology. 


In this section, we take a closer look at this lipid bilayer, which consti- 
tutes the fundamental structure of all cell membranes. We consider how 
lipid bilayers form, how they are maintained, and how their properties 
establish the general properties of all cell membranes. 


Membrane Lipids Form Bilayers in Water 


The lipids found in cell membranes combine two very different proper- 
ties in a single molecule: each lipid has a hydrophilic (“water-loving”) 
head and a hydrophobic (‘water-fearing”) tail. The most abundant lipids 
in cell membranes are the phospholipids, which have a phosphate-con- 
taining, hydrophilic head linked to a pair of hydrophobic, hydrocarbon 
tails (Figure 11-5). For example, phosphatidylcholine, one of the most 
abundant phospholipids in the membranes of animals and plants, has the 
small molecule choline attached to a phosphate group as its hydrophilic 
head (Figure 11-6). 


Phospholipids are not the only membrane lipids that are amphipathic, a 
term used to describe molecules with both hydrophilic and hydrophobic 
parts. Cholesterol, which is found in animal cell membranes, and gly- 
colipids, which have sugars as part of their hydrophilic head, are also 
amphipathic (Figure 11-7). 


Having both hydrophilic and hydrophobic parts plays a crucial part in driv- 
ing lipid molecules to assemble into bilayers in an aqueous environment. 


bilayer 


} M (5 nm) 
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Figure 11-4 A cell membrane consists 

of a lipid bilayer in which proteins are 
embedded. (A) An electron micrograph of 
a plasma membrane of a human red blood 
cell seen in cross section. In this image, the 
proteins that extend from either side of the 
bilayer form the two closely spaced dark 
lines indicated by the brackets; the thin, 
white layer between them is the lipid bilayer. 
(B) Schematic drawing showing a three- 
dimensional view of a cell membrane. 

(A, by permission of E.L. Bearer.) 
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Figure 11-5 Cell membranes are packed 
with phospholipids. A typical membrane 
phospholipid molecule has a hydrophilic 
head and two hydrophobic tails. 


368 CHAPTER 11 Membrane Structure 


polar 
(hydrophilic) 
head 
L . " 
GLYCEROL Cy cu—dn, ’ 
7 O © ? 
C=O C=O 
CH, CH, 
CH, ~~ CH) 
CH, CH, 
CH) CH 
CH, CH, 
CH, CH, 
nonpolar CH, = CH) 
(hydrophobic) CH, CH double 
tails CH, w bond 
CH, ‘Cet 
CH, cH 
CH, 
CH, ‘CH, 
CH3 ‘CH, 
CH, CH, 
CH) CH) 
CH, CH; 
L CH, 


(A) (B) (©) 


Figure 11-6 Phosphatidylcholine is the most common phospholipid in cell membranes. It is represented 
schematically in (A), as a chemical formula in (B), as a space-filling model in (C), and as a symbol in (D). This particular 
phospholipid is built from five parts: the hydrophilic head, which consists of choline linked to a phosphate group; 
two hydrocarbon chains, which form the hydrophobic tails, and a molecule of glycerol, which links the head to the 
tails. Each of the hydrophobic tails is a fatty acid—a hydrocarbon chain with a carboxyl (-COOH) group at one end; 
glycerol attaches via this carboxyl group, as shown in (B). A kink in one of the hydrocarbon chains occurs where there 
is a double bond between two carbon atoms. (The “phosphatidyl” part of the name of a phospholipid refers to the 
phosphate—glycerol-fatty acid portion of the molecule.) 


hydrophilic heads 


Figure 11-7 Different types of membrane 

lipids are all amphipathic. Each of the three 
types shown here has a hydrophilic head and 
one or two hydrophobic tails. The hydrophilic hydrocarbon tail 
head is serine phosphate (shaded blue 

and yellow) in phosphatidylserine, an -OH 

group (blue) in cholesterol, and the sugar 

galactose plus an -OH group (both blue) in 

galactocerebroside. See also Panel 2-4, phosphatidylserine cholesterol galactocerebroside 
pp. 72-73. (a phospholipid) (a sterol) (a glycolipid) 
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Figure 11-8 A hydrophilic molecule attracts water molecules. Both 
acetone and water are polar molecules: thus acetone readily dissolves 
in water. Polar atoms are shown in red and blue, with ô- indicating a 
partial negative charge, and ò* indicating a partial positive charge. 
Hydrogen bonds (red) and an electrostatic attraction (yellow) form 
between acetone and the surrounding water molecules. Nonpolar 
groups are shown in gray. 


As discussed in Chapter 2 (see Panel 2-2, pp. 68-69), hydrophilic molecules 
dissolve readily in water because they contain either charged groups or 
uncharged polar groups that can form electrostatic attractions or hydro- 
gen bonds with water molecules (Figure 11-8). Hydrophobic molecules, 
by contrast, are insoluble in water because all—or almost all—of their 
atoms are uncharged and nonpolar; they therefore cannot form favora- 
ble interactions with water molecules. Instead, they force adjacent water 
molecules to reorganize into a cagelike structure around them (Figure 
11-9). Because this cagelike structure is more highly ordered than the 
rest of the water, its formation requires free energy. This energy cost is 
minimized when the hydrophobic molecules cluster together, limiting 
their contacts with the surrounding water molecules. Thus, purely hydro- 
phobic molecules, like the fats found in the oils of plant seeds and the 
adipocytes (fat cells) of animals (Figure 11-10), coalesce into large fat 
droplets when dispersed in water. 
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QUESTION 11-1 


Water molecules are said “to 
reorganize into a cagelike structure” 
around hydrophobic compounds 
(e.g., see Figure 11-9). This 

seems paradoxical because water 
molecules do not interact with 

the hydrophobic compound. So 
how could they “know” about its 
presence and change their behavior 
to interact differently with one 
another? Discuss this argument 
and, in doing so, develop a clear 
concept of what is meant by a 
“cagelike” structure. How does it 
compare to ice? Why would this 
cagelike structure be energetically 
unfavorable? 


Figure 11-9 A hydrophobic molecule 
tends to avoid water. Because the 
2-methylpropane molecule is entirely 
hydrophobic, it cannot form favorable 
interactions with water. This causes the 
adjacent water molecules to reorganize into 
a cagelike structure around it, to maximize 
their hydrogen bonds with each other. 
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Figure 11-10 Fat molecules are entirely 
hydrophobic. Unlike phospholipids, 
triacylglycerols, which are the main 
constituents of animal fats and plant oils, 
are entirely hydrophobic. Here, the third 
hydrophobic tail of the triacylglycerol 
molecule is drawn facing upward for 
comparison with the structure of a 
phospholipid (see Figure 11-6), although 
normally it is depicted facing down (see 
Panel 2-5, pp. 74-75). 
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Amphipathic molecules, such as membrane lipids (see Figure 11-7), 
are subject to two conflicting forces: the hydrophilic head is attracted 
to water, while the hydrophobic tails shun water and seek to aggregate 
with other hydrophobic molecules. This conflict is beautifully resolved by 
the formation of a lipid bilayer—an arrangement that satisfies all parties 
and is energetically most favorable. The hydrophilic heads face water 
on both surfaces of the bilayer, while the hydrophobic tails are shielded 
from the water within the bilayer interior, like the filling in a sandwich 
(Figure 11-11). 


The same forces that drive the amphipathic molecules to form a bilayer 
help to make the bilayer self-sealing. Any tear in the sheet will create a 
free edge that is exposed to water. Because this situation is energetically 
unfavorable, the molecules of the bilayer will spontaneously rearrange 
to eliminate the free edge. If the tear is small, this spontaneous re- 
arrangement will exclude the water molecules and lead to repair of the 
bilayer, restoring a single continuous sheet. If the tear is large, the sheet 
may begin to fold in on itself and break up into separate closed vesi- 
cles. In either case, the overriding principle is that free edges are quickly 
eliminated. 


The prohibition on free edges has a profound consequence: the only way 
an amphipathic sheet can avoid having free edges is to bend and seal, 
forming a boundary around a closed space (Figure 11-12). Therefore, 
amphipathic molecules such as phospholipids necessarily assemble into 
self-sealing containers that define closed compartments—from vesicles 
and organelles to entire cells. This remarkable behavior, fundamental to 
the creation of a living cell, is essentially a by-product of the nature of 
membrane lipids: hydrophilic at one end and hydrophobic at the other. 


The Lipid Bilayer Is a Flexible Two-dimensional Fluid 


The aqueous environment inside and outside a cell prevents membrane 
lipids from escaping from the bilayer, but nothing stops these molecules 
from moving about and changing places with one another within the 
plane of the membrane. The lipid bilayer therefore behaves as a two- 
dimensional fluid, a fact that is crucial for membrane function and 
integrity (Movie 11.1; “laser tweezers” are explained in Movie 11.2). 


At the same time, the lipid bilayer is also flexible—that is, it is able to 
bend. Like fluidity, flexibility is important for membrane function, and it 
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Figure 11-11 Amphipathic phospholipids form a bilayer in water. (A) Schematic drawing of a phospholipid bilayer in water. 
(B) Computer simulation showing the phospholipid molecules (red heads and orange tails) and the surrounding water molecules 
(blue) in a cross section of a lipid bilayer. (B, adapted from R.M. Venable et al., Science 262:223-228, 1993.) 


sets a lower limit of about 25 nm to the vesicle diameter that cell mem- 
branes can form. 


The fluidity of lipid bilayers can be studied using synthetic lipid bilayers, 
which are easily produced by the spontaneous aggregation of amphi- 
pathic lipid molecules in water. Pure phospholipids, for example, will 
form closed, spherical vesicles, called liposomes, when added to water; 
these vesicles vary in size from about 25 nm to 1 mm in diameter 
(Figure 11-13). 


Using such simple synthetic bilayers, investigators can measure the 
movements of the lipid molecules in a lipid bilayer. These measure- 
ments reveal that some types of movement are rare, while others are 
frequent and rapid. Thus, in synthetic lipid bilayers, phospholipid mole- 
cules very rarely tumble from one half of the bilayer, or monolayer, to the 
other. Without proteins to facilitate the process, it is estimated that this 
event, called “flip-flop,” occurs less than once a month for any individual 
lipid molecule under conditions similar to those in a cell. On the other 
hand, as the result of random thermal motions, lipid molecules continu- 
ously exchange places with their neighbors within the same monolayer. 
This exchange leads to rapid lateral diffusion of lipid molecules within 
the plane of each monolayer, so that, for example, a lipid in an artifi- 
cial bilayer may diffuse a length equal to that of an entire bacterial cell 
(~2 um) in about one second. 


Similar studies show that individual lipid molecules not only flex their 
hydrocarbon tails, but they also rotate rapidly about their long axis— 
some reaching speeds of 500 revolutions per second. Studies of whole 
cells—and of isolated cell membranes—indicate that lipid molecules in 
cell membranes undergo the same movements as they do in synthetic 
bilayers. The movements of membrane phospholipid molecules are sum- 
marized in Figure 11-14. 


The Fluidity of a Lipid Bilayer Depends on Its 
Composition 


The fluidity of a cell membrane—the ease with which its lipid molecules 
move within the plane of the bilayer—is important for membrane func- 
tion and has to be maintained within certain limits. Just how fluid a lipid 
bilayer is at a given temperature depends on its phospholipid composi- 
tion and, in particular, on the nature of the hydrocarbon tails: the closer 
and more regular the packing of the tails, the more viscous and less fluid 
the bilayer will be. 


Two major properties of hydrocarbon tails affect how tightly they pack 
in the bilayer: their length and the number of double bonds they con- 
tain. A shorter chain length reduces the tendency of the hydrocarbon 
tails to interact with one another and therefore increases the fluidity of 
the bilayer. The hydrocarbon tails of membrane phospholipids vary in 
length between 14 and 24 carbon atoms, with 18 or 20 atoms being the 
most common. For most phospholipids, one of these hydrocarbon tails 
contains only single bonds between its adjacent carbon atoms, whereas 
the other tail includes one or more double bonds (see Figure 11-6). The 
chain that harbors a double bond does not contain the maximum num- 
ber of hydrogen atoms that could, in principle, be attached to its carbon 
backbone; it is thus said to be unsaturated with respect to hydrogen. The 


Figure 11-13 Pure phospholipids can form closed, spherical 
liposomes. (A) An electron micrograph of phospholipid vesicles, or 
liposomes. (B) A drawing of a small, spherical liposome seen in cross 
section. (A, courtesy of Jean Lepault.) 
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Figure 11-12 Phospholipid bilayers 
spontaneously close in on themselves to 
form sealed compartments. The closed 
structure is stable because it avoids the 
exposure of the hydrophobic hydrocarbon 
tails to water, which would be energetically 
unfavorable. 
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Figure 11-14 Membrane phospholipids 
move within the lipid bilayer. Because of 
these motions, the bilayer behaves as a two- 
dimensional fluid, in which the individual 
lipid molecules are able to move in their 
own monolayer. Note that lipid molecules 
do not move spontaneously from one 
monolayer to the other. 


QUESTION 11-2 


Five students in your class always 
sit together in the front row. This 
could be because (A) they really like 
each other or (B) nobody else in 
your class wants to sit next to them. 
Which explanation holds for the 
assembly of a lipid bilayer? Explain. 
Suppose, instead, that the other 
explanation held for lipid molecules. 
How would the properties of the 
lipid bilayer be different? 


hydrocarbon tail with no double bonds has a full complement of hydrogen 
atoms and is said to be saturated. Each double bond in an unsaturated 
tail creates a small kink in the tail (see Figure 11-6), which makes it more 
difficult for the tails to pack against one another. For this reason, lipid 
bilayers that contain a large proportion of unsaturated hydrocarbon tails 
are more fluid than those with lower proportions. 


In bacterial and yeast cells, which have to adapt to varying temperatures, 
both the lengths and the degree of saturation of the hydrocarbon tails 
in the bilayer are adjusted constantly to maintain a membrane with a 
relatively consistent fluidity: at higher temperatures, for example, the cell 
makes membrane lipids with tails that are longer and that contain fewer 
double bonds. A similar trick is used in the manufacture of margarine 
from vegetable oils. The fats produced by plants are generally unsatu- 
rated and therefore liquid at room temperature, unlike animal fats such 
as butter or lard, which are generally saturated and therefore solid at 
room temperature. To produce margarine, vegetable oils are “hydrogen- 
ated”: this addition of hydrogen removes their double bonds, making the 
oils more solid and butterlike at room temperature. 


In animal cells, membrane fluidity is modulated by the inclusion of the 
sterol cholesterol. This molecule is present in especially large amounts 
in the plasma membrane, where it constitutes approximately 20% of the 
lipids in the membrane by weight. With its short and rigid steroid ring 
structure, cholesterol can fill the spaces between neighboring phospho- 
lipid molecules left by the kinks in their unsaturated hydrocarbon tails 
(Figure 11-15). In this way, cholesterol tends to stiffen the bilayer, mak- 
ing it less flexible, as well as less permeable. The chemical properties of 
membrane lipids—and how they affect membrane fluidity—are reviewed 
in Movie 11.3 and Movie 11.4. 


For all cells, membrane fluidity is important for a number of reasons. It 
enables many membrane proteins to diffuse rapidly in the plane of the 
bilayer and to interact with one another, as is crucial, for example, in 
cell signaling (discussed in Chapter 16). It permits membrane lipids and 
proteins to diffuse from sites where they are inserted into the bilayer after 
their synthesis to other regions of the cell. It ensures that membrane mol- 
ecules are distributed evenly between daughter cells when a cell divides. 
And, under appropriate conditions, it allows membranes to fuse with one 
another and mix their molecules (discussed in Chapter 15). If biological 
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Figure 11-15 Cholesterol tends to stiffen cell membranes. (A) The shape of a cholesterol molecule. The chemical formula of 
cholesterol is shown in Figure 11-7. (B) How cholesterol fits into the gaps between phospholipid molecules in a lipid bilayer. 

(C) Space-filling model of the bilayer, with cholesterol molecules in green. Although the nonpolar hydrocarbon tail of cholesterol is 
shown in green—to visually distinguish it from the hydrocarbon tails of the membrane phospholipids—in reality, the hydrophobic 
tail of cholesterol is chemically equivalent to the hydrophobic tails of the phospholipids. (C, from H.L. Scott, Curr. Opin. Struct. Biol. 


12:495-502, 2002.) 


Figure 11-16 Newly synthesized phospholipids are added to the 
cytosolic side of the ER membrane and then redistributed by 
transporters that transfer them from one half of the lipid bilayer 
to the other. Biosynthetic enzymes bound to the cytosolic monolayer 
of the ER membrane (not shown) produce new phospholipids 

from free fatty acids and insert them into the cytosolic monolayer. 
Transporters called scramblases then randomly transfer phospholipid 
molecules from one monolayer to the other, allowing the membrane 
to grow as a bilayer in which the two leaflets even out continuously in 
size and lipid composition. 


membranes were not fluid, it is hard to imagine how cells could live, 
grow, and reproduce. 


Membrane Assembly Begins in the ER 


In eukaryotic cells, new phospholipids are manufactured by enzymes 
bound to the cytosolic surface of the endoplasmic reticulum (ER). Using 
free fatty acids as substrates (see Panel 2-5, pp. 74-75), these enzymes 
deposit the newly made phospholipids exclusively in the cytosolic half of 
the bilayer. 


Despite the unbalanced addition of newly made phospholipids, cell mem- 
branes manage to grow evenly. So how do new phospholipids make it 
to the opposite monolayer? As we saw in Figure 11-14, flip-flops that 
move lipids from one monolayer to the other rarely occur spontaneously. 
Instead, phospholipid transfers are catalyzed by a scramblase, a type of 
transporter protein that removes randomly selected phospholipids from 
one half of the lipid bilayer and inserts them in the other. (Transporters 
and their functions are discussed in detail in Chapter 12.) As a result 
of this scrambling, newly made phospholipids are redistributed equally 
between each monolayer of the ER membrane (Figure 11-16). 


Some of this newly assembled membrane will remain in the ER; the rest 
will be used to supply fresh membrane to other compartments in the cell, 
including the Golgi apparatus and plasma membrane (see Figure 11-3). 
We discuss this dynamic process—in which membranes bud from one 
organelle and fuse with another—in detail in Chapter 15. 


Certain Phospholipids Are Confined to One Side of the 
Membrane 


Most cell membranes are asymmetric: the two halves of the bilayer 
often include strikingly different sets of phospholipids. But if membranes 
emerge from the ER with an evenly assorted set of phospholipids, where 
does this asymmetry arise? It begins in the Golgi apparatus. 


The Golgi membrane contains another family of phospholipid-handling 
transporters, called flippases. Unlike scramblases, which move random 
phospholipids from one half of the bilayer to the other, flippases remove 
specific phospholipids from the side of the bilayer facing the exterior 
space and flip them into the monolayer that faces the cytosol (Figure 
41-17). 


Figure 11-17 Flippases help to establish and maintain the 
asymmetric distribution of phospholipids characteristic of animal 
cell membranes. When membranes leave the ER and are incorporated 
in the Golgi, they encounter a different set of transporters called 
flippases, which selectively remove phosphatidylserine (light green) 
and phosphatidylethanolamine (yellow) from the noncytosolic 
monolayer and flip them to the cytosolic side. This transfer leaves 
phosphatidylcholine (red) and sphingomyelin (brown) concentrated in 
the noncytosolic monolayer. The resulting curvature of the membrane 
may help drive subsequent vesicle budding. 
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QUESTION 11-3 


It seems paradoxical that a 
lipid bilayer can be fluid yet 
asymmetrical. Explain. 


Figure 11-19 Phospholipids and 
glycolipids are distributed asymmetrically 
in the lipid bilayer of an animal cell 
plasma membrane. Phosphatidylcholine 
(red) and sphingomyelin (brown) are 
concentrated in the noncytosolic monolayer, 
whereas phosphatidylserine (light green) 
and phosphatidylethanolamine (yellow) 
are found mainly on the cytosolic side. 

In addition to these phospholipids, 
phosphatidylinositols (dark green head 
group), a minor constituent of the plasma 
membrane, are shown in the cytosolic 
monolayer, where they participate in cell 
signaling. Glycolipids are drawn with 
hexagonal blue head groups to represent 
sugars; these are found exclusively in the 
noncytosolic monolayer of the membrane. 
Within the bilayer, cholesterol (green) 

is distributed almost equally in both 
monolayers. 


Figure 11-18 Membranes retain their orientation during transfer 
between cell compartments. Membranes are transported by a 
process of vesicle budding and fusing. Here, a vesicle is shown budding 
from the Golgi apparatus and fusing with the plasma membrane. Note 
that the orientations of both the membrane lipids and proteins are 
preserved during the process: the original cytosolic surface of the lipid 
bilayer (pink) remains facing the cytosol, and the noncytosolic surface 
(red) continues to face away from the cytosol, toward the lumen of 
the Golgi and the transport vesicle—or toward the extracellular fluid. 
Similarly, the glycoprotein shown here (blue and green) remains in the 
same orientation, with its attached sugar facing the noncytosolic side. 


The action of these flippases—and of similar transporters in the plasma 
membrane—initiates and maintains the asymmetric arrangement of 
phospholipids that is characteristic of the membranes of animal cells. 
This asymmetry is preserved as membranes bud from one organelle and 
fuse with another—or with the plasma membrane. This means that all 
cell membranes have distinct “inside” and “outside” faces: the cytosolic 
monolayer always faces the cytosol, while the noncytosolic monolayer 
is exposed to either the cell exterior—in the case of the plasma mem- 
brane—or the interior space (lumen) of an organelle. This conservation of 
orientation applies not only to the phospholipids that make up the mem- 
brane, but also to any proteins that might be inserted in the membrane 
(Figure 11-18). This positioning is very important, as a protein's orienta- 
tion within the lipid bilayer is crucial for its function (see Figure 11-20). 


Among lipids, those that show the most dramatically lopsided distribu- 
tion in cell membranes are the glycolipids, which are located mainly in 
the plasma membrane, and only in the noncytosolic half of the bilayer 
(Figure 11-19). The sugar groups of these membrane lipids face the cell 
exterior, where they form part of a continuous coat of carbohydrate that 
surrounds and protects animal cells. Glycolipid molecules acquire their 
sugar groups in the Golgi apparatus, where the enzymes that engineer 
this chemical modification are confined. These enzymes are oriented 
such that sugars are added only to lipid molecules in the noncytosolic 
half of the bilayer. Once a glycolipid molecule has been created in this 
way, it remains trapped in this monolayer, as there are no flippases that 
transfer glycolipids to the cytosolic side. Thus, when a glycolipid mol- 
ecule is finally delivered to the plasma membrane, it displays its sugars to 
the exterior of the cell. 


Other lipid molecules show different types of asymmetric distributions, 
which relate to their specific functions. For example, the inositol phos- 
pholipids—a minor component of the plasma membrane—have a special 
role in relaying signals from the cell surface into the cell interior (dis- 
cussed in Chapter 16); thus they are concentrated in the cytosolic half of 
the lipid bilayer. 


EXTRACELLULAR SPACE 


ARAARA RIS 
SANTEE 


CYTOSOL 


TRANSPORTERS AND ANCHORS RECEPTORS ENZYMES 
CHANNELS 
<4 
* 4 EXTRACELLULAR 
n Ç SPACE 
| CYTOSOL 
-° if N oo 


MEMBRANE PROTEINS 


Although the lipid bilayer provides the basic structure of all cell mem- 
branes and serves as a permeability barrier to the hydrophilic molecules 
on either side of it, most membrane functions are carried out by mem- 
brane proteins. In animals, proteins constitute about 50% of the mass 
of most plasma membranes, the remainder being lipid plus the relatively 
small amounts of carbohydrate found on some of the lipids (glycolipids) 
and many of the proteins (glycoproteins). Because lipid molecules are 
much smaller than proteins, however, a cell membrane typically contains 
about 50 times the number of lipid molecules compared to protein mol- 
ecules (see Figure 11-4B). 


Membrane proteins serve many functions. Some transport particular 
nutrients, metabolites, and ions across the lipid bilayer. Others anchor 
the membrane to macromolecules on either side. Still others function 
as receptors that detect chemical signals in the cell’s environment and 
relay them into the cell interior, or work as enzymes to catalyze specific 
reactions at the membrane (Figure 11-20 and Table 11-1). Each type of 
cell membrane contains a different set of proteins, reflecting the special- 
ized functions of the particular membrane. In this section, we discuss the 
structure of membrane proteins and how they associate with the lipid 
bilayer. 


TABLE 11-1 SOME EXAMPLES OF PLASMA MEMBRANE PROTEINS 
AND THEIR FUNCTIONS 


Protein Example | Specific Function 


Na* pump 


Functional Class 
Transporters actively pumps Na* out of cells and K* 
in (discussed in Chapter 12) 

lon channels K* leak channel allows K* ions to leave cells, thereby 
influencing cell excitability (discussed in 
Chapter 12) 


Anchors integrins link intracellular actin filaments to 
extracellular matrix proteins (discussed 
in Chapter 20) 
Receptors platelet-derived binds extracellular PDGF and, as a 
growth factor consequence, generates intracellular 
(PDGF) receptor signals that direct the cell to grow and 
divide (discussed in Chapters 16 and 18) 
Enzymes adenylyl cyclase catalyzes the production of the small 


intracellular signaling molecule cyclic 
AMP in response to extracellular signals 
(discussed in Chapter 16) 


Y 


Membrane Proteins 


Figure 11-20 Plasma membrane 
proteins have a variety of functions. 
They transport molecules and ions, act 
as anchors, detect signals, or catalyze 
reactions. 
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Membrane Proteins Associate with the Lipid Bilayer in 
Different Ways 


Although the lipid bilayer has a uniform structure, proteins can interact 
with a cell membrane in a number of different ways. 


e Many membrane proteins extend through the bilayer, with part of 
their mass on either side (Figure 11-21A). Like their lipid neighbors, 
these transmembrane proteins are amphipathic, having both 
hydrophobic and hydrophilic regions. Their hydrophobic regions lie 
in the interior of the bilayer, nestled against the hydrophobic tails 
of the lipid molecules. Their hydrophilic regions are exposed to the 
aqueous environment on either side of the membrane. 

e Other membrane proteins are located almost entirely in the cytosol 
and are associated with the cytosolic half of the lipid bilayer by an 
amphipathic a helix exposed on the surface of the protein (Figure 
41-216). 

e Some proteins lie entirely outside the bilayer, on one side or the 
other, attached to the membrane by one or more covalently attached 
lipid groups (Figure 11-21C). 

e Yet other proteins are bound indirectly to one face of the membrane 
or the other, held in place only by their interactions with other 
membrane proteins (Figure 11-21D). 


Proteins that are directly attached to the lipid bilayer—whether they are 
transmembrane, associated with the lipid monolayer, or lipid-linked—can 
be removed only by disrupting the bilayer with detergents, as discussed 
shortly. Such proteins are known as integral membrane proteins. The 
remaining membrane proteins are classified as peripheral membrane pro- 
teins; they can be released from the membrane by more gentle extraction 
procedures that interfere with protein-protein interactions but leave the 
lipid bilayer intact. 
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Figure 11-21 Membrane proteins can associate with the lipid bilayer in different ways. (A) Transmembrane 
proteins can extend across the bilayer as a single a helix, as multiple a helices, or as a rolled-up B sheet (called a 

B barrel). (B) Some membrane proteins are anchored to the cytosolic half of the lipid bilayer by an amphipathic 

a helix. (C) Others are linked to either side of the bilayer solely by a covalently attached lipid molecule (red zigzag 
lines). (D) Many proteins are attached to the membrane only by relatively weak, noncovalent interactions with other 
membrane proteins. (A-C) are examples of integral membrane proteins; the proteins shown in (D) are considered 
peripheral membrane proteins. 


A Polypeptide Chain Usually Crosses the Lipid Bilayer 
as an a Helix 


All membrane proteins have a unique orientation in the lipid bilayer, 
which is essential for their function. For a transmembrane receptor pro- 
tein, for example, the part of the protein that receives a signal from the 
environment must be on the outside of the cell, whereas the part that 
passes along the signal must be in the cytosol (see Figure 11-20). This 
orientation is a consequence of the way in which membrane proteins 
are synthesized (discussed in Chapter 15). The portions of a transmem- 
brane protein located on either side of the lipid bilayer are connected 
by specialized membrane-spanning segments of the polypeptide chain 
(see Figure 11-21A). These segments, which run through the hydropho- 
bic environment of the interior of the lipid bilayer, are composed largely 
of amino acids with hydrophobic side chains. Because these side chains 
cannot form favorable interactions with water molecules, they prefer to 
interact with the hydrophobic tails of the lipid molecules, where no water 
is present. 


In contrast to the hydrophobic side chains, however, the peptide bonds 
that join the successive amino acids in a protein are normally polar, mak- 
ing the polypeptide backbone itself hydrophilic (Figure 11-22). Because 
water is absent from the interior of the bilayer, atoms that are part of the 
polypeptide backbone are thus driven to form hydrogen bonds with one 
another. Hydrogen-bonding is maximized if the polypeptide chain forms 
a regular a helix, and so the great majority of the membrane-spanning 
segments of polypeptide chains traverse the bilayer as a helices (see 
Figure 4-12). In these membrane-spanning a helices, the hydrophobic 
side chains are exposed on the outside of the helix, where they contact 
the hydrophobic lipid tails, while the atoms of the hydrophilic polypep- 
tide backbone form hydrogen bonds with one another within the helix 
(Figure 11-23). 


For many transmembrane proteins, the polypeptide chain crosses the 
membrane only once (see Figure 11-21A, left). Many of these single- 
pass transmembrane proteins are receptors for extracellular signals. 
Other transmembrane proteins function as channels, forming aqueous 
pores across the lipid bilayer to allow small, water-soluble molecules to 
cross the membrane. Such channels cannot be formed by proteins with 
a single transmembrane a helix. Instead, they usually consist of a series 
of a helices that cross the bilayer a number of times (see Figure 11-21A, 
center). For many of these multipass transmembrane proteins, one or 
more of the membrane-spanning regions are amphipathic—formed from 
a helices that contain both hydrophobic and hydrophilic amino acid side 
chains. These amino acids tend to be arranged so that the hydrophobic 
side chains fall on one side of the helix, while the hydrophilic side chains 
are concentrated on the other side. In the hydrophobic environment of 
the lipid bilayer, a helices of this type pack side by side in a ring, with 
the hydrophobic side chains exposed to the hydrophobic lipid tails and 
the hydrophilic side chains forming the lining of a hydrophilic pore 


Figure 11-23 A transmembrane polypeptide chain usually crosses 
the lipid bilayer as an a helix. In this segment of a transmembrane 
protein, the hydrophobic side chains (light green) of the amino acids 
forming the a helix contact the hydrophobic hydrocarbon tails of the 
phospholipid molecules, while the hydrophilic parts of the polypeptide 
backbone form hydrogen bonds with one another (dashed red lines) 
along the interior of the helix. An a helix containing about 20 amino 
acids is required to completely traverse a cell membrane. 


Membrane Proteins 


peptide bonds 


Figure 11-22 The backbone of a 
polypeptide chain is hydrophilic. The 
atoms on either side of a peptide bond (red 


line) are polar and carry partial positive or 
negative charges (5* or 6-). These charges 
allow these atoms to hydrogen-bond with 


one another when the polypeptide folds 


into an a helix that spans the lipid bilayer 


(see Figure 11-23). 
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QUESTION 11-4 


Explain why the polypeptide chain 
of most transmembrane proteins 
crosses the lipid bilayer as an a helix 
or af barrel. 


Figure 11-25 Porin proteins form water- 
filled channels in the outer membrane 

of a bacterium. The protein illustrated is 
from E. coli, and it consists of a 16-stranded 
B sheet curved around on itself to form a 
transmembrane water-filled channel. The 
three-dimensional structure was determined 
by x-ray crystallography. Although not 
shown in the drawing, three porin proteins 
associate to form a trimer with three 
separate channels. 


Figure 11-24 A transmembrane hydrophilic pore can be formed 
by multiple amphipathic a helices. In this example, five amphipathic 
transmembrane a helices form a water-filled channel across the lipid 
bilayer. The hydrophobic amino acid side chains on one side of each 
helix (green) come in contact with the hydrophobic lipid tails of the 
lipid bilayer, while the hydrophilic side chains on the opposite side of 
the helices (red) form a water-filled pore. 


through the membrane (Figure 11-24). How such channels function 
in the selective transport of small, water-soluble molecules, especially 
inorganic ions, is discussed in Chapter 12. 


Although the a helix is by far the most common form in which a poly- 
peptide chain crosses a lipid bilayer, the polypeptide chain of some 
transmembrane proteins crosses the lipid bilayer as a B sheet that is rolled 
into a cylinder, forming a keglike structure called a p barrel (see Figure 
11-21A, right). As expected, the amino acid side chains that face the 
inside of the barrel, and therefore line the aqueous channel, are mostly 
hydrophilic, while those on the outside of the barrel, which contact the 
hydrophobic core of the lipid bilayer, are exclusively hydrophobic. A strik- 
ing example of a B-barrel structure is found in the porin proteins, which 
form large, water-filled pores in mitochondrial and bacterial outer mem- 
branes (Figure 11-25). Mitochondria and some bacteria are surrounded 
by a double membrane, and porins allow the passage of small nutrients, 
metabolites, and inorganic ions across their outer membranes, while 
preventing unwanted larger molecules from crossing. 


Membrane Proteins Can Be Solubilized in Detergents 


To understand a protein fully, one needs to know its structure in detail. 
For membrane proteins, this presents special problems. Most biochemi- 
cal procedures are designed for studying molecules in aqueous solution. 
Membrane proteins, however, are built to operate in an environment that 
is partly aqueous and partly fatty, and taking them out of this environ- 
ment to study in isolation—while preserving their essential structure—is 
no easy task. 


Before an individual protein can be examined in detail, it must be sepa- 
rated from all the other cell proteins. For most membrane proteins, the 
first step in this purification process involves solubilizing the membrane 
with agents that destroy the lipid bilayer by disrupting hydrophobic 
associations. The most widely used disruptive agents are detergents 
(Movie 11.5). These small, amphipathic, lipidlike molecules differ from 
membrane phospholipids in that they have only a single hydrophobic 
tail (Figure 11-26). Because they have one tail, detergent molecules are 
shaped like cones; in water, these conical molecules tend to aggregate 
into small clusters called micelles, rather than forming a bilayer as do 
the phospholipids, which—with their two tails—are more cylindrical in 
shape. 


When mixed in great excess with membranes, the hydrophobic ends 
of detergent molecules interact with the membrane-spanning hydro- 
phobic regions of the transmembrane proteins, as well as with the 
hydrophobic tails of the phospholipid molecules, thereby disrupting 
the lipid bilayer and separating the proteins from most of the phospho- 
lipids. Because the other end of the detergent molecule is hydrophilic, 
these interactions draw the membrane proteins into the aqueous solu- 
tion as protein-detergent complexes; at the same time, the detergent 


Figure 11-26 SDS and Triton X-100 are two commonly used 
detergents. Sodium dodecyl sulfate (SDS) is a strong ionic detergent— 
that is, it has an ionized (charged) group at its hydrophilic end (blue). 
Triton X-100 is a mild nonionic detergent—that is, it has a nonionized 
but polar structure at its hydrophilic end (blue). The hydrophobic 
portion of each detergent is shown in red. The bracketed portion of 
Triton X-100 is repeated about eight times. Strong ionic detergents like 
SDS not only displace lipid molecules from proteins but also unfold the 
proteins (see Panel 4-5, p. 167). 


also solubilizes the phospholipids (Figure 11-27). The protein-detergent 
complexes can then be separated from one another and from the lipid- 
detergent complexes for further analysis. 


We Know the Complete Structure of Relatively Few 
Membrane Proteins 


For many years, much of what we knew about the structure of mem- 
brane proteins was learned by indirect means. The standard method for 
determining a protein’s three-dimensional structure directly has been 
x-ray crystallography, but this approach requires ordered crystalline 
arrays of the molecule. Because membrane proteins have to be puri- 
fied in detergent micelles that are often heterogeneous in size, they are 
harder to crystallize than the soluble proteins that inhabit the cell cyto- 
sol or extracellular fluids. Nevertheless, with recent advances in x-ray 
crystallography, along with powerful new approaches such as cryoelec- 
tron microscopy, the structures of an increasing number of membrane 
proteins have now been determined to high resolution (see Panel 4-6, 
pp. 168-169). 


One example is bacteriorhodopsin, the structure of which first revealed 
exactly how a helices cross the lipid bilayer. Bacteriorhodopsin is a small 
protein found in large amounts in the plasma membrane of Halobacterium 
halobium, an archaean that lives in salt marshes. Bacteriorhodopsin acts 
as a membrane transport protein that pumps H* (protons) out of the cell. 
Each bacteriorhodopsin molecule contains a single chromophore, a light- 
absorbing, nonprotein molecule called retinal, that gives the protein—and 
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QUESTION 11-5 


For the two detergents shown 

in Figure 11-26, explain why the 
blue portions of the molecules are 
hydrophilic and the red portions 
hydrophobic. Draw a short stretch 
of a polypeptide chain made up of 
three amino acids with hydrophobic 
side chains (see Panel 2-6, pp. 76- 
77) and apply a similar color scheme. 
Indicate which portions of your 
polypeptide would form hydrogen 
bonds with water. 


Figure 11-27 Membrane proteins can 

be solubilized by a mild detergent such 
as Triton X-100. The detergent molecules 
(gold) are shown as both monomers and 
micelles, the form in which these molecules 
tend to aggregate in water. The detergent 
disrupts the lipid bilayer and interacts with 
the membrane-spanning hydrophobic 
portion of the protein (dark green). These 
actions bring the proteins into solution as 
protein—detergent complexes. As illustrated, 
the phospholipids in the membrane are also 
solubilized by the detergents, forming lipid- 
detergent micelles. 
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Figure 11-28 Bacteriorhodopsin acts as 
a proton pump. The polypeptide chain of 
this small protein (about 250 amino acids in 
length) crosses the lipid bilayer as seven a 
helices. The location of the retinal (purple) EXTRACELLULAR 
and the probable pathway taken by protons SPACE 
during the light-activated pumping cycle ; 

(red arrows) are highlighted. Strategically 
placed polar amino acid side chains— retinal 
shown in red, yellow, and blue—qguide the 
movement of the proton (H*) across the 
bilayer, allowing it to avoid contact with 
the lipid environment. The retinal is then lipid 
regenerated by taking up a H* from 
the cytosol, returning the protein to its 
original conformation—a cycle shown 
in Movie 11.6. Retinal is also used to 
detect light in our own eyes, where it is 
attached to a protein with a structure L 
very similar to that of bacteriorhodopsin. CYTOSOL 
(Adapted from H. Luecke et al., Science 

286:5438 255-260, 1999.) 


transmembrane 
helices 


the entire organism—a deep purple color. When retinal, which is cova- 
lently attached to one of bacteriorhodopsin’s transmembrane a helices, 
absorbs a photon of light, it changes shape. This shape change causes the 
surrounding helices to undergo a series of small conformational changes, 
which pump one proton from the retinal to the outside of the organism 
(Figure 11-28). 


In the presence of sunlight, thousands of bacteriorhodopsin mol- 
ecules pump H+ out of the cell, generating a concentration gradient of 
H* across the plasma membrane. The cell uses this proton gradient to 
store energy and convert it into ATP, as we discuss in detail in Chapter 
14. Bacteriorhodopsin is a pump, a class of transmembrane protein that 
actively moves small organic molecules and inorganic ions into and out 
of cells. We will discuss the action of other important transmembrane 
pumps in Chapter 12. 


The Plasma Membrane Is Reinforced by the Underlying 
Cell Cortex 


A cell membrane by itself is extremely thin and fragile. It would require 
nearly 10,000 cell membranes laid on top of one another to achieve the 
thickness of this paper. Most cell membranes are therefore strengthened 
and supported by a framework of proteins, attached to the membrane via 
transmembrane proteins. For plants, yeasts, and bacteria, the cell’s shape 
and mechanical properties are conferred by a rigid cell wall—a fibrous 
layer of proteins, sugars, and other macromolecules that encases the 
plasma membrane. By contrast, the plasma membrane of animal cells is 
stabilized by a meshwork of filamentous proteins, called the cell cortex, 
that is attached to the underside of the membrane. 


The cortex of the human red blood cell has a relatively simple and regular 
structure and has been especially well studied. Red blood cells are small 
and have a distinctive flattened shape (Figure 11—29A). The main com- 
ponent of their cortex is the dimeric protein spectrin, a long, thin, flexible 
rod about 100 nm in length. Spectrin forms a lattice that provides sup- 
port for the plasma membrane and maintains the cell’s biconcave shape. 
The spectrin network is connected to the membrane through intracellular 
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Figure 11-29 A cortex made largely of spectrin gives human red blood cells their characteristic shape. 

(A) Scanning electron micrograph showing human red blood cells, which have a flattened, biconcave shape. These 
cells lack a nucleus and other intracellular organelles. (B) In the cortex of a red blood cell, spectrin dimers (red) 
are linked end-to-end to form longer tetramers. The spectrin tetramers, together with a smaller number of actin 
molecules, are linked together into a mesh. This network is attached to the plasma membrane by the binding of 
at least two types of attachment proteins (shown here in yellow and blue) to two kinds of transmembrane proteins 
(shown here in green and brown). (A, courtesy of Bernadette Chailley.) 


attachment proteins that link spectrin to specific transmembrane pro- 

teins (Figure 11—29B and Movie 11.7). The importance of this meshwork QUESTION 11-6 

is seen in mice and humans that, due to genetic alterations, produce a 

form of spectrin with an abnormal structure. These individuals are ane- Look carefully at the transmembrane 


mic: they have fewer red blood cells than normal. The red cells they do proteins shown in Figure 11—29B. 


have are spherical instead of flattened and are abnormally fragile. What can you say about their 
mobility in the membrane? 


Proteins similar to spectrin, and to its associated attachment proteins, 
are present in the cortex of most animal cells. But the cortex in these cells 
is especially rich in actin and the motor protein myosin, and it is much 
more complex than that of red blood cells. Whereas red blood cells need 
their cortex mainly to provide mechanical strength as they are pumped 
through blood vessels, other cells also use their cortex to selectively take 
up materials from their environment, to change their shape, and to move, 
as we discuss in Chapter 17. In addition, cells also use their cortex to 
restrain the diffusion of proteins within the plasma membrane, as we see 
next. 


A Cell Can Restrict the Movement of Its Membrane 
Proteins 


Because a membrane is a two-dimensional fluid, many of its proteins, 
like its lipids, can move freely within the plane of the bilayer. This lateral 
diffusion was initially demonstrated by experimentally fusing a mouse 
cell with a human cell to form a double-sized hybrid cell and then moni- 
toring the distribution of certain mouse and human plasma membrane 
proteins. At first, the mouse and human proteins are confined to their 
own halves of the newly formed hybrid cell, but within half an hour or 
so the two sets of proteins become evenly mixed over the entire cell sur- 
face (Figure 11-30). We describe some other techniques for studying the 
movement of membrane proteins in How We Know, pp. 384-385. 


The picture of a cell membrane as a sea of lipid in which all proteins float 
freely is too simple, however. Cells have ways of confining particular 
proteins to localized areas within the bilayer, thereby creating function- 
ally specialized regions, or membrane domains, on the surface of the 
cell or organelle. 
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Figure 11-30 Formation of mouse-human 
hybrid cells shows that some plasma 
membrane proteins can move laterally 

in the lipid bilayer. When the mouse and 
human cells are first fused, their proteins are 
confined to their own halves of the newly 
formed hybrid-cell plasma membrane. 
Within a short time, however, the membrane 
proteins—and lipids—completely intermix. 
To monitor the movement of a selected 
sampling of the proteins, the cells are 
labeled with antibodies that bind to either 
human or mouse proteins; the antibodies 
are coupled to two different fluorescent 
tags—for example, rhodamine (red) and 
fluorescein (shown here in blue)—so they 
can be distinguished in a fluorescence 
microscope (see Panel 4-2, pp. 140-141). 
(Based on observations of L.D. Frye and 

M. Edidin, J. Cell Sci. 7:319-335, 1970.) 
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As illustrated in Figure 11-31, plasma membrane proteins can be teth- 
ered to structures outside the cell—for example, to molecules in the 
extracellular matrix or on an adjacent cell (discussed in Chapter 20)— 
or to relatively immobile structures inside the cell, especially to the cell 
cortex (see Figure 11-29B). Additionally, cells can create barriers that 
restrict particular membrane components to one membrane domain. In 
epithelial cells that line the gut, for example, it is important that transport 
proteins involved in the uptake of nutrients from the gut be confined to 
the apical surface of the cells (which faces the gut contents) and that other 
transport proteins—including those involved in the export of solutes out 
of the epithelial cell into the tissues and bloodstream—be confined to the 
basal and lateral surfaces (see Figure 12-17). This asymmetric distribution 
of membrane proteins is maintained by a barrier formed along the line 
where the cell is sealed to adjacent epithelial cells by a so-called tight 
junction (Figure 11-32). At this site, specialized junctional proteins form 
a continuous belt around the cell where the cell contacts its neighbors, 
creating a seal between adjacent plasma membranes (see Figure 20-22). 
Membrane proteins are unable to diffuse past the junction. 


The Cell Surface Is Coated with Carbohydrate 


We saw earlier that some of the lipids in the outer layer of the plasma 
membrane have sugars covalently attached to them. The same is true 
for most of the proteins in the plasma membrane. The great majority 
of these proteins have short chains of sugars, called oligosaccharides, 
linked to them; they are called glycoproteins. Other membrane proteins, 
the proteoglycans, contain one or more long polysaccharide chains. All of 
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Figure 11-31 The lateral mobility of plasma membrane proteins can be restricted 

in several ways. Proteins can be tethered (A) to the cell cortex inside the cell, (B) to 
extracellular matrix molecules outside the cell, or (C) to proteins on the surface of another 
cell. (D) Diffusion barriers (shown as black bars) can restrict proteins to a particular 
membrane domain. 
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the carbohydrate on the glycoproteins, proteoglycans, and glycolipids is 
located on the outside of the plasma membrane, where it forms a sugar 
coating called the carbohydrate layer or glycocalyx (Figure 11-33). 


This layer of carbohydrate helps protect the cell surface from mechanical 
damage. And because the oligosaccharides and polysaccharides attract 
water molecules, they also give the cell a slimy surface, which helps 
motile cells such as white blood cells squeeze through narrow spaces 
and prevents blood cells from sticking to one another or to the walls of 
blood vessels. 


Cell-surface carbohydrates do more than just protect and lubricate the 
cell, however. They have an important role in cell-cell recognition and 
adhesion. Transmembrane proteins called lectins are specialized to bind 
to particular oligosaccharide side chains. The oligosaccharide side chains 
of glycoproteins and glycolipids, although short (typically fewer than 15 
sugar units), are enormously diverse. Unlike proteins, in which the amino 
acids are all joined together in a linear chain by identical peptide bonds, 
sugars can be joined together in many different arrangements, often 
forming elaborate branched structures (see Panel 2-4, pp. 72-73). Using 
a variety of covalent linkages, even three different sugars can form hun- 
dreds of different trisaccharides. 


The carbohydrate layer on the surface of cells in a multicellular organism 
serves as a kind of distinctive clothing, like a police officer's uniform. It is 
characteristic of each cell type and is recognized by other cell types that 
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Figure 11-32 Membrane proteins are 
restricted to particular domains of the 
plasma membrane of epithelial cells in the 
gut. Protein A (green) and protein B (red) 
can diffuse laterally in their own membrane 
domains but are prevented from entering the 
other domain by a specialized cell junction 
called a tight junction. The basal lamina 
(yellow) is a mat of extracellular matrix that 
supports all epithelial sheets (discussed in 
Chapter 20). 


Figure 11-33 Eukaryotic cells are coated 
with sugars. This carbohydrate-rich layer 
is made of the oligosaccharide side chains 
attached to membrane glycolipids and 
glycoproteins, and of the polysaccharide 
chains on membrane proteoglycans. As 
shown, glycoproteins that have been 
secreted by the cell and then adsorbed 
back onto its surface can also contribute. 
Note that all the carbohydrate is on the 
external (noncytosolic) surface of the 
plasma membrane. 


= HOW WE KNOW 
MEASURING MEMBRANE FLOW 


An essential feature of the lipid bilayer is its fluidity, 
which is crucial for cell membrane integrity and func- 
tion. This property allows many membrane-embedded 
proteins to move laterally in the plane of the bilayer, 
so that they can engage in the various protein-protein 
interactions on which cells depend. The fluid nature of 
cell membranes is so central to their proper function 
that it may seem surprising that this property was not 
recognized until the early 1970s. 


Given its importance for membrane structure and func- 
tion, how do we measure and study the fluidity of cell 
membranes? The most common methods are visual: 
simply label some of the molecules native to the mem- 
brane and then watch where they go. Such an approach 
first demonstrated the lateral movement of membrane 
proteins that had been tagged with labeled antibodies 
(see Figure 11-30). This experiment seemed to suggest 
that membrane proteins diffuse freely, without restric- 
tion, in an open sea of lipids. We now know that this 
image is not entirely accurate. To probe membrane flu- 
idity more thoroughly, researchers had to invent more 
precise methods for tracking the movement of proteins 
within a membrane such as the plasma membrane of a 
living cell. 


The FRAP attack 


One such technique, called fluorescence recovery after 
photobleaching (FRAP), involves uniformly labeling 


FRAP fluorescently labeled 


membrane proteins 


lipid 


bilayer BLEACH PATCH 


WITH LASER BEAM 


bleached area 


@@@@@°:°2°70°7°70°7@e@ 
TERRE RHR RERReeee 


LABELED PROTEINS 
DIFFUSE RANDOMLY 

THROUGHOUT 

MEMBRANE 


ere 
@©G@0G000086060e 
FERRER REE 


FLUORESCENCE 
RETURNED TO 
BLEACHED PATCH 


fluorescence in 


the components of the cell membrane—its lipids or, 
more often, its proteins—with some sort of fluorescent 
marker. Labeling membrane proteins can be accom- 
plished by incubating cells with a fluorescent antibody 
or by covalently attaching a fluorescent protein such as 
green fluorescent protein (GFP) to a membrane protein 
using the DNA techniques discussed in Chapter 10. 


Once a protein has been labeled, a small patch of mem- 
brane is irradiated with an intense pulse of light from a 
sharply focused laser beam. This treatment irreversibly 
“Dleaches” the fluorescence from the labeled proteins 
in that small patch of membrane, typically an area 
about 1 um square. The fluorescence of this irradiated 
membrane is monitored in a fluorescence microscope, 
and the amount of time it takes for the neighboring, 
unbleached fluorescent proteins to migrate into the 
bleached region of the membrane is measured (Figure 
11-34). The rate of this “fluorescence recovery” is a 
direct measure of the rate at which the protein mol- 
ecules can diffuse within the membrane (Movie 11.8). 
Such experiments have revealed that, generally speak- 
ing, cell membranes are about as viscous as olive oil. 


One-by-one 


One drawback to the FRAP approach is that the tech- 
nique monitors the movement of fairly large populations 
of proteins—hundreds or thousands—across a rela- 
tively large area of the membrane. With this technique 
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Figure 11-34 Photobleaching techniques such as FRAP 
can be used to measure the rate of lateral diffusion of 

a membrane protein. A specific type of protein can be 
labeled with a fluorescent antibody (as shown here) or tagged 
with a fluorescent protein, such as GFP. A small area of the 
membrane containing these fluorescent protein molecules is 
then bleached using a laser beam. As the bleached molecules 
diffuse away, and unbleached, fluorescent molecules diffuse 
into the area, the intensity of the fluorescence is recovered 
(shown here in side and top views). The diffusion coefficient 

is then calculated from a graph of the rate of fluorescence 
recovery: the greater the diffusion coefficient of the membrane 
protein, the faster the recovery. 
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Figure 11-35 Proteins show different patterns of diffusion. 
Single-particle tracking studies reveal some of the pathways that 
single proteins follow on the surface of a living cell. Shown here 
are some trajectories representative of different kinds of proteins 
in the plasma membrane. (A) Tracks made by a protein that is 
free to diffuse randomly in the lipid bilayer. (B) Tracks made by 

a protein that is corralled within a small membrane domain by 
other proteins. (C) Tracks made by a protein that is tethered 

to the cytoskeleton and hence is essentially immobile. The 
movement of the proteins is monitored over a period of seconds. 


it is impossible to track the motion of individual mol- 
ecules, which can make analysis of the results difficult. 
If the labeled proteins fail to migrate into the bleached 
zone over the course of a FRAP study, for example, is 
it because they are immobile, essentially anchored in 
one place in the membrane? Or, alternatively, are they 
restricted to movement within a very small region— 
fenced in by cytoskeletal proteins—and thus only 
appear motionless? 


To get around this problem, researchers have devel- 
oped methods for labeling and observing the movement 
of individual molecules or small clusters of molecules. 
One such technique, dubbed single-particle tracking 
(SPT) microscopy, relies on tagging protein molecules 
with antibody-coated gold nanoparticles. The gold par- 
ticles look like tiny black dots when seen with a light 
microscope, and their movement, and thus the move- 
ment of individually tagged protein molecules, can be 
followed using video microscopy. 


From the studies carried out to date, it appears that 
membrane proteins can display a variety of patterns of 
movement, from random diffusion to complete immo- 
bility (Figure 11-35). Some proteins rapidly switch 
between these different kinds of motion. 


Freed from cells 


In many cases, researchers wish to study the behavior 
of a particular type of membrane protein in a synthetic 
lipid bilayer, in the absence of other proteins that might 
restrain its movement or alter its activity. For such stud- 
ies, membrane proteins can be isolated from cells and 
the protein of interest purified and reconstituted in arti- 
ficial phospholipid vesicles (Figure 11-36). The lipids 
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allow the purified protein to maintain its proper struc- 
ture and function, so that its activity and behavior can 
be analyzed in detail. 


It is apparent from such studies that membrane pro- 
teins diffuse more freely and rapidly in artificial lipid 
bilayers than in cell membranes. The fact that most 
proteins show reduced mobility in a cell membrane 
makes sense, as these membranes are crowded with 
many types of proteins and contain a greater variety of 
lipids than an artificial lipid bilayer. Furthermore, many 
membrane proteins in a cell are tethered to proteins in 
the extracellular matrix, or anchored to the cell cortex 
just under the plasma membrane, or both (as illustrated 
in Figure 11-31). 


Taken together, such studies have revolutionized our 
understanding of membrane proteins and of the archi- 
tecture and organization of cell membranes. 
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Figure 11-36 Mild detergents can be used to solubilize 
and reconstitute functional membrane proteins. Proteins 
incorporated into artificial lipid bilayers generally diffuse more 
freely and rapidly than they do in cell membranes. 
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endothelial cell 


Figure 11-37 The recognition of cell- 
surface carbohydrates on neutrophils 
allows these immune cells to begin 

to migrate out of the blood and 

into infected tissues. Specialized 
transmembrane proteins (called lectins) are 
made by the endothelial cells lining the 
blood vessel in response to chemical signals 
emanating from a site of infection. These 
proteins recognize particular sugar groups 
carried by glycolipids and glycoproteins on 
the surface of neutrophils (a type of white 
blood cell, also called a leukocyte) 
circulating in the blood. The neutrophils 
consequently stick to the endothelial 

cells that line the blood vessel wall. This 
association is not very strong, but it leads 
to another, much stronger protein-protein 
interaction (not shown) that helps the 
neutrophil slip between the endothelial 
cells, so it can migrate out of the 
bloodstream and into the tissue at the site 
of infection (Movie 11.9). 
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interact with it. Specific oligosaccharides in the carbohydrate layer are 
involved, for example, in the recognition of an egg by sperm (discussed 
in Chapter 19). Similarly, in the early stages of a bacterial infection, car- 
bohydrates on the surface of white blood cells called neutrophils are 
recognized by a lectin on the cells lining the blood vessels at the site of 
infection; this recognition causes the neutrophils to adhere to the blood 
vessel wall and then migrate from the bloodstream into the infected tis- 
sue, where they help destroy the invading bacteria (Figure 11-37). 


ESSENTIAL CONCEPTS 


Membranes enable cells to create barriers that confine particular 
molecules to specific compartments. They consist of a continuous 
double layer—a bilayer—of lipid molecules in which proteins are 
embedded. 


The lipid bilayer provides the basic structure and barrier function of 
all cell membranes. 


Membrane lipid molecules are amphipathic, having both hydrophobic 
and hydrophilic regions. This property promotes their spontaneous 
assembly into bilayers when placed in water, forming closed com- 
partments that reseal if torn. 


There are three major classes of membrane lipid molecules: phos- 
pholipids, sterols, and glycolipids. 


The lipid bilayer is fluid, and individual lipid molecules are able to 
diffuse within their own monolayer; they do not, however, spontane- 
ously flip from one monolayer to the other. 


The two monolayers of a cell membrane have different lipid com- 
positions, reflecting the different functions of the two faces of the 
membrane. 


Cells that live at different temperatures maintain their membrane flu- 
idity by modifying the lipid composition of their membranes. 


Membrane proteins are responsible for most of the functions of cell 
membranes, including the transport of small, water-soluble mol- 
ecules across the lipid bilayer. 


Transmembrane proteins extend across the lipid bilayer, usually as 
one or more a helices but sometimes as a f sheet rolled into the form 
of a barrel. 


Other membrane proteins do not extend across the lipid bilayer but 
are attached to one or the other side of the membrane, either by 
noncovalent association with other membrane proteins, by covalent 
attachment of lipids, or by association of an exposed amphipathic 
a helix with a single lipid monolayer. 


Most cell membranes are supported by an attached framework of 
proteins. An especially important example is the meshwork of fibrous 
proteins that forms the cell cortex underneath the plasma membrane. 


Although many membrane proteins can diffuse rapidly in the plane 
of the membrane, cells have ways of confining proteins to specific 
membrane domains. They can also immobilize particular mem- 
brane proteins by attaching them to intracellular or extracellular 
macromolecules. 


Many of the proteins and some of the lipids exposed on the surface 
of cells have attached sugar chains, which form a carbohydrate layer 
that helps protect and lubricate the cell surface, while also being 
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involved in specific cell-cell recognition. 


KEY TERMS 


amphipathic membrane domain 
bacteriorhodopsin membrane protein 
cell cortex phosphatidylcholine 
cholesterol phospholipid 
detergent plasma membrane 
fat droplet saturated 
glycocalyx unsaturated 

lipid bilayer 


QUESTIONS 


QUESTION 11-7 


Describe the different methods that cells use to restrict 
proteins to specific regions of the plasma membrane. Can a 
membrane with many of its proteins restricted still be fluid? 


QUESTION 11-8 


Which of the following statements are correct? Explain your 
answers. 


A. Lipids in a lipid bilayer spin rapidly around their long 
axis. 


B. Lipids in a lipid bilayer rapidly exchange positions with 
one another in their own monolayer. 


C. Lipids in a lipid bilayer do not flip-flop readily from one 
lipid monolayer to the other. 


D. Hydrogen bonds that form between lipid head groups 
and water molecules are continually broken and re-formed. 


E. Glycolipids move between different membrane-enclosed 
compartments during their synthesis but remain restricted 
to one side of the lipid bilayer. 


F. Margarine contains more saturated lipids than the 
vegetable oil from which it is made. 


G. Some membrane proteins are enzymes. 

H. The sugar layer that surrounds all cells makes cells more 
slippery. 

QUESTION 11-9 


What is meant by the term “two-dimensional fluid”? 


QUESTION 11-10 


The structure of a lipid bilayer is determined by the 
particular properties of its lipid molecules. What would 
happen if: 


A. phospholipids had only one hydrocarbon tail instead 
of two? 


B. the hydrocarbon tails were shorter than normal, say, 
about 10 carbon atoms long? 


C. all of the hydrocarbon tails were saturated? 
D. all of the hydrocarbon tails were unsaturated? 


E. the bilayer contained a mixture of two kinds of 
phospholipid molecules, one with two saturated 
hydrocarbon tails and the other with two unsaturated 
hydrocarbon tails? 


F. each phospholipid molecule were covalently linked 
through the end carbon atom of one of its hydrocarbon tails 
to a phospholipid tail in the opposite monolayer? 


QUESTION 1-1 


What are the differences between a phospholipid molecule 
and a detergent molecule? How would the structure of 

a phospholipid molecule need to change to make it a 
detergent? 


QUESTION 11-12 


A. Membrane lipid molecules exchange places with their 
lipid neighbors every 10-7 second. A lipid molecule diffuses 
from one end of a 2-um-long bacterial cell to the other in 
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about 0.2 seconds. Are these two numbers in agreement 
(assume that the diameter of a lipid head group is about 
0.5 nm)? If not, can you think of a reason for the difference? 


B. To get an appreciation for the great speed of molecular 
diffusion, assume that a lipid head group is about the size 
of a ping-pong ball (4 cm in diameter) and that the floor 

of your living room (6 m X 6 m) is covered wall-to-wall with 
these balls. If two neighboring balls exchanged positions 
once every 10-7 second, what would their speed be in 
kilometers per hour? How long would it take for a ball to 
move from one side of the room to the opposite side? 


QUESTION 11-13 


Why does a red blood cell plasma membrane need 
transmembrane proteins? 


QUESTION 11-14 


Consider a transmembrane protein that forms a hydrophilic 
pore across the plasma membrane of a eukaryotic cell. 
When this protein is activated by binding a specific 

ligand on its extracellular side it allows Na* to enter the 
cell. The protein is made of five similar transmembrane 
subunits, each containing a membrane-spanning a helix 
with hydrophilic amino acid side chains on one surface of 
the helix and hydrophobic amino acid side chains on the 
opposite surface. Considering the function of the protein as 
a channel for Nat ions to enter the cell, propose a possible 
arrangement of the five membrane-spanning a helices in the 
membrane. 


QUESTION 11-15 


In the membrane of a human red blood cell, the ratio of 
the mass of protein (average molecular weight 50,000) 
to phospholipid (molecular weight 800) to cholesterol 
(molecular weight 386) is about 2:1:1. How many lipid 
molecules are there for every protein molecule? 


QUESTION 11-16 


Draw a schematic diagram that shows a close-up view of 
two plasma membranes as they come together during 

cell fusion, as shown in Figure 11-30. Show membrane 
proteins in both cells that were labeled from the outside 
by the binding of differently colored fluorescent antibody 
molecules. Indicate in your drawing the fates of these color 
tags as the cells fuse. Will the fluorescent labels remain on 
the outside of the hybrid cell after cell fusion and still be 
there after the mixing of membrane proteins that occurs 
during the incubation at 37°C? How would the experimental 
outcome be different if the incubation were done at 0°C? 


QUESTION 11-17 


Compare the hydrophobic forces that hold a membrane 
protein in the lipid bilayer with those that help proteins 
fold into a unique three-dimensional structure (described in 
Chapter 4, pp. 121-122 and pp. 127-128). 


QUESTION 11-18 


Predict which one of the following organisms will have the 
highest percentage of unsaturated phospholipids in its 
membranes. Explain your answer. 


A. Antarctic fish 
B. Desert snake 
C. Human being 
D. Polar bear 

. Thermophilic bacterium that lives in hot springs at 
00°C. 
QUESTION 11-19 


Which of the three 20-amino-acid sequences listed below 
in the single-letter amino acid code is the most likely 
candidate to form a transmembrane region (a helix) of a 
transmembrane protein? Explain your answer. 


A ITTLIYFGNMSSVTOQOTILLIS 
B.LLLIFFGVMALVIVVILLIA 
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QUESTION 11-20 


Figure Q11-20 shows the structure of triacylglycerol. 
Would you expect this molecule to be incorporated into the 
lipid bilayer? If so, which part of the molecule would face 
the interior of the bilayer and which would face the water 
on either side of the bilayer? If not, what sort of structure 
would these molecules form in the aqueous environment 
inside a cell? 
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Transport Across Cell 
Membranes 


To survive and grow, cells must be able to exchange molecules with 
their environment. They must import nutrients such as sugars and amino 
acids and eliminate metabolic waste products. They must also regulate 
the concentrations of a variety of inorganic ions in their cytosol and orga- 
nelles. A few molecules, such as CO2 and Oz, can simply diffuse across 
the lipid bilayer of the plasma membrane. But the vast majority cannot. 
Instead, their movement depends on specialized membrane transport 
proteins that span the lipid bilayer, providing private passageways across 
the membrane for select substances ( f 


In this chapter, we consider how cell membranes control the traffic of 
inorganic ions and small, water-soluble molecules into and out of the 
cell and its membrane-enclosed organelles. Cells can also selectively 
transfer large macromolecules such as proteins across their membranes, 
but this transport requires more elaborate machinery and is discussed in 
Chapter 15. 


We begin by outlining some of the general principles that guide the pas- 
sage of ions and small molecules through cell membranes. We then 
examine, in turn, the two main classes of membrane proteins that medi- 
ate this transfer: transporters and channels. Transporters shift small 
organic molecules or inorganic ions from one side of the membrane 
to the other by changing shape. Channels, in contrast, form tiny hydro- 
philic pores across the membrane through which substances can pass 
by diffusion. Most channels only permit passage of ions and are there- 
fore called ion channels. Because these ions are electrically charged, their 
movements can create a powerful electric force—or voltage—across the 
membrane. In the final part of the chapter, we discuss how these voltage 
differences enable nerve cells to communicate—and, ultimately, to shape 
how we behave. 
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Figure 12-1 Cell membranes contain e e 
specialized membrane transport proteins = a ? Y @ 
that facilitate the passage of selected 
small, water-soluble molecules. (A) Protein- | & & 
free, artificial lipid bilayers such as liposomes L 


(see Figure 11-13) are impermeable to 
most water-soluble molecules. (B) Cell 


membranes, by contrast, contain membrane 


transport proteins (light green), each of 


which transfers a particular substance across 


the membrane. This selective transport can 
facilitate the passive diffusion of specific 
molecules or ions across the membrane 
(blue circles), as well as the active pumping 
of specific substances either out of (purple 
triangles) or into (green bars) the cell. 

For other molecules, the membrane is 
impermeable (red squares). The combined 
action of different membrane transport 
proteins allows a specific set of solutes 

to build up inside a membrane-enclosed 
compartment, such as the cytosol or an 
organelle. 
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Figure 12-2 The rate at which a solute 
crosses a protein-free, artificial lipid 
bilayer by simple diffusion depends on 
its size and solubility. Many of the organic 
molecules that a cell uses as nutrients (red) 
are too large and polar to pass efficiently 
through an artificial lipid bilayer that does 
not contain the appropriate membrane 
transport proteins. 
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(A) protein-free, artificial 
lipid bilayer (liposome) 
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(B) cell membrane 


PRINCIPLES OF TRANSMEMBRANE TRANSPORT 


As we saw in Chapter 11, the hydrophobic interior of the lipid bilayer 
creates a barrier to the passage of most hydrophilic molecules, including 
all ions. These molecules are as reluctant to enter a fatty environment 
as hydrophobic molecules are reluctant to interact with water. But cells 
and organelles must allow the passage of many hydrophilic, water-solu- 
ble molecules, such as inorganic ions, sugars, amino acids, nucleotides, 
and other cell metabolites. These molecules cross lipid bilayers far too 
slowly by simple diffusion, so their passage across cell membranes must 
be accelerated by specialized membrane transport proteins—a process 
called facilitated transport. In this section, we review the basic principles 
of such facilitated transmembrane transport and introduce the various 
types of membrane transport proteins that mediate this movement. We 
also discuss why the transport of inorganic ions, in particular, is of such 
fundamental importance for all cells. 


Lipid Bilayers Are Impermeable to lons and Most 
Uncharged Polar Molecules 


Given enough time, virtually any molecule will diffuse across a lipid 
bilayer. The rate at which it diffuses, however, varies enormously depend- 
ing on the size of the molecule and its solubility properties. In general, 
the smaller the molecule and the more hydrophobic, or nonpolar, it is, the 
more rapidly it will diffuse across the lipid bilayer. 


Of course, many of the molecules that are of interest to cells are polar 
and water-soluble. These solutes—substances that, in this case, are dis- 
solved in water—are unable to cross the lipid bilayer without the aid of 
membrane transport proteins. The relative ease with which a variety of 
solutes can cross a lipid bilayer that lacks membrane transport proteins 
is shown in Figure 12-2. 


1. Small, nonpolar molecules, such as molecular oxygen (Oz, molecu- 
lar mass 32 daltons) and carbon dioxide (CO2, 44 daltons), dissolve 
readily in lipid bilayers and therefore diffuse rapidly across them; 
indeed, cells depend on this permeability to gases for the cell respi- 
ration processes discussed in Chapter 14. 


2. Uncharged polar molecules (those with an uneven distribution of 
electric charge) also diffuse readily across a bilayer, but only if they 
are small enough. Water (H2O, 18 daltons) and ethanol (46 daltons), 
for example, cross at a measurable rate, whereas glycerol (92 dal- 
tons) crosses less rapidly. Larger uncharged polar molecules, such 
as glucose (180 daltons), cross hardly at all. 


3. In contrast, lipid bilayers are highly impermeable to all charged 
substances, including all inorganic ions, no matter how small. The 
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charges on these solutes, and their strong electrical attraction to 
water molecules, inhibit their entry into the inner, hydrocarbon 
phase of the bilayer. Thus protein-free lipid bilayers are a billion 
(10%) times more permeable to water, which is polar but uncharged, 
than they are to even small ions such as Na* or K+. 


The lon Concentrations Inside a Cell Are Very Different 
from Those Outside 


Because lipid bilayers are impermeable to inorganic ions, living cells are 
able to maintain internal ion concentrations that are very different from 
the concentrations of ions in the medium that surrounds them. These dif- 
ferences in ion concentration are crucial for a cell's survival and function. 
Among the most important inorganic ions for cells are Na*, K+, Ca?*, CF, 
and H* (protons). The movement of these ions across cell membranes 
plays an essential part in many biological processes, but is perhaps most 
striking in the production of ATP by all cells (discussed in Chapter 14) and 
in the communication of nerve cells (discussed later in this chapter). 


Na* is the most plentiful positively charged ion (cation) outside the cell, 
whereas K* is the most abundant inside (Table 12-1). For a cell to avoid 
being torn apart by electrical forces, the quantity of positive charge inside 
the cell must be balanced by an almost exactly equal quantity of nega- 
tive charge, and the same is true for the charge in the surrounding fluid. 
The high concentration of Na* outside the cell is electrically balanced 
chiefly by extracellular Cl, whereas the high concentration of K* inside 
is balanced by a variety of negatively charged inorganic and organic ions 
(anions), including nucleic acids, proteins, and many cell metabolites 
(see Table 12-1). 


Differences in the Concentration of Inorganic lons 
Across a Cell Membrane Create a Membrane Potential 


Although the electrical charges inside and outside the cell are generally 
kept in balance, tiny excesses of positive or negative charge, concen- 
trated in the neighborhood of the plasma membrane, do occur. Such 
electrical imbalances generate a voltage difference across the membrane 
called the membrane potential. 


When a cell is “unstimulated,” the movement of anions and cations 
across the membrane will be precisely balanced. In such steady-state 


TABLE 12-1 A COMPARISON OF ION CONCENTRATIONS INSIDE AND OUTSIDE A TYPICAL MAMMALIAN CELL 


one Intracellular Concentration (mM) Extracellular Concentration (mM) 


Cations 

Na* 5-15 145 

Kt 140 5 

Mg?* 5 1-2 

Ca^ 104 1-2 

inl 7 x 10° (10-7-2 M or pH 7.2) 4 x 10-9 (10-74 M or pH 7.4) 
Anions** 

CF 5-15 110 


*The concentrations of Mg?* and Ca?* given are for the free ions. There is a total of about 20 mM Mg?* and 1-2 mM Ca?* in cells, 
but most of these ions are bound to proteins and other organic molecules and, for Ca?*, stored within various organelles. 

**In addition to CI, a cell contains many other anions not listed in this table. In fact, most cell constituents are negatively charged 
(HCOz, PO4*, proteins, nucleic acids, metabolites carrying phosphate and carboxyl groups, and so on). 
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Figure 12-3 Inorganic ions and small, 
polar organic molecules can cross a cell 
membrane through either a transporter 
or a channel. (A) A channel forms a pore 
across the bilayer through which specific 
inorganic ions or, in some cases, polar 
organic molecules can diffuse. lon channels 
can exist in either an open or a closed 
conformation, and they transport only in the 
open conformation, as shown here. Channel 
opening and closing is usually controlled by 
an external stimulus or by conditions within 
the cell. (B) A transporter undergoes a series 
of conformational changes to transfer small 
solutes across the lipid bilayer. Transporters 
are very selective for the solutes that they 
bind, and they transfer them at a much 
slower rate than do channels. 


Transport Across Cell Membranes 


conditions, the voltage difference across the cell membrane—called the 
resting membrane potential—holds steady. But it is not zero. In animal 
cells, for example, the resting membrane potential can be anywhere 
between -20 and -200 millivolts (mV), depending on the organism and 
cell type. The value is expressed as a negative number because the inte- 
rior of the cell is more negatively charged than the exterior. 


The membrane potential allows cells to power the transport of certain 
metabolites, and it provides cells that are excitable with a means to com- 
municate with their neighbors. As we discuss shortly, it is the activity 
of different membrane transport proteins, embedded in the bilayer, that 
enables cells to establish and maintain their characteristic membrane 
potential. 


Cells Contain Two Classes of Membrane Transport 
Proteins: Transporters and Channels 


Membrane transport proteins occur in many forms and are present in all 
cell membranes. Each provides a private portal across the membrane for 
a particular small, water-soluble substance—an ion, sugar, or amino acid, 
for example. Most of these membrane transport proteins allow passage 
of only select members of a particular type: some permit transit of Na+ 
but not K+, others K* but not Na+, and so on. Each type of cell membrane 
has its own characteristic set of transport proteins, which determines 
exactly which solutes can pass into and out of that cell or organelle. 


As discussed in Chapter 11, most membrane transport proteins have poly- 
peptide chains that traverse the lipid bilayer multiple times—that is, they 
are multipass transmembrane proteins (see Figure 11-24). When these 
transmembrane segments cluster together, they establish a continuous 
protein-lined pathway that allows selected small, hydrophilic molecules 
to cross the membrane without coming into direct contact with the 
hydrophobic interior of the lipid bilayer. 


Cells contain two main classes of membrane transport proteins: trans- 
porters and channels. These proteins differ in the way they discriminate 
between solutes, transporting some but not others (Figure 12-3). Channels 
discriminate mainly on the basis of size and electric charge: when the 
channel is open, only ions of an appropriate size and charge can pass 
through. A transporter, on the other hand, transfers only those molecules 
or ions that fit into specific binding sites on the protein. Transporters bind 
their solutes with great specificity, in the same way an enzyme binds its 
substrate, and it is this requirement for specific binding that gives trans- 
porters their selectivity. 


Solutes Cross Membranes by Either Passive or Active 
Transport 
Transporters and channels allow small, hydrophilic molecules and ions 


to cross the cell membrane, but what controls whether these substances 
move into the cell (or organelle)—or out of it? In many cases, the direction 
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of transport depends only on the relative concentrations of the solute on 
either side of the membrane. Substances will spontaneously flow “down- 
hill” from a region of high concentration to a region of low concentration, 
provided a pathway exists. Such movements are called passive, because 
they need no additional driving force. If, for example, a solute is present 
at a higher concentration outside the cell than inside, and an appropriate 
channel or transporter is present in the plasma membrane, the solute will 
move into the cell by passive transport, without expenditure of energy 
by the membrane transport protein. This is because even though the sol- 
ute can move in either direction across the membrane, more solute will 
move in than out until the two concentrations equilibrate. All channels— 
and many transporters—act as conduits for such passive transport. 


To move a solute against its concentration gradient, however, a mem- 
brane transport protein must do work: it has to drive the flow of the 
substance “uphill” from a region of low concentration to a region of higher 
concentration. To do so, it couples the transport to some other process 
that provides an input of energy (as discussed in Chapter 3). The move- 
ment of a solute against its concentration gradient in this way is termed 
active transport, and it is carried out by special types of transporters 
called pumps, which harness an energy source to power the transport 
process (Figure 12-4). As discussed later, this energy can come from ATP 
hydrolysis, a transmembrane ion gradient, or sunlight. 


Both the Concentration Gradient and Membrane 
Potential Influence the Passive Transport of Charged 
Solutes 


For an uncharged molecule, the direction of passive transport is deter- 
mined solely by its concentration gradient, as we have outlined above. 
But for electrically charged substances, whether inorganic ions or small 
organic molecules, an additional force comes into play. As mentioned 
earlier, most cell membranes have a voltage across them—a difference 
in charge referred to as a membrane potential. This membrane poten- 
tial exerts a force on any substance that carries an electric charge. The 
cytosolic side of the plasma membrane is usually at a negative potential 
relative to the extracellular side, so the membrane potential tends to pull 
positively charged ions and molecules into the cell and drive negatively 
charged solutes out. 


At the same time, a charged solute—like an uncharged one—will also 
tend to move down its concentration gradient. The net force driving a 
charged solute across a cell membrane is therefore a composite of two 
forces, one due to the concentration gradient and the other due to the 
membrane potential. This net driving force, called the solute’s electro- 
chemical gradient, determines the direction in which each solute will 
flow across the membrane by passive transport. 
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Figure 12-4 Solutes cross cell membranes 
by either passive or active transport. 
Some small, nonpolar molecules such as 
COz (see Figure 12-2) can move passively 
down their concentration gradient across 
the lipid bilayer by simple diffusion, 

without the help of a membrane transport 
protein. Most solutes, however, require 

the assistance of a channel or transporter. 
Passive transport, which allows solutes to 
move down their concentration gradients, 
occurs spontaneously; active transport 
against a concentration gradient requires an 
input of energy. Only transporters can carry 
out active transport, and the transporters 
that perform this function are called pumps. 
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Figure 12-5 An electrochemical gradient 


has two components. The net driving force 


tending to move a charged solute across 
a cell membrane—its electrochemical 
gradient—is the sum of a force from the 
concentration gradient of the solute and 
a force from the membrane potential. 
The membrane potential is represented 
here by the + and -— signs on opposite 
sides of the membrane. The width of the 
green arrow represents the magnitude 
of the electrochemical gradient. (A) The 
concentration gradient and membrane 
potential work together to increase the 
driving force for movement of the solute. 
Such is the case for Nat. (B) The membrane 
potential acts against the concentration 
gradient, decreasing the electrochemical 
driving force. Such is the case for K+. 
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For some ions, the voltage and concentration gradients work in the same 
direction, creating a relatively steep electrochemical gradient (Figure 
12-5A). This is the case for Nat, which is positively charged and at a 
higher concentration outside cells than inside (see Table 12-1). Nat 
therefore tends to enter cells when given an opportunity. If, however, the 
voltage and concentration gradients have opposing effects, the resulting 
electrochemical gradient can be small (Figure 12-5B). This is the case for 
K+, which is present at a much higher concentration inside cells, where 
the resting membrane potential is negative. Because its electrochemical 
gradient across the plasma membrane of resting cells is small, there is 
little net movement of K+ across the membrane even when K* channels 
are open. 


Water Moves Across Cell Membranes Down Its 
Concentration Gradient—a Process Called Osmosis 


Cells are mostly water (generally about 70% by weight), and so the move- 
ment of water across cell membranes is crucially important for living 
things. Because water molecules are small and uncharged, they can 
diffuse directly across the lipid bilayer (see Figure 12-2). However, this 
movement is relatively slow. To facilitate the flow of water, some cells 
contain specialized channels called aquaporins in their plasma mem- 
brane (Figure 12-6 and Movie 12.1). For many cells, such as those in the 
kidney or in various secretory glands, aquaporins are essential for their 
function. 


But for water-filled cells in an aqueous environment, does water tend 
to enter the cell or leave it? As we saw in Table 12-1, cells contain a 
high concentration of solutes, including many charged molecules and 
ions. Thus the total concentration of solute particles inside the cell—also 
called its osmolarity—generally exceeds the solute concentration outside 
the cell. The resulting osmotic gradient tends to “pull” water into the cell. 
This movement of water down its concentration gradient—from an area 
of low solute concentration (high water concentration) to an area of high 
solute concentration (low water concentration)—is called osmosis. 


Figure 12-6 Water molecules diffuse rapidly through aquaporin 
channels in the plasma membrane of some cells. (A) Shaped like 
an hourglass, each aquaporin channel forms a pore across the bilayer, 
allowing the selective passage of water molecules. Shown here is an 
aquaporin tetramer, the biologically active form of the protein. (B) In 
this snapshot, taken from a real-time, molecular dynamics simulation, 
four columns of water molecules (blue) can be seen passing through 
the pores of an aquaporin tetramer (not shown). The space where the 
membrane would be located is indicated. (B, adapted from 

B. de Groot and H. Grubmiller, Science 294:2353-2357, 2001.) 
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Osmosis, if it occurs without constraint, can make a cell swell. Different 
cells cope with this osmotic challenge in different ways. Some freshwater 
protozoans, such as amoebae, eliminate excess water using contractile 
vacuoles that periodically discharge their contents to the exterior (Figure 
12-7A). Plant cells are prevented from swelling by their tough cell walls 
and so can tolerate a large osmotic difference across their plasma mem- 
brane (Figure 12-7B); indeed, plant cells make use of osmotic swelling 
pressure, or turgor pressure, to keep their cell walls tense, so that the 
stems of the plant are rigid and its leaves are extended. If turgor pressure 
is lost, plants wilt. Animal cells maintain osmotic equilibrium by using 
transmembrane pumps to expel solutes, such as the Na* ions that tend to 
leak into the cell (Figure 12-7C). 


TRANSPORTERS AND THEIR FUNCTIONS 


Transporters are responsible for the movement of most small, water- 
soluble, organic molecules and a handful of inorganic ions across cell 
membranes. Each transporter is highly selective, often transferring just 
one type of solute. To guide and propel the complex traffic of substances 
into and out of the cell, and between the cytosol and the different mem- 
brane-enclosed organelles, each cell membrane contains a characteristic 
set of different transporters appropriate to that particular membrane. 
For example, the plasma membrane contains transporters that import 
nutrients such as sugars, amino acids, and nucleotides; the lysosome 
membrane contains an H* transporter that imports H* to acidify the lyso- 
some interior and other transporters that move digestion products out 
of the lysosome into the cytosol; the inner membrane of mitochondria 
contains transporters for importing the pyruvate that mitochondria use 
as fuel for generating ATP, as well as transporters for exporting ATP once 
it is synthesized (Figure 12-8). 


In this section, we describe the general principles that govern the function 
of transporters, and we present a more detailed view of the molecular 
mechanisms that drive the movement of a few key solutes. 
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Figure 12-7 Cells use different tactics to 
avoid osmotic swelling. (A) A freshwater 
amoeba avoids swelling by periodically 
ejecting the water that moves into the cell 
and accumulates in contractile vacuoles. 
The contractile vacuole first accumulates 
solutes, which cause water to follow by 
osmosis; it then pumps most of the solutes 
back into the cytosol before emptying its 
contents at the cell surface. (B) The plant 
cell's tough cell wall prevents swelling. 

(C) The animal cell reduces its intracellular 
solute concentration by pumping out ions. 


Figure 12-8 Each cell membrane has its 
own characteristic set of transporters. 
These transporters allow each membrane to 
carry out its unique functions. Only a few of 
these transporters are shown here. 
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QUESTION 12-1 


A simple enzyme reaction can be 
described by the equation 

E + S <| ES & E + P where E is 
the enzyme, S the substrate, 

P the product, and ES the enzyme- 
substrate complex. 

A. Write a corresponding equation 
describing the workings of a 
transporter (T) that mediates the 
transport of a solute (S) down its 
concentration gradient. 

B. What does this equation tell you 
about the function of a transporter? 
C. Why would this equation be an 
inappropriate choice to represent 
the function of a channel? 
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Passive Transporters Move a Solute Along Its 
Electrochemical Gradient 


An important example of a transporter that mediates passive transport 
is the glucose transporter in the plasma membrane of many mammalian 
cell types. The protein, which consists of a polypeptide chain that crosses 
the membrane at least 12 times, can adopt several conformations—and 
it switches reversibly and randomly between them. In one conformation, 
the transporter exposes binding sites for glucose to the exterior of the 
cell; in another, it exposes the sites to the cell interior. 


Because glucose is uncharged, the electrical component of its electro- 
chemical gradient is zero. Thus the direction in which it is transported is 
determined by its concentration gradient alone. When glucose is plenti- 
ful outside cells, as it is after a meal, the sugar binds to the transporter’s 
externally displayed binding sites; if the protein then switches confor- 
mation—spontaneously and at random—it will carry the bound sugar 
inward and release it into the cytosol, where the glucose concentration 
is low (Figure 12-9). Conversely, when blood glucose levels are low—as 
they are when you are hungry—the hormone glucagon stimulates liver 
cells to produce large amounts of glucose by the breakdown of glycogen. 
As a result, the glucose concentration is higher inside liver cells than 
outside. This glucose can bind to the internally displayed binding sites 
on the transporter. When the protein then switches conformation in the 
opposite direction—again spontaneously and randomly—the glucose will 
be transported out of the cells and made available for import by other, 
energy-requiring cells. The net flow of glucose can thus go either way, 
according to the direction of the glucose concentration gradient across 
the plasma membrane: inward if more glucose is binding to the trans- 
porter’s externally displayed sites, and outward if the opposite is true. 


Although passive transporters themselves play no part in controlling 
the direction of solute transport, they are highly selective in terms of 
which solutes they will move. For example, the binding sites in the glu- 
cose transporter bind only D-glucose and not its mirror image L-glucose, 
which the cell cannot use as an energy source. 


Pumps Actively Transport a Solute Against Its 
Electrochemical Gradient 


Cells cannot rely solely on passive transport to maintain the proper 
balance of solutes. The active transport of solutes against their electro- 
chemical gradient is essential to achieving the appropriate intracellular 
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Figure 12-9 Conformational changes in a transporter mediate the passive transport of a solute such as glucose. The transporter 
is shown in three conformational states: in the outward-open state (left), the binding sites for solute are exposed on the outside; in 

the inward-open state (right), the sites are exposed on the inside of the bilayer; and in the occluded state (center), the sites are not 
accessible from either side. The transition between the states occurs randomly, is completely reversible, and—most importantly for 
the function of the transporter shown—does not depend on whether the solute-binding site is occupied. Therefore, if the solute 
concentration is higher on the outside of the bilayer, solute will bind more often to the transporter in the outward-open conformation 
than in the inward-open conformation, and there will be a net transport of glucose down its concentration gradient. 
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ionic composition and for importing solutes that are at a lower concen- 
tration outside the cell than inside. For these purposes, cells depend on 
transmembrane pumps, which can carry out active transport in three 
main ways (Figure 12-10): (i) gradient-driven pumps link the uphill 
transport of one solute across a membrane to the downhill transport of 
another; (ii) ATP-driven pumps use the energy released by the hydrolysis 
of ATP to drive uphill transport; and (iii) light-driven pumps, which are 
found mainly in bacterial cells, use energy derived from sunlight to drive 
uphill transport, as discussed in Chapter 11 for bacteriorhodopsin (see 
Figure 11-28). 


These different forms of active transport are often linked. Thus, in the 
plasma membrane of an animal cell, an ATP-driven Na* pump transports 
Na* out of the cell against its electrochemical gradient; this Na+ can then 
flow back into the cell, down its electrochemical gradient, through various 
Nat gradient-driven pumps. The influx of Na+ through these gradient- 
driven pumps provides the energy for the active transport of many other 
substances into the cell against their electrochemical gradients. If the 
ATP-driven Na+ pump ceased operating, the Na* gradient would soon run 
down, and transport through Nat gradient-driven pumps would come to 
a halt. For this reason, the ATP-driven Na* pump has a central role in the 
active transport of small molecules across the plasma membrane of ani- 
mal cells. Plant cells, fungi, and many bacteria use ATP-driven H* pumps 
in an analogous way: in pumping H* out of the cell, these proteins create 
an electrochemical gradient of H+ across the plasma membrane that is 
subsequently harnessed for solute transport, as we discuss later. 


The Nat Pump in Animal Cells Uses Energy Supplied by 
ATP to Expel Na* and Bring in K+ 


The ATP-driven Na* pump plays such a central part in the energy econ- 
omy of animal cells that it typically accounts for 30% or more of their 
total ATP consumption. This pump uses the energy derived from ATP 
hydrolysis to transport Nat out of the cell as it carries K* in. The pump is 
therefore sometimes called the Na*-K* ATPase or the Na*-K* pump. 


During the pumping process, the energy from ATP hydrolysis fuels a step- 
wise series of protein conformational changes that drives the exchange 
of Nat and K* ions. As part of the process, the phosphate group removed 
from ATP gets transferred to the pump itself (Figure 12-11). 


The transport of Na* ions out, and K* ions in, takes place in a cycle in 
which each step depends on the one before (Figure 12—12). If any of the 
individual steps is prevented from occurring, the entire cycle halts. The 
toxin ouabain, for example, inhibits the Na* pump by preventing the bind- 
ing of extracellular K+, arresting the cycle. 
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Figure 12-10 Pumps carry out active 
transport in three main ways. The actively 
transported solute is shown in gold, and the 
energy source is shown in red. 
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Figure 12-11 The Na* pump uses the (+) 
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The Na* pump is very efficient: the whole pumping cycle takes only 10 
milliseconds. Furthermore, the tight coupling between steps in the cycle 
ensures that the pump operates only when the appropriate ions—both 
Nat and K*—are available to be transported, thereby avoiding a wasteful 
hydrolysis of ATP. 


The Nat Pump Generates a Steep Concentration 
Gradient of Na+ Across the Plasma Membrane 


The Na* pump functions like a bilge pump in a leaky ship, ceaselessly 
expelling the Nat that is constantly slipping into the cell through other 
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Figure 12-12 The Na* pump undergoes a series of conformational changes as it exchanges Nat? ions for K*. 
The binding of cytosolic Na* (1) and the subsequent phosphorylation by ATP of the cytosolic face of the pump (2) 
induce the protein to undergo conformational changes that transfer the Na* across the membrane and release 
it outside the cell (3). The high-energy linkage of the phosphate to the protein provides the energy to drive the 
conformational changes. The binding of K* from the extracellular space (4) and the subsequent dephosphorylation 
(5) allow the protein to return to its original conformation, which transfers the K* across the membrane and releases 
it into the cytosol (6). 

The cycle is shown in Movie 12.2. The changes in conformation are analogous to those shown for the 
glucose transporter in Figure 12-9, except that here the Na*-dependent phosphorylation and K*-dependent 
dephosphorylation of the protein cause the conformational changes to occur in an orderly fashion, enabling the 
protein to do useful work. For simplicity, only one binding site is shown for each ion. The real pump in mammalian 
cells contains three binding sites for Na* and two for K*. The net result of one cycle of the pump is therefore the 
transport of three Nat out and two K+ in. Ouabain inhibits the pump by preventing K* binding (4). 


Figure 12—13 The high concentration of Na+ outside the cell is like 
water behind a high dam. The water behind the dam has potential 
energy, which can be used to drive energy-requiring processes. In 

the same way, an ion gradient across a membrane can be used to 
drive active processes in a cell, including the active transport of other 
molecules across the plasma membrane. Shown here is the Table Rock 
Dam in Branson, Missouri, USA. (Gary Saxe/Shutterstock.) 


transporters and ion channels in the plasma membrane. In this way, 
the pump keeps the Na* concentration in the cytosol about 10-30 times 
lower than that in the extracellular fluid and the K* concentration about 
10-30 times higher (see Table 12-1, p. 391). 


This steep concentration gradient of Na+ across the plasma membrane 
acts together with the membrane potential to create a large Na* elec- 
trochemical gradient (see Figure 12-5A). This high concentration of Na* 
outside the cell, on the uphill side of its electrochemical gradient, is like 
a large volume of water behind a high dam: it represents a very large 
store of energy (Figure 12-13). Even if one artificially halts the operation 
of the Na* pump with ouabain, this stored energy is sufficient to sus- 
tain for many minutes the various gradient-driven pumps in the plasma 
membrane that are fueled by the downhill flow of Na*, which we discuss 
shortly. 


Ca2+ Pumps Keep the Cytosolic Ca2+ Concentration Low 


Ca?*, like Na*, is also kept at a low concentration in the cytosol com- 
pared with its concentration in the extracellular fluid. But Ca?* is much 
less plentiful than Nat, both inside and outside cells (see Table 12-1). 
The movement of this ion across cell membranes is nonetheless crucial, 
because Ca? can bind tightly to a variety of proteins in the cell, altering 
their activities. An influx of Ca?+ into the cytosol through Ca?* channels, 
for example, is used by different cells as an intracellular signal to trig- 
ger various complex processes, such as muscle contraction (discussed in 
Chapter 17), fertilization (discussed in Chapters 16 and 19), and nerve cell 
communication, which is discussed later. 


The lower the background concentration of free Ca?* in the cytosol, the 
more sensitive the cell is to an increase in cytosolic Ca?*. Thus eukaryotic 
cells in general maintain a very low concentration of free Ca? in their 
cytosol (about 10% mM) compared to the much higher concentration of 
Ca?* outside of the cell (typically 1-2 mM). This huge concentration dif- 
ference is achieved mainly by means of ATP-driven Ca2* pumps in both 
the plasma membrane and the endoplasmic reticulum membrane, which 
actively remove Ca?* from the cytosol. 


Ca?t pumps are ATPases that work in much the same way as the Nat 
pump depicted in Figure 12-12. The main difference is that Ca** pumps 
return to their original conformation without a requirement for binding 
and transporting a second ion (Figure 12-14). The Nat and Ca** pumps 
have similar amino acid sequences and structures, indicating that they 
share a common evolutionary origin. 


Gradient-driven Pumps Exploit Solute Gradients to 
Mediate Active Transport 


A gradient of any solute across a membrane, like the electrochemical 
Na* gradient generated by the Na* pump, can be used to drive the active 
transport of a second molecule. The downhill movement of the first sol- 
ute down its gradient provides the energy to power the uphill transport 
of the second solute. The active transporters that work in this way are 
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Figure 12-14 The Ca?* pump in the 
sarcoplasmic reticulum was the first 
ATP-driven ion pump to have its three- 
dimensional structure determined by 
x-ray crystallography. When a muscle 
cell is stimulated, Ca** floods into the 
cytosol from the sarcoplasmic reticulum—a 
specialized form of endoplasmic reticulum. 
The influx of Ca?* stimulates the cell to 
contract; to recover from the contraction, 
Ca?* must be pumped back into the 
sarcoplasmic reticulum by this Ca2* pump. 
The Ca?* pump uses ATP to 
phosphorylate itself, inducing a series of 
conformational changes (similar to the ones 
of the Na* pump shown in Figure 12-12); 
when the pump is open to the lumen of the 
sarcoplasmic reticulum, the Ca?*-binding 
sites are eliminated, ejecting the two Ca?+ 
ions into the organelle (Movie 12.3). 


Figure 12—15 Gradient-driven pumps 
can act as symports or antiports. They 
transfer solutes either in the same direction, 
in which case they are called symports, or 
in opposite directions, which are antiports 
(Movie 12.4). Uniports, by contrast, only 
facilitate the movement of a solute down 
its concentration gradient. Because such 
movement does not require an additional 
energy source, uniports are not pumps. 
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called gradient-driven pumps (see Figure 12-10). They can couple the 
movement of one inorganic ion to that of another, the movement of an 
inorganic ion to that of a small organic molecule, or the movement of 
one small organic molecule to that of another. If the pump moves both 
solutes in the same direction across the membrane, it is called a symport. 
If it moves them in opposite directions, it is called an antiport. A trans- 
porter that ferries only one type of solute across the membrane down 
its concentration gradient (and is therefore not a pump) is called a uni- 
port (Figure 12-15). The glucose transporter described earlier (see Figure 
12-9) is an example of a uniport. 


The Electrochemical Na+ Gradient Drives the Transport of 
Glucose Across the Plasma Membrane of Animal Cells 


Symports that make use of the inward flow of Na* down its steep electro- 
chemical gradient have an especially important role in driving the import 
of solutes into animal cells. The epithelial cells that line the gut, for exam- 
ple, transport glucose from the gut lumen across the gut epithelium and, 
ultimately, into the blood. If these cells had only a passive glucose uni- 
port (the transporter shown in Figure 12-9), they would release glucose 
into the gut lumen after fasting just as freely as they take it up from the 
gut after a feast. However, these epithelial cells also possess a glucose- 
Na* symport, which they can use to take up glucose from the gut lumen, 
even when the concentration of glucose is higher in the epithelial cell’s 
cytosol than it is inside the gut. As the electrochemical gradient for Na* 
is so steep, when Nat moves into the cell down its gradient, glucose is, 
in a sense, “dragged” into the cell along with it. Because the binding of 
Na* and glucose is cooperative—the binding of one enhances the bind- 
ing of the other—if one of the two solutes is missing, the other fails to 
bind; therefore both molecules must be present for this gradient-driven 
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Figure 12-16 A glucose—Na* symport uses the electrochemical Na* gradient to drive the active import of 
glucose. The pump oscillates randomly between alternate states. In one state ("outward-open”) the pump is open 
to the extracellular space; in another state ("inward-open”) it is open to the cytosol. Although Na* and glucose can 
each bind to the pump in either of these “open” states, the pump can transition between them only through an 
“occluded” state in which both glucose and Na* are bound (“occluded-occupied”) or neither is bound ("occluded- 
empty”). Because the Na* concentration is high in the extracellular space, the Na*-binding site is readily occupied in 
the outward-open state, and the transporter must wait for a rare glucose molecule to bind. At that point, the pump 
flips to the occluded-occupied state, trapping both solutes. 

Because conformational transitions are reversible, one of two things can happen to the pump in the occluded- 
occupied state. The transporter could flip back to the outward-open state; in this case, the solutes would 
dissociate, and nothing would be gained. Alternatively, it could flip into the inward-open state, exposing the solute- 
binding sites to the cytosol where the Na* concentration is very low. Thus sodium readily dissociates (and will be 
subsequently pumped back out of the cell by the Na* pump, shown in Figure 12—11, to maintain the steep Na* 
gradient). The transporter is now trapped with a partially occupied binding site until the glucose molecule also 
dissociates. At this point, with no solute bound, it can transition into the occluded-empty state and from there back 
to the outward-open state to repeat the transport cycle. 


transport to occur and Na* will not leak into the cell without doing useful 
work (Figure 12—16). 


If the gut epithelial cells had only this symport, however, they would take 
up glucose and never release it for use by the other cells of the body. 
These epithelial cells, therefore, have two types of glucose transporters 
located at opposite ends of the cell. In the apical domain of the plasma 
membrane, which faces the gut lumen, they have the glucose-Na* sym- 
ports. These use the energy of the Na+ gradient to actively import glucose, 
creating a high concentration of the sugar in the cytosol. In the basal and 
lateral domains of the plasma membrane, the cells have passive glucose 
uniports, which release the glucose down its concentration gradient for 
use by other tissues (Figure 12-17). As shown in Figure 12-17, the two 
types of glucose transporters are kept segregated in their proper domains 
of the plasma membrane by a diffusion barrier formed by a tight junction 
around the apex of the cell. This prevents mixing of membrane compo- 
nents between the two domains, as discussed in Chapter 11 (see Figure 


11-32). 


Cells in the lining of the gut and in many other organs, including the 
kidney, contain a variety of symports in their plasma membrane that are 
similarly driven by the electrochemical gradient of Na*; each of these 
gradient-driven pumps specifically imports a small group of related sug- 
ars or amino acids into the cell. At the same time, Nat-driven pumps 
that operate as antiports are also important for cells. For example, the 
Na*-H* exchanger in the plasma membrane of many animal cells uses 
the downhill influx of Na* to pump Ht out of the cell; it is one of the main 
devices that animal cells use to control the pH in their cytosol—prevent- 
ing the cell interior from becoming too acidic. 
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Figure 12-17 Two types of glucose 
transporters enable gut epithelial cells 
to transfer glucose across the epithelial 
lining of the gut. Na* that enters the 

cell via the Na*-driven glucose symport 

is subsequently pumped out by Na* 
pumps in the basal and lateral plasma 
membranes, keeping the concentration 
of Na* in the cytosol low—and the Na* 
electrochemical gradient steep. The diet 
provides ample Na* in the gut lumen to 
drive the Na* gradient-driven glucose 
symport. The process is shown in 

Movie 12.5. 


QUESTION 12-2 


A rise in the intracellular Ca?* 
concentration causes muscle cells 
to contract. In addition to an ATP- 
driven Ca? pump, muscle cells 
that contract quickly and regularly, 
such as those of the heart, have an 
additional type of Ca?* pump—an 
antiport that exchanges Ca** for 
extracellular Na* across the plasma 
membrane. The majority of the 
Ca** ions that have entered the 
cell during contraction are rapidly 
pumped back out of the cell by 
this antiport, thus allowing the 

cell to relax. Ouabain and digitalis 
are used for treating patients with 
heart disease because they make 
heart muscle cells contract more 
strongly. Both drugs function by 
partially inhibiting the Na* pump in 
the plasma membrane of these cells. 
Can you propose an explanation 
for the effects of the drugs in the 
patients? What will happen if too 
much of either drug is taken? 
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EXTRACELLULAR FLUID 


Electrochemical H+ Gradients Drive the Transport of 
Solutes in Plants, Fungi, and Bacteria 


Plant cells, bacteria, and fungi (including yeasts) do not have Na+ pumps 
in their plasma membrane. Instead of an electrochemical Nat gradient, 
they rely mainly on an electrochemical gradient of H* to import solutes 
into the cell. The gradient is created by H+ pumps in the plasma mem- 
brane that pump H* out of the cell, thus setting up an electrochemical 
proton gradient across this membrane and creating an acid pH in the 
medium surrounding the cell. The import of many sugars and amino 
acids into bacterial cells is then mediated by H+ symports, which use the 
electrochemical H* gradient in much the same way that animal cells use 
the electrochemical Na* gradient to import these nutrients. 


In some photosynthetic bacteria, the H* gradient is created by the activity 
of light-driven H* pumps such as bacteriorhodopsin (see Figure 11-28). 
In other bacteria, fungi, and plants, the H* gradient is generated by H* 
pumps in the plasma membrane that use the energy of ATP hydrolysis 
to pump H* out of the cell; these H* pumps resemble the Na+ pumps and 
Ca? pumps of animal cells discussed earlier. 


A different type of ATP-dependent H* pump is found in the membranes 
of some intracellular organelles, such as the lysosomes of animal cells 
and the central vacuole of plant and fungal cells. These pumps—which 
resemble the turbine-like enzyme that synthesizes ATP in mitochondria 
and chloroplasts (discussed in Chapter 14)—actively transport H* out of 
the cytosol into the organelle, thereby helping to keep the pH of the cyto- 
sol neutral and the pH of the interior of the organelle acidic. An acid 
environment is crucial to the function of many organelles, as we discuss 
in Chapter 15. 


Some of the transmembrane pumps considered in this chapter are shown 
in Figure 12-18 and are listed in Table 12-2. 
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10 um 
(A) ANIMAL CELL (B) PLANT CELL (0) PLANT CELLS 
ION CHANNELS AND THE MEMBRANE Figure 12-18 Animal and plant cells use 
a variety of transmembrane pumps to 
POTENTIAL drive the active transport of solutes. 
a . (A) In animal cells, an electrochemical Nat 
In principle, the simplest way to allow a small, water-soluble substance gradient across the plasma membrane, 
to cross from one side of a membrane to the other is to create a hydro- generated by the Na* pump, is used by 
philic channel through which the solute can pass. Channel proteins, or symports to import various solutes. (B) In 


channels, perform this function in cell membranes, forming transmem- plant cells, an electrochemical gradient of 
H*, set up by an H* pump, is often used 


brane pores that allow the passive movement of small, water-soluble forthis purpose; asimilar au aeayre used 
molecules and ions into or out of the cell or organelle. by bacteria and fungi (not shown). The 


lysosomes in animal cells and the vacuoles 
in plant and fungal cells contain a similar 


A few channels form relatively large, aqueous pores; examples are 


the proteins that form gap junctions between two adjacent cells (see H* pump in their membranes that pumps 
Figure 20-28) and the porins that form pores in the outer membrane in H+, helping to keep the internal 

of mitochondria and some bacteria (see Figure 11-25). But such large, environment of these organelles acidic. 
permissive channels would lead to disastrous leaks if they directly con- (C) An electron micrograph shows the 


vacuole in plant cells in a young tobacco 


nected the cytosol of a cell to the extracellular space. Thus most of the 
leaf. (C, courtesy of J. Burgess.) 


channels in the plasma membrane form narrow, highly selective pores. 


TABLE 12-2 SOME EXAMPLES OF TRANSMEMBRANE PUMPS 


Na*-driven glucose pump apical plasma membrane of kidney and Na* gradient active import of glucose 
(glucose-Na* symport) intestinal cells 
Na*-H* exchanger plasma membrane of animal cells Na* gradient active export of H* ions, pH 
regulation 
Na* pump (Na*-Kt plasma membrane of most animal cells = ATP hydrolysis active export of Na* and import of K* 
ATPase) 
Ca?* pump (Ca2* ATPase) plasma membrane of eukaryotic cells ATP hydrolysis active export of Ca2* 
Ca** pump (Ca2* ATPase) sarcoplasmic reticulum membrane of ATP hydrolysis active import of Ca?* into 
muscle cells and endoplasmic reticulum sarcoplasmic reticulum or 
membrane of most animal cells endoplasmic reticulum 
H* pump (H* ATPase) plasma membrane of plant cells, fungi, ATP hydrolysis active export of H* 
and some bacteria 
H* pump (H* ATPase) membranes of lysosomes in animal cells © ATP hydrolysis active export of H* from cytosol into 
and of vacuoles in plant and fungal cells lysosome or vacuole 


Bacteriorhodopsin plasma membrane of some bacteria light active export of H* 
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Figure 12-19 An ion channel has a 
selectivity filter that controls which 
inorganic ions it will allow to cross the 
membrane. Shown here is a portion of 

a bacterial K* channel. One of the four 
protein subunits has been omitted from 
the drawing to expose the interior structure 
of the pore (blue). From the cytosolic side, 
the pore opens into a vestibule that sits 

in the middle of the membrane. K* ions 

in the vestibule are still partially cloaked 
with associated water molecules. The 
narrow selectivity filter, which connects the 
vestibule with the outside of the cell, 

is lined with polar groups (not shown) 

that form transient binding sites for the 

K* ions once the ions have shed their water 
shell. To observe this selectivity in action. 
(Adapted from D.A. Doyle et al., Science 
280:69-77, 1998.) 
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Figure 12-20 A typical ion channel 
fluctuates between closed and open 
conformations. The channel shown here 

in cross section forms a hydrophilic pore 
across the lipid bilayer only in the “open” 
conformation. As illustrated in Figure 12-19, 
the pore narrows to atomic dimensions in 
the selectivity filter, where the ion selectivity 
of the channel is largely determined. 
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The aquaporins discussed earlier, for example, facilitate the flow of water 
across the plasma membrane of some prokaryotic and eukaryotic cells. 
These pores are structured in such a way that they allow the passive dif- 
fusion of uncharged water molecules, while prohibiting the movement of 
ions, including even the smallest ion, H+. 


The bulk of a cell's channels facilitate the passage of select inorganic 
ions. It is these ion channels that we discuss in this section. 


lon Channels Are lon-selective and Gated 


Two important properties distinguish ion channels from simple holes in 
the membrane. First, they show ion selectivity, permitting some inorganic 
ions to pass but not others. Ion selectivity depends on the diameter and 
shape of the ion channel and on the distribution of the charged amino 
acids that line it. Each ion in aqueous solution is surrounded by a small 
shell of water molecules, most of which have to be shed for the ions to 
pass, in single file, through the selectivity filter in the narrowest part of the 
ion channel (Figure 12-19). An ion channel is narrow enough in places 
to force ions into contact with the channel wall, so that only those ions of 
appropriate size and charge are able to pass (Movie 12.6). 


The second important distinction between ion channels and simple holes 
in the membrane is that ion channels are not continuously open. Ion 
transport would be of no value to the cell if the many thousands of ion 
channels in a cell membrane were open all the time and there were no 
means of controlling the flow of ions through them. Instead, ion chan- 
nels open only briefly and then close again (Figure 12-20). As we discuss 
later, most ion channels are gated: a specific stimulus triggers them to 
switch between a closed and an open state by inducing a change in their 
conformation. 


Unlike a transporter, an ion channel does not need to undergo conforma- 
tional changes for each ion it passes, and so it has a large advantage over 
a transporter with respect to its maximum rate of transport. More than 
a million ions can pass through an open channel each second, which is 
1000 times greater than the fastest rate of transfer known for any trans- 
porter. On the other hand, channels cannot couple the ion flow to an 
energy source to carry out active transport; they simply make the mem- 
brane transiently permeable to selected inorganic ions, mainly Na‘, K+, 
Ca”, or CF. 
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Figure 12-21 A Venus flytrap uses electrical signaling to capture 
its prey. The leaves snap shut in less than half a second when an insect 
moves across them. The response is triggered by touching any two 

of the three trigger hairs in succession in the center of each leaf. This 
mechanical stimulation opens ion channels in the plasma membrane 
and thereby sets off an electrical signal, which, by an unknown 
mechanism, leads to a rapid change in turgor pressure that closes the 
leaf. (Gabor Izso/Getty Images.) 


Thanks to active transport by pumps, the concentrations of many ions 
are far from equilibrium across a cell membrane. When an ion channel 
opens, therefore, ions usually flow through it, moving rapidly down their 
electrochemical gradients. This rapid shift of ions changes the membrane 
potential, as we discuss next. 


Membrane Potential Is Governed by the Permeability of 
a Membrane to Specific lons 


Changes in membrane potential are the basis of electrical signaling in 
many types of cells, whether they are the nerve or muscle cells in ani- 
mals, or the touch-sensitive cells of a carnivorous plant (Figure 12-21). 
Such electrical changes are mediated by alterations in the permeability 
of membranes to ions. As we saw earlier, in an animal cell that is in 
an unstimulated, or “resting,” state, the negative charges on the many 
types of organic molecules found inside the cell are largely balanced by 
K+, the predominant intracellular ion (see Table 12-1). K+ is continuously 
imported into the cell by the Na+ pump, which generates a K* gradient 
across the plasma membrane as it pumps Na* out and K* in (see Figure 
12-11). 


The plasma membrane, however, also contains a set of K* channels, 
known as K* leak channels, that allow K* to move freely across the 
membrane. In a resting cell, these are the main ion channels open in the 
plasma membrane, rendering the membrane much more permeable to 
K* than to other ions. When K* flows out of the cell—down the concentra- 
tion gradient generated by the ceaseless operation of the Nat pump—the 
loss of positive charge inside the cell creates a voltage difference, or 
membrane potential (Figure 12-22). Because this charge imbalance will 
oppose any further movement of K* out of the cell, an equilibrium condi- 
tion is established in which the membrane potential keeping K* inside the 
cell is just strong enough to counteract the tendency of K* to move down 
its concentration gradient and out of the cell. In this state of equilibrium, 
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(A) exact balance of charges on each side (B) a few positive ions (red) cross the 
of the membrane: membrane potential = 0 membrane from right to left, setting up 


a nonzero membrane potential 


Figure 12-22 The distribution of ions 

on either side of a cell membrane gives 
rise to its membrane potential. The 
membrane potential results from a thin 

(<1 nm) layer of ions close to the membrane, 
held in place by their electrical attraction 

to oppositely charged ions on the other 

side of the membrane. (A) When there is 

an exact balance of charges on either side 

of the membrane, there is no membrane 
potential. (B) When ions of one type cross the 
membrane, they establish a charge difference 
across the two sides of the membrane that 
creates a membrane potential. The number 
of ions that must move across the membrane 
to set up a membrane potential is a tiny 
fraction of all those present on either side. In 
the case of the plasma membrane in animal 
cells, for example, 6000 K* ions crossing 

1 um? of membrane are enough to shift 

the membrane potential by about 100 mV; 
the number of K* ions in 1 pum? of cytosol is 
70,000 times larger than this. 
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Figure 12-23 The Kt concentration 
gradient and K* leak channels play major 


parts in generating the resting membrane 


potential across the plasma membrane in 
animal cells. (A) A hypothetical situation in 
which the K* leak channels are closed and 
the membrane potential is zero. (B) As soon 
as the channels open, K* will tend to leave 
the cell, moving down its concentration 
gradient. Assuming the membrane contains 
no open channels permeable to other ions, 
K* will cross the membrane but negative 
ions will be unable to follow. The resulting 
charge imbalance gives rise to a membrane 
potential that tends to drive K* back into 
the cell. At equilibrium, the effect of the Kt 
concentration gradient is exactly balanced 
by the effect of the membrane potential, 
and there is no net movement of K* across 
the membrane. 

The Na* pump (not shown here) also 
contributes to the resting potential—both 
by helping to establish the K* gradient 
and by pumping 3 Na* ions out of the cell 
for every 2 K* ions it pumps in (see Figure 
12-11). Moving one more positively charged 
ion out of the cell with each pumping cycle 
helps to keep the inside of the cell more 
negative than the outside. 


The force tending to drive an ion across a 
membrane is made up of two components: 
one due to the electrical membrane 
potential and one due to the concentration 
gradient of the ion. At equilibrium, the two 
forces are balanced and satisfy a simple 
mathematical relationship given by the 


Nernst equation 
V = 62 logyy (Cy /G) 


where V is the membrane potential in 
millivolts, and C, and G are the outside and 
inside concentrations of the ion, respectively. 
This form of the equation assumes that the 
ion carries a single positive charge and that the 
temperature is 37°C. 
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the electrochemical gradient for K* is zero, even though there is still a 
much higher concentration of K* inside the cell than out (Figure 12-23). 


The membrane potential in such steady-state conditions—in which the 
flow of positive and negative ions across the plasma membrane is pre- 
cisely balanced, so that no further difference in charge accumulates 
across the membrane—is called the resting membrane potential. A 
simple formula called the Nernst equation expresses this equilibrium 
quantitatively and makes it possible to calculate the theoretical resting 
membrane potential if the ion concentrations on either side of the mem- 
brane are known (Figure 12-24). In animal cells, the resting membrane 
potential—which varies between -20 and -200 mV—is chiefly a reflec- 
tion of the electrochemical K* gradient across the plasma membrane, 
because, at rest, the plasma membrane is chiefly permeable to K+, and K* 
is the main positive ion inside the cell. 


When a cell is stimulated, other ion channels in the plasma membrane 
open, changing the membrane’s permeability to those ions. Whether the 
ions enter or leave the cell depends on the direction of their electrochem- 
ical gradients. Thus the membrane potential at any time depends on both 
the state of the membrane’s ion channels and the ion concentrations on 
either side of the plasma membrane. Bulk changes in ion concentrations 
cannot occur quickly enough to drive the rapid changes in membrane 
potential that are associated with electrical signaling. Instead, it is the 
rapid opening and closing of ion channels, which occurs within millisec- 
onds, that matters most for this type of cell signaling. 


Figure 12-24 The Nernst equation can be used to calculate the 
contribution of each ion to the resting potential of the membrane. 
The relevant ion concentrations are those on either side of the 
membrane. From this equation, we see that each tenfold change in 
the ion concentration ratio (Co/Ci) across the membrane alters the 
membrane potential by 62 millivolts. The resting potential can then be 
calculated by combining the individual ion gradient contributions and 
adjusting for the relative permeability for each ion. 
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Figure 12-25 Patch-clamp recording is used to monitor ion channel activity. First, a microelectrode is filled with an aqueous 
conducting solution, and its tip is pressed against the surface of the cell. (A) With gentle suction, a tight seal is formed where 

the cell membrane contacts the mouth of the microelectrode. Because of the extremely tight seal, current can enter or leave the 
microelectrode only by passing through the ion channel or channels in the patch of membrane covering its tip. (B) To expose the 
cytosolic face of the membrane, the patch of membrane held in the microelectrode can be torn from the cell. This technique makes it 
easy to alter the composition of the solution on either side of the membrane to test the effect of various solutes on channel activity. 

(C) A micrograph showing an isolated nerve cell held in a suction pipette (the tip of which is shown on the left), while a microelectrode 
is being used for patch-clamp recording. (D) The circuitry for patch-clamp recording. At the open end of the microelectrode, a metal 
wire is inserted. Current that enters the microelectrode through ion channels in the small patch of membrane covering its tip passes via 
the wire, through measuring instruments, back into the bath of medium surrounding the cell or the detached patch. (C, from T.D. Lamb, 
H.R. Matthews, and V. Torre, J. Physiol. 372:315-349, 1986. With permission from Blackwell Publishing.) 


lon Channels Randomly Snap Between Open and 
Closed States 


Measuring changes in electrical current is the main method used to 
study ion movements and ion channels in living cells. Amazingly, elec- 
trical recording techniques can detect and measure the current flowing 
through a single channel molecule. The procedure developed for doing 
this is known as patch-clamp recording, and it provides a direct and 
surprising picture of how individual ion channels behave. 


In patch-clamp recording, a fine glass tube is used as a microelectrode to 
isolate and make electrical contact with a small area of the membrane 
at the surface of the cell (Figure 12-25). When a sufficiently small area of 
membrane is trapped in the patch, sometimes only a single ion channel 
will be present. Modern electrical instruments are sensitive enough to 
monitor the ion flow through this single channel, detected as a minute 
electric current (of the order of 10-!2 ampere or 1 picoampere). 


Monitoring individual ion channels in this way revealed something 
surprising about the way they behave: even when conditions are held 
constant, the currents abruptly appear and disappear, as though an on/ 
off switch were being jiggled randomly (Figure 12-26). This behavior 
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Figure 12-26 The behavior of a single 
ion channel can be observed using the 
patch-clamp technique. The voltage (the 
membrane potential) across the isolated 
patch of membrane is held constant during 
the recording. (A) In this example, the 
neurotransmitter acetylcholine is present, 
and the membrane patch from a muscle 
cell contains a single channel protein that is 
responsive to acetylcholine (discussed later, 
see Figure 12-42). This ion channel opens 
to allow passage of positive ions when 
acetylcholine binds to the exterior face of 
the channel. But even when acetylcholine 
is bound to the channel, as is the case 
during the three channel openings shown 
here, the channel does not remain open all 
the time. Instead, it flickers between open 
and closed states. Note that how long the 
channel remains open is variable. (B) When 
acetylcholine is not present, the channel 
opens very rarely. (Courtesy of David 
Colquhoun.) 
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Figure 12-27 Different types of gated CLOSED 
ion channels respond to different types 
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indicates that the channel has moving parts and is snapping back and 
forth between open and closed conformations as the channel is knocked 
from one conformation to the other by the random thermal movements 
of the molecules in its environment. Patch-clamp recording was the 
first technique that could detect such conformational changes, and the 
picture it paints—of a jerky piece of machinery subjected to constant 
external buffeting—is now known to apply also to other proteins with 
moving parts. 


The activity of each ion channel is very much “all-or-none”: when an ion 
channel is open, it is fully open; when it is closed, it is fully closed. That 
raises a fundamental question: If ion channels randomly snap between 
open and closed conformations even when conditions on each side of 
the membrane are held constant, how can their state be regulated by 
conditions inside or outside the cell? The answer is that when the appro- 
priate conditions change, the random behavior continues but with a 
greatly changed bias: if the altered conditions tend to open the channel, 
for example, the channel will now spend a much greater proportion of its 
time in the open conformation, although it will not remain open continu- 
ously (see Figure 12-26). 


Different Types of Stimuli Influence the Opening and 
Closing of lon Channels 


There are more than a hundred types of ion channels, and even simple 
organisms can possess many different types. The human genome con- 
tains 80 genes that encode different but related Kt channels alone. Ion 
channels differ from one another primarily with respect to their ion selec- 
tivity—the type of ions they allow to pass—and their gating—the conditions 
that influence their opening and closing. For a voltage-gated channel, 
the probability of being open is controlled by the membrane potential 
(Figure 12-27A). For a ligand-gated channel, opening is controlled by 
the binding of some molecule (a ligand) to the channel (Figure 12-27B 
and C). For a mechanically-gated channel, opening is controlled by a 
mechanical force applied to the channel (Figure 12-27D). 


The auditory hair cells in the ear are an important example of cells that 
depend on mechanically-gated channels. Sound vibrations pull the chan- 
nels open, causing ions to flow into the hair cells; this ion flow sets up 
an electrical signal that is transmitted from the hair cell to the auditory 
nerve, which then conveys the signal to the brain (Figure 12-28). 
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Figure 12-28 Mechanically-gated ion channels allow us to hear. (A) A section through the organ of Corti, 
which runs the length of the cochlea, the auditory portion of the inner ear. Each auditory hair cell has a tuft of 
spiky extensions called stereocilia projecting from its upper surface. The hair cells are embedded in an epithelial 
sheet of supporting cells, which is sandwiched between the basilar membrane below and the tectorial membrane 
above. (These are not lipid bilayer membranes but sheets of extracellular matrix.) (B) Sound vibrations cause the 
basilar membrane to vibrate up and down, causing the stereocilia to tilt. Each stereocilium in the staggered array 
of stereocilia on a hair cell is attached to the next, shorter stereocilium by a fine filament. The tilting stretches 
the filaments, which pull open mechanically-gated ion channels in the stereocilium plasma membrane, allowing 
positively charged ions to enter from the surrounding fluid (Movie 12.7). The influx of ions activates the hair cells, 
which stimulate underlying nerve endings of the auditory nerve fibers that relay the auditory signal to the brain. 
The hair-cell mechanism is astonishingly sensitive: the faintest sounds we can hear have been estimated to stretch 
the filaments by an average of about 0.04 nm, which is less than the diameter of a hydrogen ion (Movie 12.8). 


Voltage-gated lon Channels Respond to the Membrane 
Potential 


Voltage-gated ion channels play a major role in propagating electrical 
signals along all nerve cell extensions, such as those that relay signals 
from our brain to our toe muscles. But voltage-gated ion channels are 
present in many other cell types, too, including muscle cells, egg cells, 
protozoans, and even plant cells, where they enable electrical signals 
to travel from one part of the plant to another, as in the leaf-closing 
response of a Mimosa pudica plant (Figure 12-29). 


Voltage-gated ion channels have domains called voltage sensors that 
are extremely sensitive to changes in the membrane potential: changes 


(A) time 0 sec (B) 1 sec (©) 3 sec (D) 5 sec 


Figure 12-29 Both mechanically-gated and voltage-gated ion channels underlie the leaf-closing response in the touch-sensitive 
plant Mimosa pudica. (A) Resting leaf. (B-D) Successive leaflet closures in response to touch. A few seconds after the leaf on the left is 
touched, its leaflets snap shut. The response involves the opening of mechanically-gated ion channels in touch-sensitive sensory cells, 
which then pass a signal to cells containing voltage-gated ion channels, generating an electric impulse. When the impulse reaches 
specialized hinge cells at the base of each leaflet, a rapid loss of water by these cells occurs, causing the leaflets to fold into a closed 
conformation suddenly and progressively down the leaf stalk (Movie 12.9). 
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The figure above shows a recording 
from a patch-clamp experiment in 
which the electrical current passing 
across a patch of membrane is 
measured as a function of time. 
The membrane patch was plucked 
from the plasma membrane of 

a muscle cell by the technique 
shown in Figure 12-25 and contains 
molecules of the acetylcholine 
receptor, which is a ligand-gated 
cation channel that is opened by 
the binding of acetylcholine to the 
extracellular face of the channel. To 
obtain a recording, acetylcholine 
was added to the solution inside the 
microelectrode. (A) Describe what 
you can learn about the channels 
from this recording. (B) How would 
the recording differ if acetylcholine 
were (i) omitted or (ii) added to the 
solution outside the microelectrode 
only? 


Figure 12-30 A typical neuron has a 
cell body, a single axon, and multiple 
dendrites. The axon conducts electrical 
signals away from the cell body toward its 
target cells, while the multiple dendrites 
receive signals from the axons of other 
neurons. The red arrows indicate the 
direction in which signals travel. During 
brain development, neurons probe their 
environment for guidance clues to extend 
axons and dendrites in appropriate 
directions to make useful connections 
(Movies 12.10 and 12.11). 
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QUESTION 12-3 


above a certain threshold value exert sufficient electrical force on these 
domains to encourage the channel to switch from its closed to its open 
conformation (see Figure 12—27A). As discussed earlier, a change in 
the membrane potential does not affect how wide the channel is open, 
but instead alters the probability that it will open. Thus, in a large patch 
of membrane containing many molecules of the channel protein, one 
might find that on average 10% of them are open at any instant when the 
membrane is at one potential, whereas 90% are open after this potential 
changes. 


When one type of voltage-gated ion channel opens, the membrane poten- 
tial of the cell can change. This in turn can activate or inactivate other 
voltage-gated ion channels. Such circuits, which couple the opening of 
ion channels to changes in membrane potential to the opening of addi- 
tional ion channels, are fundamental to all electrical signaling in cells. In 
the next section, we consider the special case of nerve cells: they—more 
than any other cell type—have made a profession of electrical signaling, 
and they employ ion channels in very sophisticated ways. 


ION CHANNELS AND NERVE CELL SIGNALING 


The fundamental task of a nerve cell, or neuron, is to receive, integrate, 
and transmit signals. Neurons carry signals from sense organs, such as 
eyes and ears, to the central nervous system—the brain and spinal cord. In 
the central nervous system, neurons signal from one to another through 
networks of enormous complexity, allowing the brain and spinal cord to 
analyze, interpret, and respond to the signals coming in from the sense 
organs. 


Every neuron consists of a cell body, which contains the nucleus and has 
a number of long, thin extensions radiating outward from it. Usually, a 
neuron has one long extension called an axon, which conducts electrical 
signals away from the cell body toward distant target cells; it also usually 
has several shorter, branching extensions called dendrites, which radi- 
ate from the cell body like antennae and provide an enlarged surface area 
to receive signals from the axons of other neurons (Figure 12-30). The 
axon commonly divides at its far end into many branches, each of which 
ends in a nerve terminal, so that the neuron’s message can be passed 
simultaneously to many target cells—muscle or gland cells or other neu- 
rons. Likewise, the branching of the dendrites can be extensive, in some 
cases sufficient to receive as many as 100,000 inputs on a single neuron 
(see Figure 12-43A). 


No matter what the meaning of the signal a neuron carries—whether it is 
visual information from the eye, a motor command to a muscle, or one 
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Figure 12-31 The squid Loligo has a nervous system that is adept 
at responding rapidly to threats in the animal's environment. 
Among the nerve cells that make up this escape system is one that 
possesses a “giant axon,” with a very large diameter. Long before 
patch clamping allowed recordings from single ion channels in small 
cells (see Figure 12-25), the squid giant axon was routinely used to 
record and study action potentials. (NOAA.) 


step in a complex network of neural processing in the brain—the form 
of the signal is always the same: it consists of changes in the electrical 
potential across the neuron’s plasma membrane. 


Action Potentials Allow Rapid Long-Distance 
Communication Along Axons 


A neuron is stimulated by a signal—typically from another neuron—deliv- 
ered to a localized site on its surface. This signal initiates a change in the 
membrane potential at that site. To transmit the signal onward, this local 
change in membrane potential has to spread from this initial site, which 
is usually on a dendrite or the cell body, to the axon terminals. There, the 
signal is relayed to the next cells in the pathway—forming a neural circuit. 
The distances covered by such circuits can be substantial: a signal that 
leaves a motor neuron in your spinal cord may have to travel a meter or 
more before it reaches a muscle in your foot. 


The local change in membrane potential generated by a signal will spread 
passively along an axon or a dendrite to adjacent regions of the plasma 
membrane. Over long distances, such passive spread is inadequate, as the 
signal rapidly becomes weaker with increasing distance from the source. 
Neurons solve this long-distance communication problem by employing 
an active signaling mechanism. In this case, a local electrical stimulus 
of sufficient strength triggers a burst of electrical activity in the plasma 
membrane that propagates rapidly along the membrane of the axon, 
continuously renewing itself all along the way. This traveling wave of 
electrical excitation, known as an action potential, or a nerve impulse, 
can carry a message, without weakening, all the way from one end of a 
neuron to the other, at speeds of up to 100 meters per second. 


The early research that established this mechanism of electrical signal- 
ing along axons was done on the giant axon of the squid (Figure 12-31). 
This axon has such a large diameter that it is possible to record its elec- 
trical activity from an electrode inserted directly into it (How We Know, 
pp. 412-413). From such studies, it was deduced how action potentials 
are the direct consequence of the properties of voltage-gated ion chan- 
nels in the axonal plasma membrane, as we now explain. 


Action Potentials Are Mediated by Voltage-gated Cation 
Channels 


When a neuron is stimulated, the membrane potential of the plasma 
membrane shifts to a less negative value (that is, toward zero). If this 
depolarization is sufficiently large, it will cause voltage-gated Nat 
channels in the membrane to open transiently at the site. As these chan- 
nels flicker open, they allow a small amount of Na* to enter the cell down 
its steep electrochemical gradient. The influx of positive charge depo- 
larizes the membrane further (that is, it makes the membrane potential 
even less negative), thereby opening additional voltage-gated Na+ chan- 
nels and causing still further depolarization. This process continues in 
an explosive, self-amplifying fashion until, within about a millisecond, 
the membrane potential in the local region of the neuron’s plasma 
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QUESTION 12-4 


Using the Nernst equation and 

the ion concentrations given in 
Table 12-1 (p. 391), calculate the 
equilibrium membrane potential of 
K+ and Nat—that is, the membrane 
potential where there would be no 
net movement of the ion across the 
plasma membrane (assume that the 
concentration of intracellular Na* is 
10 mM). What membrane potential 
would you predict in a resting animal 
cell? Explain your answer. What 
would happen if a large number 

of Na* channels suddenly opened, 
making the membrane much more 
permeable to Na* than to K+? 
(Note that because few ions need 
to move across the membrane 

to change drastically the charge 
distribution across that membrane, 
you can safely assume that the 

ion concentrations on either side 

of the membrane do not change 
significantly.) What would you 
predict would happen next if the 
Na* channels closed again? 


™ HOW WE KNOW 


SQUID REVEAL SECRETS OF MEMBRANE EXCITABILITY 


Each spring, Loligo pealei migrate to the shallow 
waters off Cape Cod on the eastern coast of the United 
States. There they spawn, launching the next genera- 
tion of squid. But more than just meeting and breeding, 
these animals provide neuroscientists summering 
at the Marine Biological Laboratory in Woods Hole, 
Massachusetts, with a golden opportunity to study the 
mechanism of electrical signaling along nerve axons. 


Like most animals, squid survive by catching prey and 
escaping predators. Fast reflexes and an ability to accel- 
erate rapidly and make sudden changes in swimming 
direction help them avoid danger while chasing down a 
decent meal. Squid derive their speed and agility from a 
specialized biological jet propulsion system: they draw 
water into their mantle cavity and then contract their 
muscular body wall to expel the collected water rapidly 
through a tubular siphon, thus propelling themselves 
through the water. 


Controlling such quick and coordinated muscle contrac- 
tion requires a nervous system that can convey signals 
with great speed down the length of the animal’s body. 
Indeed, Loligo pealei possesses some of the largest nerve 
cell axons found in nature. Squid giant axons can reach 
10 cm in length and are over 100 times the diameter of 
a mammalian axon—about the width of a pencil lead. 
Generally speaking, the larger the diameter of an axon, 
the more rapidly signals can travel along its length. 


In the 1930s, scientists first started to take advantage 
of the squid giant axon for studying the electrophysi- 
ology of the nerve cell. Because of its relatively large 
size, an investigator can isolate an individual axon and 
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insert an electrode into it to measure the axon’s mem- 
brane potential and monitor its electrical activity. This 
experimental system allowed researchers to address a 
variety of questions, including which ions are impor- 
tant for establishing the resting membrane potential 
and for initiating and propagating an action potential, 
and how changes in the membrane potential control ion 
permeability. 


Set-up for action 


Because the squid axon is so long and wide, an electrode 
made from a glass capillary tube containing a conduct- 
ing solution can be thrust down the axis of the isolated 
axon So that its tip lies deep in the cytoplasm. This set-up 
allowed investigators to measure the voltage difference 
between the inside and the outside of the axon—that is, 
the membrane potential—as an action potential sweeps 
past the tip of the electrode (Figure 12-32). The action 
potential itself would be triggered by applying a brief 
electrical stimulus to one end of the axon. It didn’t mat- 
ter which end was stimulated, as the action potential 
could travel in either direction; it also didn’t matter how 
big the stimulus was, as long as it exceeded a certain 
threshold (see Figure 12-35), indicating that an action 
potential is an “all or nothing” response. 


Once researchers could reliably generate and meas- 
ure an action potential, they could use the preparation 
to answer other questions about membrane excitabil- 
ity. For example, which ions are critical for an action 
potential? The three most plentiful ions, both inside 
and outside an axon, are Nat, Kt, and CI. Do they have 
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Figure 12-32 Scientists can study nerve cell excitability using an isolated axon from 
squid. (A) An electrode can be inserted into the cytoplasm (axoplasm) of a squid giant axon 
to (B) measure the resting membrane potential and monitor action potentials induced when 


the axon is electrically stimulated. 
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Figure 12-33 The cytoplasm in a squid axon can be removed and replaced with an artificial solution of pure ions. (A) The axon 
cytoplasm (axoplasm) is extruded using a rubber roller. (B) A perfusion fluid containing the desired concentration of ions is pumped 


gently through the emptied-out axon. 


equal importance when it comes to the action potential? 
Because the squid axon is so large and strong, investi- 
gators could extrude the cytoplasm from the axon like 
toothpaste from a tube (Figure 12-33A). The emptied- 
out axon could then be reinflated by filling it with a pure 
solution of Na*, K*, or Cl (Figure 12-33B). Thus, the 
ions inside the axon and in the bath solution could be 
varied independently (see Figure 12-32A). Using this 
set-up, the researchers discovered that the axon would 
generate a normal action potential if, and only if, the 
concentrations of Nat and K* approximated the natu- 
ral concentrations found inside and outside the cell. 
Thus, they concluded that the cell components crucial 
to the action potential are the plasma membrane, Nat 
and K+ ions, and the energy provided by the concentra- 
tion gradients of these ions across the membrane; all 
other components, including other sources of metabolic 
energy, were presumably removed when the axon was 
emptied and refilled. 


Channel traffic 


Once Nat and K* had been singled out as critical for an 
action potential, the questions then became: What does 
each of these ions contribute to the action potential? 
How permeable is the membrane to each, and how does 
the membrane permeability change as an action poten- 
tial sweeps by? Again, the squid giant axon provided 
some answers. The concentrations of Na+ and K* inside 
and outside the axon could be altered, and the effects 
of these changes on the membrane potential could be 
measured directly. From such studies, it was determined 
that, at rest, the membrane potential of an axon is close 
to the equilibrium potential for Kt: when the external 
concentration of K* was varied, the resting potential of 
the axon changed roughly in accordance with the Nernst 
equation (see Figure 12-24). The results suggested that 
at rest, the membrane is chiefly permeable to K*; we 
now know that K+ leak channels provide the main path- 
way that these ions can take through the resting plasma 
membrane. 


The situation for Nat is very different. When the external 
concentration of Na* was varied, there was no effect on 
the resting potential of the axon. However, the height of 
the peak of the action potential varied with the concen- 
tration of Na+ outside the axon (Figure 12-34). During 
the action potential, therefore, the membrane appeared 
to be chiefly permeable to Na+, presumably as the result 
of the opening of Na* channels. In the aftermath of the 
action potential, the Nat permeability decreased and the 
membrane potential reverted to a negative value, which 
depended on the external concentration of Kt. As the 
membrane lost its permeability to Nat, it became even 
more permeable to K+ than before, presumably because 
additional K* channels opened, accelerating the reset- 
ting of the membrane potential to the resting state, and 
readying the membrane for the next action potential. 


These studies on the squid giant axon made an enor- 
mous contribution to our understanding of nerve cell 
excitability, and the researchers who made these discov- 
eries in the 1940s and 1950s—Alan Hodgkin and Andrew 
Huxley—received a Nobel Prize in 1963. However, it was 
years before the various ion channel proteins that they 
had hypothesized to exist would be biochemically iden- 
tified. We now know the three-dimensional structures of 
many of these channel proteins, allowing us to marvel 
at the fundamental beauty of these molecular machines. 


time (msec) 


Figure 12-34 The shape of the action potential depends 

on the concentration of Na* outside the squid axon. 

Shown here are action potentials recorded when the external 
medium contains 100%, 50%, or 33% of the normal extracellular 
concentration of Na*. 
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Figure 12-35 An action potential is 
triggered by a depolarization of a 
neuron’s plasma membrane. The resting 


membrane potential in this neuron is —60 mV, 


and a stimulus that depolarizes the plasma 
membrane to about —40 mV (the threshold 
potential) is applied. This depolarizing 
stimulus is sufficient to open voltage- 

gated Na* channels in the membrane and 
thereby trigger an action potential. As the 
membrane rapidly depolarizes further, the 
membrane potential (red curve) swings past 
zero, reaching +40 mV before it returns 

to its resting negative value as the action 
potential terminates. The green curve shows 
how the membrane potential would simply 
have relaxed back to the resting value after 
the initial depolarizing stimulus if there had 
been no amplification by voltage-gated ion 
channels in the plasma membrane. 


Figure 12-36 A voltage-gated Nat 
channel can flip from one conformation 
to another, depending on the membrane 
potential. When the membrane is at rest 
and highly polarized, positively charged 
amino acids in the voltage sensors of the 
channel (red bars) are oriented by the 
membrane potential in a way that keeps 
the channel in its closed conformation. 
When the membrane is depolarized, 

the voltage sensors shift, changing the 
channel's conformation so the channel has 
a high probability of opening. But in the 
depolarized membrane, the inactivated 
conformation is even more stable than the 
open conformation, and so, after a brief 
period spent in the open conformation, 
the channel becomes temporarily 
inactivated and cannot open. The red 
arrows indicate the sequence that follows 
a sudden depolarization, and the black 
arrow indicates the return to the original 
conformation after the membrane has 
repolarized. 
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membrane has shifted from its resting value of about -60 mV to about 
+40 mV (Figure 12-35). 


The voltage of +40 mV is close to the membrane potential at which the 
electrochemical driving force for movement of Na* across the membrane 
is zero—that is, the effects of the membrane potential and the concentra- 
tion gradient for Na* are equal and opposite; therefore Na* has no further 
tendency to enter or leave the cell. 


If these voltage-gated channels continued to respond to the depolarized 
membrane potential, the cell would get stuck with most of its Na+ chan- 
nels open. The cell is saved from this fate because voltage-gated Na+ 
channels have an automatic inactivating mechanism—a kind of “timer” 
that causes them to rapidly adopt (within a millisecond or so) a special 
inactivated conformation in which the channel is closed, even though 
the membrane is still depolarized. The Na+ channels remain in this inac- 
tivated state until the membrane potential has returned to its resting, 
negative value. A schematic illustration of these three distinct states of 
the voltage-gated Na+ channel—closed, open, and inactivated—is shown 
in Figure 12-36. How they contribute to the rise and fall of an action 
potential is shown in Figure 12-37. 


During an action potential, voltage-gated Na* channels do not act alone. 
The depolarized axonal membrane is helped to return to its resting 
potential by the opening of voltage-gated K* channels. These also open 
in response to depolarization, but not as promptly as the Nat channels, 
and they stay open as long as the membrane remains depolarized. As the 
local depolarization reaches its peak, K* ions (carrying positive charge) 
therefore start to flow out of the cell, down their electrochemical gradient, 
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through these newly opened K+ channels—temporarily unhindered by the 
negative membrane potential that normally restrains them in the rest- 
ing cell. The rapid outflow of K* through the voltage-gated K* channels 
brings the membrane back to its resting state much more quickly than 
could be achieved by K* outflow through the K* leak channels alone. 


Once it begins, the self-amplifying depolarization of a small patch of 
plasma membrane quickly spreads outward: the Nat flowing in through 
open Na* channels begins to depolarize the neighboring region of the 
membrane, which then goes through the same self-amplifying cycle. In 
this way, an action potential spreads outward as a traveling wave from 
the initial site of depolarization, eventually reaching the axon terminals 
(Figure 12-38). 


Na* CHANNELS 


qose —— nacrivareD = REN toseD 


Na* 


Nat axon plasma membrane 
yY . 


time = 0 (action potential triggered) 


Na* CHANNELS 


REPOLARIZED 


EÇ SIMIC SIMIC JimC) 


time = 1 millisecond (action potential travels) 


Figure 12-37 Voltage-gated Na* channels 
change their conformation during an 
action potential. In this example, the 
action potential is triggered by a brief pulse 
of electric current (arrow), which partially 
depolarizes the membrane, as shown in the 
plot of membrane potential versus time. 
The course of the action potential reflects 
the opening and subsequent inactivation 

of voltage-gated Na* channels, as shown 
(top). Even if restimulated, the plasma 
membrane cannot produce a second action 
potential until the Na* channels have 
returned from the inactivated to the closed 
conformation (see Figure 12-36). Until then, 
the membrane is resistant, or refractory, to 
stimulation. 


QUESTION 12-5 


Explain as precisely as you can, but 
in no more than 100 words, the ionic 
basis of an action potential and how 
it is passed along an axon. 


Figure 12-38 An action potential 
propagates along the length of an 
axon. The changes in the Na* channels 
and the consequent flow of Na* across 
the membrane (red arrows) alters the 
membrane potential and gives rise to 
the traveling action potential, as shown 
here and in Movie 12.12. The region of 
the axon with a depolarized membrane 
is shaded in blue. Note that an action 
potential can only travel forward; that 

is, away from the site of depolarization. 
This is because Na* channel inactivation 
in the aftermath of an action potential 
prevents the advancing front of 
depolarization from spreading backward 
(see also Figure 12-37). 
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Once an action potential has passed, Na+ pumps in the axon plasma 
membrane labor to restore the Nat and K+ ion gradients to their levels 
in the resting cell. The human brain consumes 20% of the total energy 
generated from the metabolism of food, mostly to power these pumps. 


Voltage-gated Ca?+ Channels in Nerve Terminals 
Convert an Electrical Signal into a Chemical Signal 


When an action potential reaches the nerve terminals at the end of an 
axon, the signal must somehow be relayed to the target cells that the 
terminals contact—usually neurons or muscle cells. The signal is trans- 
mitted to the target cells at specialized junctions known as synapses. 
At most synapses, the plasma membranes of the cells transmitting and 
receiving the message—the presynaptic and the postsynaptic cells, respec- 
tively—are separated from each other by a narrow synaptic cleft (typically 
20 nm across), which the electrical signal cannot cross. To transmit the 
message across this gap, the electrical signal is converted into a chemical 
signal, in the form of a small, secreted signal molecule called a neuro- 
transmitter. Neurotransmitters are stored in the nerve terminals within 
membrane-enclosed synaptic vesicles (Figure 12-39). 


When an action potential reaches the nerve terminal, some of the 
synaptic vesicles fuse with the plasma membrane, releasing their neu- 
rotransmitter into the synaptic cleft. This link between the arrival of an 
action potential and the secretion of neurotransmitter involves the acti- 
vation of yet another type of voltage-gated cation channel: voltage-gated 
Ca** channels located in the plasma membrane of the presynaptic nerve 
terminal. Because the Ca% concentration outside the nerve terminal is 
more than 1000 times greater than the free Ca?* concentration in its cyto- 
sol (see Table 12-1), Ca?* rushes into the nerve terminal through the open 
channels. The resulting increase in Ca?* concentration in the cytosol of 
the terminal immediately triggers the fusion of synaptic vesicles with the 
plasma membrane, which releases the neurotransmitter into the synaptic 
cleft. Thanks to these voltage-gated Ca? channels, the electrical signal 
has now been converted into a chemical signal (Figure 12-40). 
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Figure 12-39 Neurons connect to their target cells at synapses. (A) An electron micrograph and (B) a drawing of a cross section 

of two nerve terminals (yellow) forming synapses on a single nerve cell dendrite (blue) in the mammalian brain. Neurotransmitters 

carry the signal across the synaptic cleft that separates the presynaptic and postsynaptic cells. The neurotransmitter in the presynaptic 
terminal is contained within synaptic vesicles, which release neurotransmitter into the synaptic cleft. Note that both the presynaptic and 
postsynaptic membranes are thickened and highly specialized at the synapse. (A, courtesy of Cedric Raine.) 
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an Electrical Signal 


The released neurotransmitter rapidly diffuses across the synaptic cleft 
and binds to neurotransmitter receptors concentrated in the plasma mem- 
brane of the postsynaptic target cell. Once released, neurotransmitters 
are rapidly removed from the synaptic cleft—either by enzymes that 
destroy them or by pumps that return them to the nerve terminal or that 
transport them into neighboring non-neuronal cells. This rapid removal 
of the neurotransmitter limits the duration and spread of the signal and 
ensures that when the presynaptic cell falls quiet, the postsynaptic cell 
will do the same. 


Neurotransmitter receptors can be of various types; some mediate rela- 
tively slow effects in the target cell, whereas others trigger more rapid 
responses. Rapid responses—on a time scale of milliseconds—depend on 
receptors that are transmitter-gated ion channels (also called ion-chan- 
nel-coupled receptors). These constitute a subclass of ligand-gated ion 
channels (see Figure 12-27B), and their function is to convert the chemi- 
cal signal carried by a neurotransmitter back into an electrical signal. The 
channels open transiently in response to the binding of the neurotrans- 
mitter, thus changing the ion permeability of the postsynaptic membrane. 
This in turn causes a change in the membrane potential (Figure 12-41). 
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Figure 12-40 An electrical signal is 
converted into a secreted chemical 
signal at a nerve terminal. When 

an action potential reaches a nerve 
terminal, it opens voltage-gated Ca?* 
channels in the plasma membrane, 
allowing Ca?* to flow into the terminal. 
The increased Ca?* in the nerve terminal 
stimulates the synaptic vesicles to fuse 
with the plasma membrane, releasing 
their neurotransmitter into the synaptic 
cleft—a process called exocytosis 
(discussed in Chapter 15). 
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Figure 12-41 A chemical signal is 
= —— activated receptor 


converted into an electrical signal by 
postsynaptic transmitter-gated ion 
channels at a synapse. The released 
neurotransmitter binds to and opens the 
transmitter-gated ion channels in the 
plasma membrane of the postsynaptic cell. 
The resulting ion flows alter the membrane 
potential of the postsynaptic cell, thereby 
converting the chemical signal back into an 
electrical one (Movie 12.13). 
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Figure 12-42 The acetylcholine receptor in the plasma membrane of vertebrate skeletal muscle cells opens 
when it binds the neurotransmitter acetylcholine. (A) This transmitter-gated ion channel is composed of five 
transmembrane protein subunits, two of which (green) are identical. The subunits combine to form a transmitter- 
gated aqueous pore across the lipid bilayer. There are two acetylcholine-binding sites, one formed by parts of a 
green and blue subunit, the other by parts of a green and orange subunit, as shown. (B) The closed conformation. 
The blue subunit has been removed here and in (C) to show the interior of the pore. Negatively charged amino acid 
side chains at either end of the pore (indicated here by red minus signs) ensure that only positively charged ions, 
mainly Na* and K*, can pass. But when acetylcholine is not bound and the channel is in its closed conformation, 
the pore is occluded (blocked) by hydrophobic amino acid side chains in the region called the gate. (C) The open 
conformation. When acetylcholine, released by a motor neuron, binds to both binding sites, the channel undergoes 
a conformational change; the hydrophobic side chains move apart and the gate opens, allowing Na* to flow across 
the membrane down its electrochemical gradient, depolarizing the membrane. Even with acetylcholine bound, the 
channel flickers randomly between the open and closed states (see Figure 12-26); without acetylcholine bound, the 
channel rarely opens. 


If the change is large enough, the postsynaptic membrane will depolarize 


and trigger an action potential in the postsynaptic cell. 
QUESTION 12—6 . ON O 
A well-studied example of a neurotransmitter in action is found at the 


In the disease myasthenia gravis, the neuromuscular junction—the specialized synapse formed between 
human body makes—by mistake— a motor neuron and a skeletal muscle cell. In vertebrates, the neuro- 
antibodies to its own acetylcholine transmitter acetylcholine stimulates muscle contraction by binding to the 
receptor molecules. These acetylcholine receptor, a transmitter-gated ion channel in the muscle cell’s 
antibodies bind to and inactivate membrane (Figure 12-42). However, not all neurotransmitters excite the 
acetylcholine receptors on the postsynaptic cell, as we consider next. 

plasma membrane of muscle cells. 

Uo e esos ta Gove stariog Neurotransmitters Can Be Excitatory or Inhibitory 
progressive weakening of the 

muscles of people affected. Early Neurotransmitters can either excite or inhibit a postsynaptic cell, and it 
on, they may have difficulty opening is the character of the receptor that recognizes the neurotransmitter that 
their eyelids, for example, and, in an determines how the postsynaptic cell will respond. The chief receptors 
animal model of the disease, rabbits for excitatory neurotransmitters, such as acetylcholine and glutamate, 


have difficulty holding their ears up. 
As the disease progresses, most 
muscles weaken, and people with 
myasthenia gravis have difficulty 
speaking and swallowing. Eventually, 


are ligand-gated cation channels. When a neurotransmitter binds, these 
channels open to allow an influx of Na*, which depolarizes the plasma 
membrane and thus tends to activate the postsynaptic cell, encouraging 
it to fire an action potential. By contrast, the main receptors for inhibitory 


impaired breathing can cause neurotransmitters, such as y-aminobutyric acid (GABA) and glycine, are 
death. Explain which step of muscle ligand-gated Cl channels. When neurotransmitters bind, these channels 
function is affected. open, allowing CF to enter the cell; this influx of Cl inhibits the postsyn- 


aptic cell by making its plasma membrane harder to depolarize. 
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TABLE 12-3 SOME EXAMPLES OF ION CHANNELS 


Typical Location 


plasma membrane of 
most animal cells 


lon Channel 


K* leak channel maintenance of resting 


membrane potential 


Voltage-gated Na* plasma membrane of 
channel nerve cell axon 


generation of action 
potentials 


return of membrane to 
resting potential after 
initiation of an action 
potential 


Voltage-gated K* plasma membrane of 
channel nerve cell axon 


stimulation of 
neurotransmitter release 


Voltage-gated Ca?* plasma membrane of 
channel nerve terminal 


Acetylcholine receptor plasma membrane 
(acetylcholine-gated of muscle cell (at 
cation channel) neuromuscular junction) 


excitatory synaptic signaling 


Glutamate receptor 
(glutamate-gated 
cation channel) 


plasma membrane of 
many neurons 
(at synapses) 


excitatory synaptic signaling 


GABA receptor plasma membrane of inhibitory synaptic signaling 
(GABA-gated many neurons 
CI- channel) (at synapses) 


plasma membrane of 
many neurons 
(at synapses) 


Glycine receptor inhibitory synaptic signaling 
(glycine-gated 


CF channel) 


detection of sound 
vibrations 


Mechanically-gated auditory hair cell in 
cation channel inner ear 


Toxins that bind to any of these excitatory or inhibitory neurotransmitter 
receptors can have dramatic effects on an animal—or a human. Curare, 
for example, causes muscle paralysis by blocking excitatory acetylcholine 
receptors at the neuromuscular junction. This drug was used by South 
American Indians to make poison arrows and is still used by surgeons 
to relax muscles during an operation. By contrast, strychnine—a com- 
mon ingredient in rat poisons—causes muscle spasms, convulsions, and 
death by blocking inhibitory glycine receptors on neurons in the brain 
and spinal cord. 


The locations and functions of the ion channels discussed in this chapter 
are summarized in Table 12-3. 


Most Psychoactive Drugs Affect Synaptic Signaling by 
Binding to Neurotransmitter Receptors 


Many drugs used in the treatment of insomnia, anxiety, depression, and 
schizophrenia act by binding to transmitter-gated ion channels in the 
brain. Sedatives and tranquilizers such as barbiturates, Valium, Ambien, 
and Restoril, for example, bind to GABA-gated Cl channels. Their bind- 
ing makes the channels easier to open by GABA, rendering the neuron 
more sensitive to GABA’s inhibitory action. By contrast, the antidepres- 
sant Prozac blocks the Nat-driven symport responsible for the reuptake 
of the excitatory neurotransmitter serotonin, increasing the amount of 
serotonin available in the synapses that use it. This drug has changed the 
lives of many people who suffer from depression—although why boost- 
ing serotonin can elevate mood is still unknown. 


in the plasma membrane of a 


postsynaptic neuron, why does this 
make it harder for an excitatory 
neurotransmitter to excite the 


neuron? 
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QUESTION 12-7 


When an inhibitory neurotransmitter 
such as GABA opens CI channels 
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Figure 12-43 Thousands of synapses 
form on the cell body and dendrites of a 
motor neuron in the spinal cord. (A) Many 
thousands of nerve terminals synapse on 
this neuron, delivering signals from other 
parts of the animal to control the firing of 
action potentials along the neuron’s axon. 
(B) A rat nerve cell in culture. Its cell body 
and dendrites (green) are stained with a 
fluorescent antibody that recognizes a 
cytoskeletal protein. Thousands of axon 
terminals (red) from other nerve cells (not 
visible) make synapses on the cell’s surface; 
they are stained with a fluorescent antibody 
that recognizes a protein in synaptic 
vesicles, which are located in the nerve 
terminals (see Figure 12-39). (B, courtesy of 
Olaf Mundigl and Pietro de Camilli.) 
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The number of distinct types of neurotransmitter receptors is very large, 
although they fall into a small number of families. There are, for example, 
many subtypes of acetylcholine, glutamate, GABA, glycine, and serotonin 
receptors; they are usually located on different neurons and often dif- 
fer only subtly in their electrophysiological properties. With such a large 
variety of receptors, it may be possible to design a new generation of 
psychoactive drugs that will act more selectively on specific sets of neu- 
rons to mitigate the mental illnesses that devastate so many people’s 
lives. One percent of the human population, for example, have schiz- 
ophrenia, another 1% have bipolar disorder, about 1% have an autistic 
disorder, and many more suffer from anxiety or depressive disorders. The 
fact that these disorders are so prevalent suggests that the complexity of 
synaptic signaling may make the brain especially vulnerable to genetic 
alterations. But complexity also provides some distinct advantages, as 
we discuss next. 


The Complexity of Synaptic Signaling Enables Us to 
Think, Act, Learn, and Remember 


For a process so critical for animal survival, the mechanism that gov- 
erns synaptic signaling seems unnecessarily cumbersome, as well as 
error-prone. For a signal to pass from one neuron to the next, the nerve 
terminal of the presynaptic cell must convert an electrical signal into a 
secreted chemical. This chemical signal must then diffuse across the syn- 
aptic cleft so that a postsynaptic cell can convert it back into an electric 
one. Why would evolution have favored such an apparently inefficient 
and vulnerable method for passing a signal between two cells? It would 
seem more efficient and robust to have a direct electrical connection 
between them—or to do away with the synapse altogether and use a 
single continuous cell. 


The value of synapses that rely on secreted chemical signals becomes 
clear when we consider how they function in the context of the nerv- 
ous system—an elaborate network of neurons, interconnected by many 
branching circuits, performing complex computations, storing memories, 
and generating plans for action. To carry out these functions, neurons 
have to do more than merely generate and relay signals: they must also 
combine them, interpret them, and record them. Chemical synapses 
make these activities possible. A motor neuron in the spinal cord, for 
example, receives inputs from hundreds or thousands of other neurons 
that make synapses on it (Figure 12-43). Some of these signals tend to 


— dendrites 


presynaptic cae y 
nerve ; 
terminals 


(A) 


(B) 
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stimulate the neuron, while others inhibit it. The motor neuron has to 
combine all of the information it receives and react, either by stimulating 
a muscle to contract or by remaining quiet. 


This task of computing an appropriate output from a babble of inputs is 
achieved by a complicated interplay between different types of ion chan- 
nels in the neuron’s plasma membrane. Each of the hundreds of types of 
neurons in the brain has its own characteristic set of receptors and ion 
channels that enables the cell to respond in a particular way to a certain 
set of inputs and thus to perform its specialized task. 


Ion channels are thus critical components of the machinery that ena- 
bles us to act, think, feel, speak, learn, and remember. Given that these 
channels operate within neuronal circuits that are dauntingly complex, 
will we ever be able to deeply understand the molecular mechanisms 
that direct the complex behaviors of organisms such as ourselves? 
Although cracking this problem in humans is still far in the future, we 
now have increasingly powerful ways to study the neural circuits—and 
molecules—that underlie behavior in experimental animals. One of the 
most promising techniques makes use of a different type of ion channel, 
a light-gated ion channel borrowed from unicellular algae, as we now 
discuss. 


Light-gated lon Channels Can Be Used to Transiently 
Activate or Inactivate Neurons in Living Animals 


Photosynthetic green algae use light-gated channels to sense and navigate 
toward sunlight. In response to blue light, one of these channels—called 
channelrhodopsin—allows Na* to flow into the cell. This depolarizes the 
plasma membrane and, ultimately, modulates the beating of the flagella 
that the organism uses to swim. Although these channels are peculiar to 
unicellular green algae, they function perfectly well when they are arti- 
ficially transferred into other cell types, thereby rendering the recipient 
cells responsive to light. 


Because nerve cells are also activated by a depolarizing influx of Nat, as 
we have discussed (see Figure 12-38), channelrhodopsin can be used to 
manipulate the activity of neurons and neural circuits—including those in 
living animals. In one particularly stunning experiment, the channelrho- 
dopsin gene was introduced into a select subpopulation of neurons in the 
mouse hypothalamus—a brain region involved in many functions, includ- 
ing aggression. The activity of these neurons could then be controlled by 
light that was provided by a thin, optic fiber implanted in the animal's 
brain. When the channels were illuminated, the mouse would launch an 
attack on any object in its path—including other mice or, in one comical 
instance, an inflated rubber glove. When the light was switched off, the 
neurons once again fell silent, and the mouse’s behavior would immedi- 
ately return to normal (Figure 12-44 and Movie 12.14). 


Because the approach relies on a light-gated channel that is introduced 
into cells by genetic engineering techniques (discussed in Chapter 10), 
the method has been dubbed optogenetics. This tool is revolutionizing 
neurobiology, allowing investigators to dissect the neural circuits that 
govern even the most complex behaviors in a variety of experimental 
animals, from fruit flies to monkeys. But its implications extend beyond 
the laboratory. As genetic studies continue to identify genes associated 
with various human neurological and psychiatric disorders, the ability 
to exploit light-gated ion channels to study where and how these genes 
function in model organisms promises to greatly advance our under- 
standing of the molecular and cellular basis of our own behavior. 


421 


422 CHAPTER 12 Transport Across Cell Membranes 


Figure 12-44 Light-gated ion channels 
can control the activity of specific 
neurons in a living animal. (A) In 

this experiment, the gene encoding 
channelrhodopsin was introduced 

into a subset of neurons in the mouse 
hypothalamus. (B) When the neurons 
are exposed to blue light using a tiny 
fiber-optic cable implanted into the 
animal's brain, channelrhodopsin opens, 
depolarizing and stimulating the channel- 
containing neurons. (C) When the light 
is switched on, the mouse immediately 
becomes aggressive; when the light is 
switched off, its behavior immediately 
returns to normal. (C, from D. Lin et al., 
Nature 470:221-226, 2011.) 
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ESSENTIAL CONCEPTS 


e The lipid bilayer of cell membranes is highly permeable to small, non- 
polar molecules such as oxygen and carbon dioxide and, to a lesser 
extent, to very small, polar molecules such as water. It is highly 
impermeable to most large, water-soluble molecules and to all ions. 


e Transfer of nutrients, metabolites, and inorganic ions across cell 
membranes depends on membrane transport proteins. 


e Cell membranes contain a variety of transport proteins that function 
either as transporters or channels, each responsible for the transfer 
of a particular type of solute. 


e Channel proteins form pores across the lipid bilayer through which 
solutes can passively diffuse. 


e Both transporters and channels can mediate passive transport, in 
which an uncharged solute moves spontaneously down its concen- 
tration gradient. 


e For the passive transport of a charged solute, its electrochemical 
gradient determines its direction of movement, rather than its con- 
centration gradient alone. 


e Transporters can act as pumps to mediate active transport, in which 
solutes are moved uphill against their concentration or electrochemi- 
cal gradients; this process requires energy that is provided by ATP 
hydrolysis, a downhill flow of Na* or H* ions, or sunlight. 


e Transporters transfer specific solutes across a membrane by under- 
going conformational changes that expose the solute-binding site 
first on one side of the membrane and then on the other. 


e The Nat pump in the plasma membrane of animal cells is an ATPase; 
it actively transports Na* out of the cell and K* in, maintaining a steep 
Nat gradient across the plasma membrane that is used to drive other 
active transport processes and to convey electrical signals. 
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e fon channels allow inorganic ions of appropriate size and charge to 
cross the membrane. Most are gated and open transiently in response 
to a specific stimulus. 


e Even when activated by a specific stimulus, ion channels do not 
remain continuously open: they flicker randomly between open and 
closed conformations. An activating stimulus increases the propor- 
tion of time that the channel spends in the open state. 


e The membrane potential is determined by the unequal distribution of 
charged ions on the two sides of a cell membrane; it is altered when 
these ions flow through open ion channels in the membrane. 


e In most animal cells, the negative value of the resting membrane 
potential across the plasma membrane depends mainly on the Kt 
gradient and the operation of K*-selective leak channels; at this rest- 
ing potential, the driving force for the movement of K* across the 
membrane is almost zero. 


e Neurons produce electrical impulses in the form of action potentials, 
which can travel long distances along an axon without weaken- 
ing. Action potentials are propagated by voltage-gated Na* and Kt 
channels that open sequentially in response to depolarization of the 
plasma membrane. 


e Voltage-gated Ca?* channels in a nerve terminal couple the arrival 
of an action potential to neurotransmitter release at a synapse. 
Transmitter-gated ion channels convert this chemical signal back 
into an electrical one in the postsynaptic target cell. 


e Excitatory neurotransmitters open transmitter-gated cation channels 
that allow the influx of Na*, which depolarizes the postsynaptic cell’s 
plasma membrane and encourages the cell to fire an action potential. 
Inhibitory neurotransmitters open transmitter-gated Cl channels in 
the postsynaptic cell’s plasma membrane, making it harder for the 
membrane to depolarize and fire an action potential. 


e Complex sets of nerve cells in the human brain exploit all of the 
above mechanisms to make human behaviors possible. 


KEY TERMS 


action potential Nernst equation 

active transport nerve terminal 

antiport neuron 

axon neurotransmitter 

Ca?* pump (or Ca2+ ATPase) optogenetics 

channel osmosis 

dendrite passive transport 
depolarization patch-clamp recording 
electrochemical gradient pump 

gradient-driven pump resting membrane potential 
H* pump (or H* ATPase) symport 

ion channel synapse 

K+ leak channel synaptic vesicle 
ligand-gated channel transmitter-gated ion channel 
mechanically-gated channel transporter 

membrane potential voltage-gated channel 
membrane transport protein voltage-gated Na* channel 


Na* pump (or Na*-K* ATPase) 
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QUESTIONS 


QUESTION 12-8 


The diagram in Figure 12-9 shows a transporter that 
mediates the passive transfer of a solute down its 
concentration gradient across the membrane. How would 
you need to change the diagram to convert the transporter 
into a pump that moves the solute up its concentration 
gradient by hydrolyzing ATP? Explain the need for each of 
the steps in your new illustration. 


QUESTION 12-9 


Which of the following statements are correct? Explain your 
answers. 


A. The plasma membrane is highly impermeable to all 
charged molecules. 


B. Channels have specific binding pockets for the solute 
molecules they allow to pass. 


C. Transporters allow solutes to cross a membrane at much 
faster rates than do channels. 


D. Certain H* pumps are fueled by light energy. 


E. The plasma membrane of many animal cells contains 
open K* channels, yet the Kt concentration in the cytosol is 
much higher than outside the cell. 


F. Asymport would function as an antiport if its orientation 
in the membrane were reversed (i.e., if the portion of the 
molecule normally exposed to the cytosol faced the outside 
of the cell instead). 


G. The membrane potential of an axon temporarily 
becomes more negative when an action potential excites it. 
QUESTION 12-10 

List the following compounds in order of decreasing lipid- 
bilayer permeability: RNA, Ca?+, glucose, ethanol, N2, water. 
QUESTION 12-11 


Name at least one similarity and at least one difference 
between the following (it may help to review the definitions 
of the terms using the Glossary): 


Symport and antiport 

Active transport and passive transport 

Membrane potential and electrochemical gradient 
. Pump and transporter 


Axon and telephone wire 


mmond wD > 


Solute and ion 


QUESTION 12-12 


Discuss the following statement: “The differences between 
a channel and a transporter are like the differences between 
a bridge and a ferry.” 


QUESTION 12-13 

The neurotransmitter acetylcholine is made in the cytosol 
and then transported into synaptic vesicles, where its 
concentration is more than 100-fold higher than in the 


cytosol. When synaptic vesicles are isolated from neurons, 
they can take up additional acetylcholine added to the 
solution in which they are suspended, but only when ATP 

is present. Nat ions are not required for the uptake, but, 
curiously, raising the pH of the solution in which the synaptic 
vesicles are suspended increases the rate of uptake. 
Furthermore, transport is inhibited when drugs are added 
that make the membrane permeable to H* ions. Suggest a 
mechanism that is consistent with all of these observations. 


QUESTION 12-14 


The resting membrane potential of a typical animal cell is 
about -70 mV, and the thickness of a lipid bilayer is about 
4.5 nm. What is the strength of the electric field across the 
membrane in V/cm? What do you suppose would happen 
if you applied this field strength to two metal electrodes 
separated by a 1-cm air gap? 


QUESTION 12-15 


Phospholipid bilayers form sealed, spherical vesicles 

in water (discussed in Chapter 11). Assume you have 
constructed lipid vesicles that contain Nat pumps as 

the sole membrane protein, and assume for the sake of 
simplicity that each pump transports one Na* one way and 
one K* the other way in each pumping cycle. All the Na* 
pumps have the portion of the molecule that normally faces 
the cytosol oriented toward the outside of the vesicles. With 
the help of Figures 12-11 and 12-12, determine what would 
happen in each of the following cases. 


A. Your vesicles were suspended in a solution containing 
both Na* and K* ions and had a solution with the same ionic 
composition inside them. 


B. You add ATP to the suspension described in (A). 


C. You add ATP but the solution—outside as well as inside 
the vesicles—contains only Na* ions and no K* ions. 


D. The concentrations of Na* and Kt were as in (A), but 
half of the pump molecules embedded in the membrane of 
each vesicle were oriented the other way around, so that 
the normally cytosolic portions of these molecules faced the 
inside of the vesicles. You then add ATP to the suspension. 


E. You add ATP to the suspension described in (A), but in 
addition to Na* pumps, the membrane of your vesicles also 
contains K* leak channels. 


QUESTION 12-16 


Name the three ways in which an ion channel can be gated. 


QUESTION 12-17 


One thousand Ca?* channels open in the plasma membrane 
of a cell that is 1000 um? in size and has a cytosolic 

Ca** concentration of 100 nM. For how long would the 
channels need to stay open in order for the cytosolic Ca?* 
concentration to rise to 5 uM? There is virtually unlimited 
Ca?* available in the outside medium (the extracellular 
Ca?* concentration in which most animal cells live is a few 
millimolar), and each channel passes 106 Ca?* ions per 
second. 


QUESTION 12-18 


Amino acids are taken up by animal cells using a symport 
in the plasma membrane. What is the most likely ion 
whose electrochemical gradient drives the import? Is ATP 
consumed in the process? If so, how? 


QUESTION 12-19 


We will see in Chapter 15 that endosomes, which are 
membrane-enclosed intracellular organelles, need an acidic 
lumen in order to function. Acidification is achieved by an 
H* pump in the endosomal membrane, which also contains 
CI channels. If the channels do not function properly (e.g., 
because of a mutation in the genes encoding the channel 
proteins), acidification is also impaired. 


A. Can you explain how CI- channels might help 
acidification? 


B. According to your explanation, would the CI- channels 
be absolutely required to lower the pH inside the 
endosome? 


QUESTION 12-20 


Some bacterial cells can grow on either ethanol 
(CH3CH2OH) or acetate (CH3COO_) as their only carbon 
source. Dr. Schwips measured the rate at which the two 
compounds traverse the bacterial plasma membrane but, 
due to excessive inhalation of one of the compounds (which 
one?), failed to label his data accurately. 


A. Plot the data from the table below. 


Concentration of 
Carbon Source (mM) 


Rate of Transport (umol/min) 


Compound A 
0.1 2.0 18 
0.3 6.0 46 
1.0 20 100 
3.0 60 150 
10.0 200 182 


B. Determine from your graph whether the data describing 
compound A correspond to the uptake of ethanol or 
acetate. 


Explain your answers. 


Questions 


QUESTION 12-21 


Acetylcholine-gated cation channels do not discriminate 
between Na’, K*, and Ca?* ions, allowing all to pass 
through them freely. So why is it that when acetylcholine 
binds to this protein in the plasma membrane of muscle 
cells, the channel opens and there is a large net influx of 
primarily Na* ions? 


QUESTION 12-22 


The ion channels that are regulated by binding of 
neurotransmitters, such as acetylcholine, glutamate, GABA, 
or glycine, have a similar overall structure. Yet each class 

of these channels consists of a very diverse set of subtypes 
with different transmitter affinities, different channel 
conductances, and different rates of opening and closing. 
Do you suppose that such extreme diversity is a good or 

a bad thing from the standpoint of the pharmaceutical 
industry? 
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How Cells Obtain Energy 
from Food 


To be able to grow, divide, and carry out day-to-day activities, cells require 
a constant supply of energy. This energy comes from the chemical-bond 
energy in food molecules, which thereby serve as fuel for cells. 


Perhaps the most important fuel molecules are the sugars (too much of 
which can, unfortunately, lead to obesity and type 2 diabetes). Plants 
make their own sugars from CO2 by photosynthesis. Animals obtain sug- 
ars—and other organic molecules that can be chemically transformed 
into sugars—by eating plants and other organisms. Nevertheless, the pro- 
cess whereby all these sugars are broken down to generate energy is very 
similar in both animals and plants. In both cases, the organism’s cells 
harvest useful energy from the chemical-bond energy locked in sugars as 
the sugar molecule is broken down and oxidized to carbon dioxide (CO2) 
and water (H2O)—a process called cell respiration. The energy released 
during these reactions is captured in the form of “high-energy” chemi- 
cal bonds—covalent bonds that release large amounts of energy when 
hydrolyzed—in activated carriers such as ATP and NADH. These carriers 
in turn serve as portable sources of the chemical groups and electrons 
needed for biosynthesis (discussed in Chapter 3). 


In this chapter, we trace the major steps in the breakdown of sugars and 
show how ATP, NADH, and other activated carriers are produced along 
the way. We concentrate on the breakdown of glucose because it gener- 
ates most of the energy produced in the majority of animal cells. A very 
similar pathway operates in plants, fungi, and many bacteria. Other mol- 
ecules, such as fatty acids and proteins, can also serve as energy sources 
if they are funneled through appropriate enzymatic pathways. We will 
see how cells use many of the molecules generated from the breakdown 
of sugars and fats as starting points to make other organic molecules. 
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THE BREAKDOWN AND 
UTILIZATION OF SUGARS 
AND FATS 


REGULATION OF METABOLISM 
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(A) DIRECT BURNING OF SUGAR 
IN NONLIVING SYSTEM 


large activation 


energy overcome 
by the heat from 


a fire 


+ 


free energy 


Figure 13-1 The controlled, stepwise 
oxidation of sugar in cells captures useful 
energy, unlike the simple burning of the 
same fuel molecule. (A) The direct burning 
of sugar in nonliving conditions releases 

a large amount of energy all at once. This 
quantity is too large to be captured by 

any carrier molecule, and all of the energy 
is released as heat. (B) In a cell, enzymes 
catalyze the breakdown of sugars via a 
series of small steps, in which a portion of 
the free energy released is captured by the 
formation of activated carriers—most often 
ATP and NADH. Each step is catalyzed 

by an enzyme that lowers the activation- 
energy barrier that must be surmounted by 
the random collision of molecules at the 
temperature of cells (body temperature), 

so as to allow the reaction to occur (see 
Figures 3-12 and 3-13). Note that the total 
free energy released by the complete 
oxidative breakdown of glucose to CO? and 
H2O—2880 kJ/mole—is exactly the same in 
(A) and (B). 


outer mitochondrial membrane 


intermembrane inner mitochondrial 
space membrane 


Figure 13-2 A mitochondrion has two 
membranes and a large internal space 
called the matrix. Most of the energy 
from food molecules is harvested in 
mitochondria—both in the matrix and in 
the inner mitochondrial membrane. 
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(B) STEPWISE OXIDATION OF SUGAR IN CELLS 


small activation energies 
overcome by enzymes that 
work at body temperature 


Finally, we examine how cells regulate their metabolism and how they 
store food molecules for their future metabolic needs. We will save our 
discussion of the elaborate mechanism cells use to produce the bulk of 
their ATP for Chapter 14. 


THE BREAKDOWN AND UTILIZATION OF 
SUGARS AND FATS 


If a fuel molecule such as glucose were oxidized to CO2 and H20 in a sin- 
gle step—by, for example, the direct application of fire—it would release 
an amount of energy many times larger than any carrier molecule could 
capture (Figure 13-1A). Instead, cells use enzymes to carry out the oxi- 
dation of sugars in a tightly controlled series of reactions. Thanks to 
the action of enzymes—which operate at temperatures typical of living 
things—cells degrade each glucose molecule step by step, paying out 
energy in small packets to activated carriers by means of coupled reac- 
tions (Figure 13-1B). In this way, much of the energy released by the 
oxidative breakdown of glucose is saved in the high-energy bonds of ATP 
and other activated carriers, which can then be made available to do use- 
ful work for the cell. 


Animal cells make ATP in two ways. First, certain energetically favorable, 
enzyme-catalyzed reactions involved in the breakdown of food-derived 
molecules are coupled directly to the energetically unfavorable reac- 
tion ADP + Pi — ATP. Thus the oxidation of food molecules can provide 
energy for the immediate production of ATP. Most ATP synthesis, how- 
ever, requires an intermediary. In this second pathway to ATP production, 
the energy from other activated carriers is used to drive ATP synthesis. 
This process, called oxidative phosphorylation, takes place on the inner 
mitochondrial membrane of eukaryotic cells (Figure 13-2)—or on the 
plasma membrane of aerobic prokaryotes—and it is described in detail in 
Chapter 14. In this chapter, we focus on the first sequence of reactions by 
which food molecules are oxidized—both in the cytosol and in the mito- 
chondrial matrix (see Figure 13-2). These reactions produce both ATP 
and the additional activated carriers that can subsequently help drive the 
production of much larger amounts of ATP by oxidative phosphorylation. 


Food Molecules Are Broken Down in Three Stages 


The proteins, fats, and polysaccharides that make up most of the food 
we eat must be broken down into smaller molecules before our cells can 


The Breakdown and Utilization of Sugars and Fats 


use them—either as a source of energy or as building blocks for making 
other organic molecules. This breakdown process—in which enzymes 
degrade complex organic molecules into simpler ones—is called 
catabolism. The process takes place in three stages, as illustrated in 
Figure 13-3. 


In stage 1 of catabolism, enzymes convert the large polymeric molecules 
in food into simpler monomeric subunits: proteins into amino acids, 
polysaccharides into sugars, and fats into fatty acids and glycerol. This 
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Figure 13-3 In animals, the breakdown 
of food molecules occurs in three stages. 
(A) Stage 1 mostly occurs outside cells, with 
the breakdown of large food molecules 

in the mouth and the gut—although 
intracellular lysosomes can also digest such 
large molecules. Stage 2 starts intracellularly 
with glycolysis in the cytosol, and ends with 
the conversion of pyruvate to acetyl groups 
on acetyl CoA in the mitochondrial matrix. 
Stage 3 begins with the citric acid cycle in 
the mitochondrial matrix and concludes 
with oxidative phosphorylation on the 
mitochondrial inner membrane. The NADH 
generated in stage 2 adds to the NADH 
produced by the citric acid cycle to drive 
the production of large amounts of ATP by 
oxidative phosphorylation. 

(B) The net products of the complete 
oxidation of food include ATP, NADH, CO», 
and H20. The ATP and NADH provide 
the energy and electrons needed for 
biosynthesis; the CO2 and H20 are waste 
products. 
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stage—also called digestion—occurs either outside cells (mainly in the 
intestine) or in specialized organelles within cells (called lysosomes, as 
discussed in Chapter 15). After digestion, the small organic molecules 
derived from food enter the cytosol of a cell, where their gradual oxida- 
tive breakdown begins. 


In stage 2 of catabolism, a chain of reactions called glycolysis splits each 
molecule of glucose into two smaller molecules of pyruvate. Sugars other 
than glucose can also be used, after first being converted into one of the 
intermediates in this sugar-splitting pathway. Glycolysis takes place in 
the cytosol and, in addition to producing pyruvate, it generates two types 
of activated carriers: ATP and NADH. The pyruvate is transported from 
the cytosol into the mitochondrion’s large, internal compartment called 
the matrix. There, a giant enzyme complex converts each pyruvate mol- 
ecule into CO? plus acetyl CoA, another of the activated carriers discussed 
in Chapter 3 (see Figure 3-37). NADH is also produced in this reaction. In 
the same compartment, large amounts of acetyl CoA are also produced 
by the stepwise oxidative breakdown of fatty acids derived from fats (see 
Figure 13-3). 


Stage 3 of catabolism takes place entirely in mitochondria. The acetyl 
group in acetyl CoA is transferred to an oxaloacetate molecule to form 
citrate, which enters a series of reactions called the citric acid cycle. In 
these reactions, the transferred acetyl group is oxidized to CO2, with the 
production of large amounts of NADH. Finally, the high-energy electrons 
from NADH are passed along a series of enzymes within the mitochon- 
drial inner membrane called an electron-transport chain, where the energy 
released by their transfer is used to drive oxidative phosphorylation—a 
process that produces ATP and consumes molecular oxygen (O2 gas). It is 
in these final steps of catabolism that the majority of the energy released 
by oxidation is harnessed to produce most of the cell's ATP. 


Through the production of ATP, the energy derived from the breakdown 
of sugars and fats is redistributed into packets of chemical energy in a 
form convenient for use in the cell. In total, nearly 50% of the energy that 
could, in theory, be derived from the breakdown of glucose or fatty acids 
to H20 and CO; is captured and used to drive the energetically unfavora- 
ble reaction ADP + Pi — ATP. By contrast, a modern combustion engine, 
such as a car engine, can convert no more than 20% of the available 
energy in its fuel into useful work. In both cases, the remaining energy is 
released as heat, which in animals helps to keep the body warm. 


Roughly 10? molecules of ATP are in solution in a typical cell at any 
instant. In many cells, all of this ATP is turned over (that is, consumed 
and replaced) every 1-2 minutes. Thus, an average person at rest will 
hydrolyze his or her weight in ATP molecules every 24 hours. 


Glycolysis Extracts Energy from the Splitting of Sugar 


The central process in stage 2 of catabolism is the oxidative breakdown 
of glucose by the sequential reactions of glycolysis. The reactions take 
place in the cytosol of most cells, and they do not require the participa- 
tion of molecular oxygen. Indeed, many anaerobic microorganisms that 
thrive in the absence of oxygen use glycolysis to produce ATP. Thus, this 
energy-generating series of reactions probably evolved early in the his- 
tory of life, before photosynthetic organisms introduced oxygen into the 
Earth's atmosphere. 


The term “glycolysis” comes from the Greek glykys, “sweet,” and lysis, 
“splitting.” It is an appropriate name, as glycolysis splits a molecule 
of glucose, which has six carbon atoms, to form two molecules of 
pyruvate, each of which contains three carbon atoms. The series of 
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chemical re-arrangements that ultimately generate pyruvate release 
energy because the electrons in a molecule of pyruvate are, overall, at 
a lower energy state than those in a molecule of glucose. Nevertheless, 
for each molecule of glucose that enters glycolysis, two molecules of ATP 
are initially consumed to provide the energy needed to prepare the sugar 
to be split. This investment of energy is more than recouped in the later 
steps of glycolysis, when four molecules of ATP are produced. During this 
“payoff phase,” energy is also captured in the form of NADH. Thus, at the 
end of glycolysis, there is a net gain of two molecules of ATP and two 
molecules of NADH for each molecule of glucose that is oxidized (Figure 
13-4). 


Glycolysis Produces both ATP and NADH 


Piecing together the complete glycolytic pathway in the 1930s was a 
major triumph of biochemistry, as the pathway consists of a sequence 
of 10 separate reactions, each producing a different sugar intermediate 
and each catalyzed by a different enzyme. These reactions are presented 
in outline in Figure 13-5 and in detail in Panel 13-1 (pp. 436-437). The 
different enzymes participating in the reactions of glycolysis, like most 
enzymes, all have names ending in -ase—like isomerase and dehydroge- 
nase—which specify the type of reaction they catalyze (Table 13-1). 


Much of the energy released by the breakdown of glucose is used to 
drive the synthesis of ATP molecules from ADP and Pj. This form of ATP 


TABLE 13-1 SOME TYPES OF ENZYMES INVOLVED IN GLYCOLYSIS 


Kinase catalyzes the addition of a kinase transfers a phosphate 
a phosphate group to group from ATP to a substrate 
molecules in steps 1 and 3; other kinases 


transfer a phosphate to ADP to 
form ATP in steps 7 and 10 


Isomerase catalyzes the rearrangement isomerases in steps 2 and 5 
of bonds within a single prepare molecules for the 
molecule chemical alterations to come 

Dehydrogenase catalyzes the oxidation of the enzyme glyceraldehyde 
a molecule by removing 3-phosphate dehydrogenase 
a hydrogen atom plus an generates NADH in step 6 
electron (a hydride ion, H7) 

Mutase catalyzes the shifting of a the movement of a phosphate 
chemical group from one by phosphoglycerate mutase 
position to another within a in step 8 helps prepare the 
molecule substrate to transfer this group 


to ADP to make ATP in step 10 


ATP | 


two molecules 
of pyruvate 


Figure 13-4 Glycolysis splits a molecule 
of glucose to form two molecules of 
pyruvate. The process requires an input of 
energy, in the form of ATP, at the start. This 
energy investment is later recouped by the 
production of two NADHs and four ATPs. 
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Figure 13-5 The stepwise breakdown CHOH 
of sugars begins with glycolysis. Each of 
the 10 steps of glycolysis is catalyzed by a 

different enzyme. Note that step 4 cleaves 

a six-carbon sugar into two three-carbon 

intermediates, so that the number of 

molecules at every stage after this doubles. 

Note also that one of the two products 

of step 4 is modified (isomerized) in step 

5 to convert it into a second molecule of 

glyceraldehyde 3-phosphate, the other 

product of step 4 (see Panel 13-1). As 

indicated, step 6 launches the energy- 


generation phase of glycolysis, which results ®—OoH,C o 
in the net synthesis of ATP and NADH P 
(see also Figure 13-4). Glycolysis is also 
sometimes referred to as the Embden- 
Meyerhof pathway, named for the chemists OH 
who first described it. All the steps of 
glycolysis are reviewed in Movie 13.1. 
ie 
de 
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synthesis, which takes place in steps 7 and 10 in glycolysis, is known 
as substrate-level phosphorylation because it occurs by the transfer of a 
phosphate group directly from a substrate molecule—one of the sugar 
intermediates—to ADP. By contrast, most phosphorylations in cells occur 
by the transfer of a phosphate group from ATP to a substrate molecule. 


The remainder of the useful energy harnessed by the cell during glyco- 
lysis is stored in the electrons in the NADH molecule produced in step 6 
by an oxidation reaction. As discussed in Chapter 3, oxidation does not 
always involve oxygen; it occurs in any reaction in which electrons are 
lost from one atom and transferred to another. So, although no molecular 
oxygen is involved in glycolysis, oxidation does occur: in step 6, a hydro- 
gen atom plus an electron (a hydride ion, H) is removed from the sugar 
intermediate, glyceraldehyde 3-phosphate, and transferred to NAD+, pro- 
ducing NADH (see Panel 13-1, p. 437). 


Over the course of glycolysis, two molecules of NADH are formed per 
molecule of glucose consumed. In eukaryotic organisms, these NADH 
molecules are transported into mitochondria, where they donate their 
electrons to an electron-transport chain that produces ATP by oxidative 
phosphorylation in the inner mitochondrial membrane, as described in 
detail in Chapter 14. These electrons pass along the electron-transport 
chain to O2, eventually forming water. 
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In giving up its electrons, NADH is converted back into NAD*, which is 
then available to be used again for glycolysis. In the absence of oxygen, 
NAD* can be regenerated by an alternate type of energy-yielding reaction 
called a fermentation, as we discuss next. 


Fermentations Can Produce ATP in the Absence of 
Oxygen 


For most animal and plant cells, glycolysis is only a prelude to the third 
and final stage of the breakdown of food molecules, in which large 
amounts of ATP are generated in mitochondria by oxidative phospho- 
rylation, a process that requires the consumption of oxygen. However, 
for many anaerobic microorganisms, which can grow and divide in the 
absence of oxygen, glycolysis is the principal source of ATP. Certain ani- 
mal cells also rely on ATP produced by glycolysis when oxygen levels fall, 
such as skeletal muscle cells during vigorous exercise. 


In these anaerobic conditions, the pyruvate and NADH made by glycoly- 
sis remain in the cytosol (see Figure 13-3). There, pyruvate is converted 
into products that are excreted from the cell: lactate in muscle cells, 
for example, or ethanol and CO; in the yeast cells used in brewing and 
breadmaking. In the process, the NADH gives up its electrons and is con- 
verted back to the NAD* required to maintain the reactions of glycolysis. 
Such energy-yielding pathways that break down sugar in the absence 
of oxygen are called fermentations (Figure 13-6). Scientific studies of 
the commercially important fermentations carried out by yeasts laid the 
foundations for early biochemistry. 
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QUESTION 13-1 


At first glance, the final steps 

in fermentation appear to be 
unnecessary: the generation of 
lactate or ethanol does not produce 
any additional energy for the cell. 
Explain why cells growing in the 
absence of oxygen could not simply 
discard pyruvate as a waste product. 
Which products derived from 
glucose would accumulate in cells 
unable to generate either lactate or 
ethanol by fermentation? 


Figure 13-6 Pyruvate is broken 

down in the absence of oxygen by 
fermentation. (A) When inadequate 
oxygen is present, for example in a muscle 
cell undergoing repeated contraction, 
glycolysis produces pyruvate, which is 
converted to lactate in the cytosol. This 
reaction restores the NAD* consumed in 
step 6 of glycolysis. The whole pathway 
yields much less energy overall than if the 
pyruvate were oxidized in mitochondria. 
(B) In some microorganisms that can 
grow anaerobically, pyruvate is converted 
into carbon dioxide and ethanol. Again, 
this pathway regenerates NAD* from 
NADH, as required to enable glycolysis 
to continue. In both (A) and (B), for 

each molecule of glucose that enters 
glycolysis, two molecules of pyruvate are 
generated (only a single pyruvate is shown 
here); these two pyruvates subsequently 
yield two molecules of lactate—or two 
molecules of CO2 and ethanol—plus two 
molecules of NAD*. 
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Figure 13-7 A pair of coupled reactions 
drives the energetically unfavorable 
formation of NADH and ATP in steps 

6 and 7 of glycolysis. In this diagram, 
energetically favorable reactions are 
represented by blue arrows (see Figure 
3-17) and energetically costly reactions by 
red arrows. In step 6, the energy released 
by the energetically favorable oxidation of 
a C-H bond in glyceraldehyde 3-phosphate 
(blue arrow) is large enough to drive 

two energetically costly reactions: the 
formation of both NADH and a high-energy 
phosphate bond in 1,3-bisphosphoglycerate 
(red arrows). The subsequent energetically 
favorable hydrolysis of that high-energy 
phosphate bond in step 7 then drives 

the formation of ATP. The formation of 
1,3-bisphosphoglycerate (in step 6) and of 
ATP (in step 7) both represent a substrate- 
level phosphorylation. 
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Figure 13-8 Differences in the energies 
of different phosphate bonds allow the 
formation of ATP by substrate-level 
phosphorylation. Examples of molecules 
formed during glycolysis that contain 
different types of phosphate bonds are 
shown, along with the free-energy change 
for hydrolysis of those bonds in kJ/mole. 
The transfer of a phosphate group from 
one molecule to another is energetically 
favorable if the standard free-energy change 
(AG*) for hydrolysis of the phosphate bond 
is more negative for the donor molecule 
than for the acceptor. (The hydrolysis 
reactions can be thought of as the transfer 
of the phosphate group to water.) Thus, 

a phosphate group is readily transferred 
from 1,3-bisphosphoglycerate to ADP to 
form ATP. Transfer reactions involving the 
phosphate groups in these molecules are 
detailed in Panel 13-1 (pp. 436-437). 
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Many bacteria and archaea can also generate ATP in the absence of oxy- 
gen by anaerobic respiration, a process that uses a molecule other than 
oxygen as a final electron acceptor. Anaerobic respiration differs from 
fermentation in that it involves an electron-transport chain embedded in 
a membrane—in this case, the plasma membrane of the prokaryote. 


Glycolytic Enzymes Couple Oxidation to Energy Storage 
in Activated Carriers 


Cells harvest useful energy from the oxidation of organic molecules by 
coupling an energetically unfavorable reaction to an energetically favora- 
ble one (see Figure 3-17). Here, we take a closer look at a key pair of 
glycolytic reactions that demonstrate how enzymes catalyze such cou- 
pled reactions, facilitating the transfer of chemical energy to form ATP 
and NADH. 


The reactions in question, steps 6 and 7 of glycolysis (see Panel 13-1), 
transform the three-carbon sugar intermediate glyceraldehyde 3-phos- 
phate into 3-phosphoglycerate. This two-step chemical conversion 
oxidizes the aldehyde group in glyceraldehyde 3-phosphate to the car- 
boxylic acid group in 3-phosphoglycerate. The overall reaction releases 
enough free energy to transfer two electrons from the aldehyde to NAD* 
to form NADH (in step 6) and to transfer a phosphate group to a molecule 
of ADP to form ATP (in step 7). It also releases enough heat to the envi- 
ronment to make the overall reaction energetically favorable: the AG° for 
step 6 followed by step 7 is -12.5 kJ/mole (Figure 13-7). 


The reaction in step 6 is the only one in glycolysis that creates a high- 
energy phosphate linkage directly from inorganic phosphate. This reaction 
generates a high-energy intermediate—1 ,3-bisphosphoglycerate—whose 
phosphate bonds contain more energy than those found in ATP. Such 
molecules readily transfer a phosphate group to ADP to form ATP. 
Figure 13-8 compares the high-energy phosphoanhydride bond in ATP 
with a few of the other phosphate bonds that are generated during gly- 
colysis. The energy contained in these phosphate bonds is determined 
by measuring the standard free-energy change (AG°) when each bond is 
broken by hydrolysis. As explained in Panel 13-1, such bonds are often 
described as having “high energy” because their hydrolysis is particularly 
energetically favorable. 
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The important substrate-level phosphorylation reaction at the center of 
glycolysis, in which a high-energy linkage in 1,3-bisphosphoglycerate is 
generated in step 6—and then consumed in step 7 to produce ATP—is 
presented in detail in Figure 13-9. 
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QUESTION 13-2 


Arsenate (AsO4%-) is chemically very 
similar to phosphate (PO4°-) and can 
be used as an alternative substrate 
by many phosphate-requiring 
enzymes. In contrast to phosphate, 
however, an anhydride bond 
between arsenate and carbon is very 
quickly hydrolyzed nonenzymatically 
in water. Knowing this, suggest why 
arsenate is a compound of choice 
for murderers but not for cells. 
Formulate your explanation in the 
context of Figure 13-7. 


Figure 13-9 The oxidation of 
glyceraldehyde 3-phosphate is coupled 
to the formation of ATP and NADH in 
steps 6 and 7 of glycolysis. (A) In step 6, 
the enzyme glyceraldehyde 3-phosphate 
dehydrogenase couples the energetically 
favorable oxidation of an aldehyde to 

the energetically unfavorable formation 

of a high-energy phosphate bond. At 

the same time, it enables energy to be 
stored in NADH. In step 7, the newly 
formed high-energy phosphate bond in 
1,3-bisphosphoglycerate is transferred 

to ADP, forming a molecule of ATP and 
leaving a free carboxylic acid group on the 
oxidized sugar. The part of the molecule 
that undergoes a change is shaded in blue; 
the rest of the molecule remains unchanged 
throughout all these reactions. (B) Summary 
of the overall chemical change produced by 
the reactions of steps 6 and 7. 
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For each step, the part of the molecule that undergoes a change is shadowed in blue, and the name of the enzyme that catalyzes the 
reaction is in a yellow box. Reactions represented by double arrows ( == ) are readily reversible, whereas those represented by 
single arrows (— ) are effectively irreversible. To watch a video of the reactions of glycolysis, see 


STEP 1 


Glucose is phosphorylated 
by ATP to form a sugar 
phosphate. The negative 
charge of the phosphate 
prevents passage of the 
sugar phosphate through 
the plasma membrane, 
trapping glucose inside 
the cell. 


STEP 2 


A readily reversible 
rearrangement of the 
chemical structure 
(isomerization) 
moves the carbonyl 
oxygen from 
carbon 1 to 

carbon 2, forming 
a ketose from an 
aldose sugar. 

(See Panel 2-4, 

pp. 72-73.) 
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STEP 3 


The new hydroxyl group on 
carbon 1 is phosphorylated by 
ATP, in preparation for the 
formation of two three-carbon 
sugar phosphates. The entry of 
sugars into glycolysis is 
controlled at this step, through 
regulation of the enzyme 
phosphofructokinase. 
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STEP 5 


The other product of step 4, = 
dihydroxyacetone phosphate, C=O 

is isomerized to form a second | 

molecule of glyceraldehyde CHO —P 
3-phosphate. 
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STEP 6 


The two molecules of 
glyceraldehyde 3-phosphate 
produced in steps 4 and 5 are 
oxidized. The energy- 
generation phase of glycolysis 
begins, as NADH and a new 
high-energy anhydride linkage 
to phosphate are formed (see 
Figure 13-5). 
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forms ATP. 


STEP 8 


The remaining phosphate 
ester linkage in 
3-phosphoglycerate, 
which has a relatively low 
free energy of hydrolysis, 
is moved from carbon 3 
to carbon 2 to form 
2-phosphoglycerate. 


STEPS) 


The removal of water from 
2-phosphoglycerate creates a 
high-energy enol phosphate 
linkage. 


STEP 10 


The transfer to ADP of the 
high-energy phosphate 
group that was generated 
in step 9 forms ATP, 
completing glycolysis. 
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In addition to the pyruvate, the net products of glycolysis 
are two molecules of ATP and two molecules of NADH. 
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Figure 13-10 Pyruvate is converted into acetyl CoA and CO2 

by the pyruvate dehydrogenase complex in the mitochondrial 
matrix. The pyruvate dehydrogenase complex contains multiple 
copies of three enzymes—pyruvate dehydrogenase (1), dihydrolipoyl 
transacetylase (2), and dihydrolipoyl dehydrogenase (3). This enzyme 
complex removes a CO? from pyruvate to generate NADH and acetyl 
CoA. Pyruvate and its products—including the waste product, CO2— 
are shown in red lettering. In the large multienzyme complex, reaction 
intermediates are passed directly from one enzyme to another. To get 
a sense of scale, a single pyruvate dehydrogenase complex is larger 
than a ribosome. 


Several Types of Organic Molecules Are Converted to 
Acetyl CoA in the Mitochondrial Matrix 


In aerobic metabolism in eukaryotic cells, the pyruvate produced by glyc- 
olysis is actively pumped into the mitochondrial matrix (see Figure 13-3). 
There, it is rapidly decarboxylated by a giant complex of three enzymes, 
called the pyruvate dehydrogenase complex. The products of this set of 
reactions are CO2 (a waste product), NADH, and acetyl CoA (Figure 
13-10). The latter is produced when the acetyl group derived from pyru- 
vate is linked to coenzyme A (CoA). 


In addition to sugar, which is broken down during glycolysis, fat is a 
major source of energy for most nonphotosynthetic organisms, includ- 
ing humans. Like the pyruvate derived from glycolysis, the fatty acids 
derived from fat are also converted into acetyl CoA in the mitochondrial 
matrix (see Figure 13-3). Fatty acids are first activated by covalent link- 
age to CoA and are then broken down completely by a cycle of reactions 
that trims two carbons at a time from their carboxyl end, generating one 
molecule of acetyl CoA for each turn of the cycle; two activated carri- 
ers—NADH and another high-energy electron carrier, FADH2—are also 
produced in this process (Figure 13-11). 


In addition to pyruvate and fatty acids, some amino acids are transported 
from the cytosol into the mitochondrial matrix, where they are also con- 
verted into acetyl CoA or one of the other intermediates of the citric acid 
cycle (see Figure 13-3). Thus, in the eukaryotic cell, the mitochondrion 
represents the center toward which all energy-yielding catabolic pro- 
cesses lead, whether they begin with sugars, fats, or proteins. In aerobic 
prokaryotes—which have no mitochondria—glycolysis and acetyl CoA 
production, as well as the citric acid cycle, take place in the cytosol. 


Catabolism does not end with the production of acetyl CoA. In the pro- 
cess of converting food molecules to acetyl CoA, only a part of their 
stored energy is captured in the bonds of activated carriers: most remains 
locked up in acetyl CoA. The next stage in cell respiration is the citric acid 
cycle, in which the acetyl group in acetyl CoA is oxidized to CO% in the 
mitochondrial matrix, as we now discuss. 


The Citric Acid Cycle Generates NADH by Oxidizing 
Acetyl Groups to CO2 


The citric acid cycle, a series of reactions that takes place in the mito- 
chondrial matrix of eukaryotic cells, catalyzes the complete oxidation of 
the carbon atoms of the acetyl groups in acetyl CoA. The final product 
of this oxidation, CO7, is released as a waste product. The acetyl-group 
carbons are not oxidized directly, however. Instead, they are transferred 
from acetyl CoA to a larger four-carbon molecule, oxaloacetate, to form 
the six-carbon tricarboxylic acid, citric acid, for which the subsequent 
cycle of reactions is named. The citric acid molecule (also called citrate) is 
then progressively oxidized, and the energy of this oxidation is harnessed 
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Figure 13-11 Fatty acids derived from fats are also converted to acetyl CoA in the mitochondrial matrix. 

(A) Fats are stored in the form of triacylglycerol, the glycerol portion of which is shown in blue. Three fatty acid chains 
(shaded in red) are linked to this glycerol through ester bonds. Enzymes called lipases can hydrolyze these ester 
bonds when fatty acids are needed for energy (not shown). The released fatty acids are then coupled to coenzyme 

A ina reaction requiring ATP. These activated fatty acids (fatty acyl CoA) are subsequently oxidized in a cycle 
containing four enzymes, which are not shown. Each turn of the cycle shortens the fatty acyl CoA molecule by two 
carbons (red) and generates one molecule each of FADH2, NADH, and acetyl CoA. (B) Fats are insoluble in water and 
spontaneously form large lipid droplets in specialized fat cells called adipocytes. This electron micrograph shows a 
lipid droplet in the cytoplasm of an adipocyte. (B, courtesy of Daniel S. Friend.) 


to produce activated carriers, including NADH, in much the same manner 
as we described for glycolysis. This series of eight reactions forms a cycle, 
because the oxaloacetate that began the process is regenerated at the 
end (Figure 13-12). The citric acid cycle—which is also called the tricar- 
boxylic acid cycle or the Krebs cycle—is presented in detail in Panel 13-2 
(pp. 442-443), and the experiments that first revealed its cyclic nature are 
described in How We Know, pp. 444-445. 


Although the citric acid cycle accounts for about two-thirds of the total 
oxidation of carbon compounds in most cells, none of its steps use 
molecular oxygen. The cycle, however, requires O2 to proceed because 
the NADH generated passes its high-energy electrons to an electron- 
transport chain in the inner mitochondrial membrane, and this chain 
uses Op as its final electron acceptor. Oxygen thus allows NADH to hand 
off its high-energy electrons, regenerating the NAD* needed to keep the 
citric acid cycle going. Although living organisms have inhabited Earth 
for more than 3.5 billion years, the planet is thought to have developed 
an atmosphere containing Oz gas only some 1 to 2 billion years ago (see 
Figure 14-46). Many of the energy-generating reactions of the citric acid 
cycle, which is now used by all aerobic organisms, are therefore likely to 
be of relatively recent origin. 


A common misconception about the citric acid cycle is that the oxygen 
atoms required to make CO from the acetyl groups entering the citric 


QUESTION 13-3 


Many catabolic and anabolic 
reactions are based on reactions 
that are similar but work in opposite 
directions, such as the hydrolysis 
and condensation reactions 
described in Figure 3-39. This is true 
for fatty acid breakdown and fatty 
acid synthesis. From what you know 
about the mechanism of fatty acid 
breakdown outlined in Figure 13-11, 
would you expect the fatty acids 
found in cells to most commonly 
have an even or an odd number of 
carbon atoms? 
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Figure 13-12 The citric acid cycle 
catalyzes the complete oxidation of 
acetyl groups supplied by acetyl CoA. 
The cycle begins with the reaction of 
acetyl CoA (derived from pyruvate as 
shown in Figure 13-10) with oxaloacetate 
to produce citric acid (citrate). The number 
of carbon atoms in each intermediate is 
shaded in yellow. (See also Panel 13-2, 
pp. 442-443.) The steps of the citric acid 
cycle are reviewed in Movie 13.2. 
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acid cycle are supplied by atmospheric O2. In fact, these oxygen atoms 
come from water (H20). As illustrated at the top of Panel 13-2, three 
molecules of H20 are split as they enter each turn of the cycle, and their 
oxygen atoms are ultimately used to make CO2, As we see shortly, the 
Oz that we breathe is actually reduced to H20 by the electron-transport 
chain; it does not form the CO; that we exhale. 


Thus far, we have discussed only one of the three types of activated carri- 
ers that are produced by the citric acid cycle—NADH. In addition to three 
molecules of NADH, each turn of the cycle also produces one molecule of 
FADH, (reduced flavin adenine dinucleotide) from FAD and one mol- 
ecule of the ribonucleoside triphosphate GTP (guanosine triphosphate) 
from GDP (see Figure 13-12). The structures of these two activated car- 
riers are illustrated in Figure 13-13. GTP is a close relative of ATP, and 
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Figure 13-13 Each turn of the citric acid cycle produces one molecule of GTP H—C—OH 

and one molecule of FADH2. (A) GTP and GDP are close relatives of ATP and ADP, 

respectively, the only difference being the substitution of the base guanine for adenine. H—C—OH 

(B) Despite its very different structure, FADH2, like NADH and NADPH (see Figure 3-34), 

is a carrier of hydrogen atoms and high-energy electrons. It is shown here in its oxidized H,C -O-(P)\(®)—O- CH, 

form (FAD), with the part of the structure involved in accepting and donating hydrogens 

highlighted in yellow. FAD can accept two hydrogen atoms, along with their electrons, 

to form the reduced FADH2. The atoms involved are shown in their reduced form in the FAD 


excerpt to the right. 
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the transfer of its terminal phosphate group to ADP produces one ATP 
molecule in each cycle. Like NADH, FADH; is a carrier of high-energy 
electrons. And like NADH, FADH; transfers its high-energy electrons to 
the electron-transport chain in the inner mitochondrial membrane. As 
we discuss shortly, the movement of energy stored in these readily trans- 
ferrable electrons is subsequently used to produce ATP through oxidative 
phosphorylation on the inner mitochondrial membrane, the only step in 
the oxidative catabolism of foodstuffs that directly requires O2 from the 
atmosphere. 


Many Biosynthetic Pathways Begin with Glycolysis or 
the Citric Acid Cycle 


Catabolic reactions, such as those of glycolysis and the citric acid cycle, 
produce both energy for the cell and the building blocks from which many 
other organic molecules are made. Thus far, we have emphasized energy 
production rather than the provision of starting materials for biosynthe- 
sis. But many of the intermediates formed in glycolysis and the citric acid 
cycle are siphoned off by such anabolic pathways, in which the inter- 
mediates are converted by a series of enzyme-catalyzed reactions into 
amino acids, nucleotides, lipids, and other small organic molecules that 
the cell needs. The oxaloacetate and a-ketoglutarate produced during 
the citric acid cycle, for example (see Panel 13-2), are transferred from 
the mitochondrial matrix back to the cytosol, where they serve as precur- 
sors for the production of many essential molecules, such as the amino 
acids aspartate and glutamate, respectively. An idea of the extent of these 
anabolic pathways can be gathered from Figure 13-14, which illustrates 
some of the branches leading from the central catabolic reactions to bio- 
syntheses. How cells control the flow of intermediates through anabolic 
and catabolic pathways is discussed in the final section of the chapter. 


dihydroxyacetone ———> lipids 


phosphate 


QUESTION 13-4 


Looking at the chemistry detailed in 
the overview of the citric acid cycle 
at the top of the first page of Panel 
13-2 (p. 442), why do you suppose 
it is useful to link the two-carbon 
acetyl group to another, larger 
carbon skeleton, oxaloacetate, 
before completely oxidizing both of 
the acetyl-group carbons to CO2? 
(See also Figure 13-12.) 


Figure 13-14 Glycolysis and the citric 
acid cycle provide the precursors needed 
for cells to synthesize many important 
organic molecules. The amino acids, 
nucleotides, lipids, sugars, and other 
molecules—shown here as products—in 
turn serve as the precursors for many of the 
cell's macromolecules. Each black arrow 

in this diagram denotes a single enzyme- 
catalyzed reaction; the red arrows generally 
represent pathways with many steps that are 
required to produce the indicated products. 
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PAN EL 13-2 THE COMPLETE CITRIC ACID CYCLE 
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Overview of the complete citric acid cycle. 
The two carbons from acetyl CoA that 
enter this turn of the cycle (shadowed in 
red ) will be converted to CO, in 
subsequent turns of the cycle: the two 
carbons in the starting oxaloacetate 
(shadowed in blue ) will be converted to 
CO, in this cycle. 
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Details of these eight steps are shown below. In this part of the panel, for each step, the part of the molecule that undergoes 
a change is shadowed in blue, and the name of the enzyme that catalyzes the reaction is in a yellow box. 
To watch a video of the reactions of the citric acid cycle, see 


STEP 1 


After the enzyme removes 
a proton from the CH; 
group on acetyl CoA, the 
negatively charged CH,~ 
forms a bond to a carbonyl 
carbon of oxaloacetate. 
The subsequent loss of the 
coenzyme A (HS-CoA) by 
hydrolysis drives the 
reaction strongly forward. 


Sinel? 2 


An isomerization reaction, 
in which water is first 
removed and then added 
back, moves the hydroxyl 
group from one carbon 
atom to its neighbor. 
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STEP 3 Coo 
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STEP 4 


The a-ketoglutarate dehydrogenase 
complex closely resembles the large a-ketoglutarate dehydrogenase complex 
enzyme complex that converts 
pyruvate to acetyl CoA, the pyruvate 
dehydrogenase complex in Figure 
13-10. It likewise catalyzes an 
oxidation that produces NADH, CO,, NAD+ +H* 
and a high-energy thioester bond to 
coenzyme A (CoA). CO, 
succinyl CoA 


STEP 5 


An inorganic phosphate H—C—H succinyl CoA synthetase 
displaces the CoA, forming a | 


high-energy phosphate H—C—H 
linkage to succinate. This | 
phosphate is then passed to C=O 
GDP to form GTP. (In bacteria | 

and plants, ATP is formed J 
instead.) S—CoA 


succinyl CoA succinate 


succinate dehydrogenase 
STEP 6 

In the third oxidation step of the 
cycle, FAD accepts two hydrogen 
atoms from succinate. 


STEP 7 fumarase 


The addition of water to 
fumarate places a hydroxyl 
group next to a carbonyl 
carbon. 


STEP 8 | malate dehydrogenase 


In the last of four oxidation 
steps in the cycle, the carbon 
carrying the hydroxyl group is 
converted to a carbonyl group, 
regenerating the oxaloacetate 
needed for step 1. 


oxaloacetate 


™ HOW WE KNOW 


UNRAVELING THE CITRIC ACID CYCLE 


“I have often been asked how the work on the citric 
acid cycle arose and developed,” stated biochemist 
Hans Krebs in a lecture and review article in which he 
described his Nobel Prize-winning discovery of the cycle 
of reactions that lies at the center of cell metabolism. 
Did the concept stem from a sudden inspiration, a rev- 
elatory vision? “It was nothing of the kind,” answered 
Krebs. Instead, his realization that these reactions 
occur in a cycle—rather than a set of linear pathways, 
as in glycolysis—arose from a “very slow evolutionary 
process” that occurred over a five-year period, during 
which Krebs coupled insight and reasoning to careful 
experimentation to discover one of the central path- 
ways that underlies energy metabolism. 


Minced tissues, curious catalysis 


By the early 1930s, Krebs and other investigators had 
discovered that a select set of small organic molecules 
is oxidized extraordinarily rapidly in various types of 
animal tissue preparations—slices of kidney or liver, or 
suspensions of minced pigeon muscle. Because these 
reactions were seen to depend on the presence of 
oxygen, the researchers surmised that this set of mol- 
ecules might include intermediates that are important 
in cell respiration—the consumption of O2 and produc- 
tion of CO; that occurs when tissues break down food 
molecules. 


Using the minced-tissue preparations, Krebs and others 
made the following observations. First, in the presence 
of oxygen, certain organic acids—citrate, succinate, 
fumarate, and malate—were readily oxidized to COp. 
These reactions depended on a continuous supply of 
oxygen. 


Second, the oxidation of these acids occurred in two 
linear, sequential pathways: 


citrate — a-ketoglutarate — succinate 
and 
succinate — fumarate — malate — oxaloacetate 


Third, the addition of small amounts of several of these 
compounds to the minced-muscle suspensions stimu- 
lated an unusually large uptake of O2—far greater 
than that needed to oxidize only the added molecules. 
To explain this surprising observation, Albert Szent- 
Györgyi (the Nobel laureate who worked out the second 
pathway above) suggested that a single molecule of 
each compound must somehow act catalytically to 
stimulate the oxidation of many molecules of some 
endogenous substance in the muscle. 


At this point, most of the reactions central to the citric 
acid cycle were known. What was not yet clear—and 
caused great confusion, even to future Nobel laure- 
ates—was how these apparently linear reactions could 
drive such a catalytic consumption of oxygen, where 


each molecule of metabolite fuels the oxidation of 
many more molecules. To simplify the discussion of 
how Krebs ultimately solved this puzzle—by linking 
these linear reactions together into a circle—we will 
now refer to the molecules involved by a sequence of 
letters, A through H (Figure 13-15). 
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Figure 13-15 In this simplified representation of the citric 
acid cycle, O2 is consumed and COz is liberated as the 
molecular intermediates become oxidized. Krebs and others 
did not initially realize that these oxidation reactions occur in a 
cycle, as shown here. 


A poison suggests a cycle 


Many of the clues that Krebs used to work out the citric 
acid cycle came from experiments using malonate— 
a poisonous compound that specifically inhibits the 
enzyme succinate dehydrogenase, which converts E 
to F. Malonate closely resembles succinate (E) in its 
structure (Figure 13—16), and it serves as a competitive 
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COOR CH, 
mE Figure 13-16 The structure of 
malonate closely resembles 
malonate succinate 


that of succinate. 


inhibitor of the enzyme. Because the addition of 
malonate poisons cell respiration in tissues, Krebs con- 
cluded that succinate dehydrogenase (and the entire 
pathway linked to it) must play a critical role in the cell 
respiration process. 


Krebs then determined that when A, B, or C was added 
to malonate-poisoned tissue suspensions, E accumu- 
lated (Figure 13—17A). This observation reinforced the 
importance of succinate dehydrogenase for successful 
cell respiration. However, he found that E also accumu- 
lated when F, G, or H was added to malonate-poisoned 
muscle (Figure 13-17B). The latter result suggested 
that an additional set of reactions must exist that can 
convert F, G, and H molecules into E, since E was pre- 
viously shown to be a precursor for F, G, and H, rather 
than a product of their reaction pathway. 
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Figure 13-17 Poisoning muscle preparations with malonate 
provided clues to the cyclic nature of these oxidative 
reactions. (A) Adding A (or B or C—not shown) to malonate- 
poisoned muscle causes an accumulation of E. (B) Addition of 

F (or G or H—not shown) to a malonate-poisoned preparation 
also causes an accumulation of E, suggesting that enzymatic 
reactions can convert these molecules into E. The discovery that 
citrate (A) can be formed from oxaloacetate (H) and pyruvate 
allowed Krebs to join these two reaction pathways into a 
complete circle. 
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At about this time, Krebs also determined that when 
muscle suspensions were incubated with pyruvate and 
oxaloacetate, citrate formed: pyruvate + H > A. 


This observation led Krebs to postulate that when oxy- 
gen is present, pyruvate and H condense to form A, 
converting the previously delineated string of linear 
reactions into a cyclic sequence (see Figure 13-15). 


Explaining the mysterious stimulatory 
effects 


The cycle of reactions that Krebs proposed clearly 
explained how the addition of small amounts of any 
of the intermediates A through H could cause the large 
increase in the uptake of Oz that had been observed. 
Pyruvate is abundant in these minced tissues, being 
readily produced by glycolysis (see Figure 13-4), using 
glucose derived from the breakdown of stored glyco- 
gen (as discussed later in this chapter). The oxidation 
of pyruvate requires a functioning citric acid cycle, in 
which each turn of the cycle results in the oxidation 
of one molecule of pyruvate. If the intermediates A 
through H are in small enough supply, the rate at which 
the entire cycle turns will be restricted. Adding a sup- 
ply of any one of these intermediates will then have a 
dramatic effect on the rate at which the entire cycle 
operates. Thus, it is easy to see how a large number of 
pyruvate molecules can be oxidized, and a great deal 
of oxygen consumed, for every molecule of a citric acid 
cycle intermediate that is added (Figure 13-18). 


Krebs went on to demonstrate that all of the individ- 
ual enzymatic reactions in his postulated cycle took 
place in tissue preparations. Furthermore, the reactions 
occurred at rates high enough to account for the rate 
of pyruvate and oxygen consumption in these tissues. 
Krebs therefore concluded that this series of reactions 
is the major, if not the sole, pathway for the oxida- 
tion of pyruvate—at least in muscle. By fitting together 
pieces of information like a jigsaw puzzle, he arrived at 
a coherent picture of the intricate metabolic processes 
that underlie the oxidation—and took home a share of 
the 1953 Nobel Prize in Physiology or Medicine. 


Figure 13—18 Replenishing the supply of any single 
intermediate has a dramatic effect on the rate at which the 
entire citric acid cycle operates. When the concentrations 

of intermediates are limiting, the cycle turns slowly and 

little pyruvate is used. O2 uptake is low because only small 
amounts of NADH and FADH? are produced to feed oxidative 
phosphorylation (see Figure 13—19). But when a large amount of 
any one intermediate is added, the cycle turns rapidly; more of 
all the intermediates is made, and O2 uptake is high. 
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QUESTION 13-5 


What, if anything, is wrong with 
the following statement? “The 
oxygen consumed during the 
complete oxidation of glucose in 
animal cells is returned as part of 
COz to the atmosphere.” How 
could you support your answer 
experimentally? 


Figure 13-19 Oxidative phosphorylation 
completes the catabolism of food 
molecules and generates the bulk of the 
ATP made by the cell. Electron-bearing 
activated carriers produced by the citric 
acid cycle and glycolysis donate their high- 
energy electrons to an electron-transport 
chain in the inner mitochondrial membrane 
(or in the plasma membrane of aerobic 
prokaryotes). This electron transfer pumps 
protons (H*) across the inner membrane 
(redarrows). The resulting proton gradient 
is then used to drive the synthesis of 

ATP through the process of oxidative 
phosphorylation, as we discuss in detail in 
the next chapter. 


How Cells Obtain Energy from Food 


Electron Transport Drives the Synthesis of the Majority of 
the ATP in Most Cells 


We now return briefly to the final stage in the oxidation of food molecules: 
oxidative phosphorylation. It is in this stage that the chemical energy 
captured by the activated carriers produced during glycolysis and the cit- 
ric acid cycle is used to generate ATP. During oxidative phosphorylation, 
NADH and FADH; transfer their high-energy electrons to the electron- 
transport chain—a series of electron carriers embedded in the inner 
mitochondrial membrane in eukaryotic cells (and in the plasma mem- 
brane of aerobic prokaryotes). As the electrons pass through the series of 
electron acceptor and donor molecules that form the chain, they fall to 
successively lower energy states. At specific sites in the chain, the energy 
released is used to drive protons (H*) across the inner membrane, from 
the mitochondrial matrix to the intermembrane space (see Figure 13-2). 
This movement generates a proton gradient across the inner membrane, 
which serves as a source of energy (like a battery) that can be tapped to 
drive a variety of energy-requiring reactions (discussed in Chapter 12). 
The most prominent of these reactions is the phosphorylation of ADP to 
generate ATP on the matrix side of the inner membrane (Figure 13-19). 


At the end of the transport chain, the electrons are added to molecules of 
O2 that have diffused into the mitochondrion, and the resulting reduced 
oxygen molecules immediately combine with protons from the surround- 
ing solution to produce water (see Figure 13-19). The electrons have now 
reached their lowest energy level, and all the available energy has been 
extracted from the food molecule being oxidized. In total, the complete 
oxidation of a molecule of glucose to H20 and CO2 can produce about 30 
molecules of ATP. In contrast, only two molecules of ATP are produced 
per molecule of glucose by glycolysis alone. 


Oxidative phosphorylation occurs in both eukaryotic cells and in aero- 
bic prokaryotes. It represents a remarkable evolutionary achievement, 
and the ability to extract energy from food with such great efficiency 
has shaped the entire character of life on Earth. In the next chapter, we 
describe the mechanisms behind this game-changing molecular process 
and discuss how it likely arose. 
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REGULATION OF METABOLISM 


A cell is an intricate chemical machine, and our discussion of metabo- 
lism—with a focus on glycolysis and the citric acid cycle—has reviewed 
only a tiny fraction of the many enzymatic reactions that can take place in 
a cell at any time (Figure 13-20). For all these pathways to work together 
smoothly, as is required to allow the cell to survive and to respond to its 
environment, the choice of which pathway each metabolite will follow 
must be carefully regulated at every branch point. 


Many sets of reactions need to be coordinated and controlled. For exam- 
ple, to maintain order within their cells, all organisms need to replenish 
their ATP pools continuously through the oxidation of sugars or fats. 
Yet animals have only periodic access to food, and plants need to sur- 
vive without sunlight overnight, when they are unable to produce sugar 
through photosynthesis. Animals and plants have evolved several ways 
to cope with this transient deprivation. One way is to synthesize food 
reserves in times of plenty that can later be consumed when other energy 
sources are scarce. Thus, depending on conditions, a cell must decide 
whether to route key metabolites into anabolic or catabolic pathways—in 
other words, whether to use them to build other molecules or burn them 
to provide immediate energy. In this section, we discuss how a cell regu- 
lates its intricate web of interconnected metabolic pathways to best serve 
both its immediate and long-term needs. 


Catabolic and Anabolic Reactions Are Organized and 
Regulated 


All the reactions shown in Figure 13-20 occur in a cell that is less than 
0.1 mm in diameter, and each step requires a different enzyme. To add 
to the complexity, the same substrate is often a part of many different 
pathways. Pyruvate, for example, is a substrate for half a dozen or more 
different enzymes, each of which modifies it chemically in a different 
way. We have already seen that the pyruvate dehydrogenase complex 
uses pyruvate to produce acetyl CoA (see Figure 13-10), and that, dur- 
ing fermentation, lactate dehydrogenase can convert pyruvate to lactate 
(see Figure 13-6A). A third enzyme converts pyruvate to the amino acid 
alanine (see Figure 13-14), a fourth to oxaloacetate, and so on. All these 
pathways compete for pyruvate molecules, and similar competitions for 
thousands of other small molecules go on in cells all the time. 


To balance the activities of these interrelated reactions—and to allow 
organisms to adapt swiftly to changes in food availability or energy 
expenditure—an elaborate network of contro! mechanisms regulates and 
coordinates the activity of the enzymes that catalyze the myriad met- 
abolic reactions that go on in a cell. As we discuss in Chapter 4, the 
activity of enzymes can be controlled by covalent modification—such 
as the addition or removal of a phosphate group (see Figure 4—46)—and 
by the binding of small regulatory molecules, often a metabolite (see 
pp. 150-152). Such regulation can either enhance the activity of the 
enzyme or inhibit it. As we see next, both types of regulation—positive 
and negative—control the activity of key enzymes involved in the break- 
down and synthesis of glucose. 


Feedback Regulation Allows Cells to Switch from 
Glucose Breakdown to Glucose Synthesis 
Animals need an ample supply of glucose. Active muscles need glucose 


to power their contraction, and brain cells depend almost exclusively on 
glucose for energy. During periods of fasting or intense physical exercise, 
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Figure 13-20 Glycolysis and the citric 
acid cycle constitute a small fraction 

of the reactions that occur in a cell. In 

this diagram, the filled circles represent 
molecules in some of the best-characterized 
metabolic pathways, and the lines that 
connect them represent the enzymatic 
reactions that convert one metabolite to 
another. The reactions of glycolysis and 

the citric acid cycle are shown in red. Many 
other reactions either lead into these two 
central catabolic pathways—delivering 
small organic molecules to be oxidized for 
energy—or lead outward to the anabolic 
pathways that supply carbon compounds for 
biosynthesis. 


QUESTION 13-6 


A cyclic reaction pathway requires 
that the starting material be 
regenerated and available at the 
end of each cycle. If compounds of 
the citric acid cycle are siphoned off 
as building blocks to make other 
organic molecules via a variety of 
metabolic reactions (see Figure 
13-14), why does the citric acid 
cycle not quickly grind to a halt? 
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Figure 13-21 Gluconeogenesis uses 
specific enzymes to bypass those steps in 
glycolysis that are essentially irreversible. 
The enzyme phosphofructokinase 

catalyzes the phosphorylation of 

fructose 6-phosphate to form fructose 1, 
6-bisphosphate in step 3 of glycolysis. This 
reaction is so energetically favorable that 
the enzyme will not work in reverse (see 
step 3 in Panel 13-1). To produce fructose 
6-phosphate during gluconeogenesis, 

the enzyme fructose 1,6-bisphosphatase 
removes the phosphate from fructose 
1,6-bisphosphate in a simple hydrolysis 
reaction. Coordinated feedback regulation 
of these two enzymes helps a cell control 
the flow of metabolites toward glucose 
synthesis or glucose breakdown. 


How Cells Obtain Energy from Food 


the body's glucose reserves get used up faster than they can be replen- 
ished from food. One way to increase available glucose is to synthesize it 
from pyruvate by a process called gluconeogenesis. 


Gluconeogenesis is, in many ways, a reversal of glycolysis: it builds 
glucose from pyruvate, whereas glycolysis breaks down glucose and pro- 
duces pyruvate. Indeed, gluconeogenesis makes use of many of the same 
enzymes as glycolysis; it simply runs them in reverse. For example, the 
isomerase that converts glucose 6-phosphate to fructose 6-phosphate in 
step 2 of glycolysis will readily catalyze the reverse reaction (see Panel 
13-1, pp. 436-437). There are, however, three steps in glycolysis that so 
strongly favor glucose breakdown that they are effectively irreversible: 
steps 1, 3, and 10. To get around these one-way steps, gluconeogenesis 
uses a special set of enzymes that catalyze a set of bypass reactions. In 
step 3 of glycolysis, for example, the enzyme phosphofructokinase uses 
ATP to phosphorylate fructose 6-phosphate, producing the intermedi- 
ate fructose 1,6-bisphosphate. In gluconeogenesis, an enzyme called 
fructose 1,6-bisphosphatase simply removes a phosphate from this inter- 
mediate to generate fructose 6-phosphate (Figure 13-21). 


How does a cell determine whether to synthesize glucose or to 
degrade it? Part of the decision centers on the three irreversible gly- 
colytic reactions. The activity of the enzyme phosphofructokinase, 
for example, is allosterically regulated by the binding of a variety of 
metabolites, which provide both positive and negative feedback regula- 
tion. Such feedback loops, in which a product in a chain of enzymatic 
reactions reduces or stimulates its own production by altering the 
activity of an enzyme earlier in the pathway, regulate many biologi- 
cal processes (see Figure 4-42). Phosphofructokinase is activated by 
by-products of ATP hydrolysis—including ADP, AMP, and inorganic 
phosphate—and it is inhibited by ATP. Thus, when ATP is depleted and 
its metabolic by-products accumulate, phosphofructokinase is turned 
on and glycolysis proceeds, generating ATP; when ATP is abundant, 
the enzyme is turned off and glycolysis shuts down. The enzyme that 
catalyzes the reverse reaction during gluconeogenesis, fructose 1,6- 
bisphosphatase (see Figure 13-21), is regulated by the same molecules 
but in the opposite direction. This enzyme is thus activated when phos- 
phofructokinase is turned off, allowing gluconeogenesis to proceed. 
Many such coordinated feedback mechanisms enable a cell to respond 
rapidly to changing conditions and to adjust its metabolism accordingly. 


Some of the biosynthetic bypass reactions required for gluconeogenesis 
are energetically costly. Reversal of step 10 alone consumes one mol- 
ecule of ATP and one of GTP. Altogether, producing a single molecule of 
glucose by gluconeogenesis consumes four molecules of ATP and two 
molecules of GTP. Thus a cell must tightly regulate the balance between 
glycolysis and gluconeogenesis. If both processes were to proceed simul- 
taneously, they would shuttle metabolites back and forth in a futile cycle 
that would consume large amounts of energy and generate heat for no 
purpose. 
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fructose 6-phosphate fructose 1,6-bisphosphate 


Cells Store Food Molecules in Special Reservoirs to 
Prepare for Periods of Need 


As we have seen, gluconeogenesis is a costly process, requiring sub- 
stantial amounts of energy from the hydrolysis of ATP and GTP. During 
periods when food is scarce, this expensive way of producing glucose 
is suppressed if alternatives are available. Fasting cells, for example, 
can mobilize glucose that has been stored in the form of glycogen, 
a branched polymer of glucose (Figure 13-22A and see Panel 2-4, 
pp. 72-73). This large polysaccharide is stored as small granules in the 
cytoplasm of many animal cells, but mainly in liver and muscle cells 
(Figure 13-22B). The synthesis and degradation of glycogen occur by 
separate metabolic pathways, which can be rapidly and coordinately 
regulated to suit an organism's needs. When more ATP is needed than 
can be generated from food-derived molecules available in the blood- 
stream, cells break down glycogen in a reaction that is catalyzed by the 
enzyme glycogen phosphorylase. This enzyme produces glucose 1-phos- 
phate, which is then converted to the glucose 6-phosphate that feeds into 
the glycolytic pathway (Figure 13-22C). 


Like glycolysis and gluconeogenesis, the glycogen degradative and syn- 
thetic pathways are coordinated by feedback regulation. In this case, 
enzymes in each pathway are allosterically regulated by glucose 6-phos- 
phate, but in opposite directions: in the synthetic pathway, glycogen 
synthetase is activated by glucose 6-phosphate, whereas glycogen phos- 
phorylase, which breaks down glycogen (see Figure 13-22C), is inhibited 
by glucose 6-phosphate, as well as by ATP. This regulation helps to pre- 
vent glycogen breakdown when ATP is plentiful and to favor glycogen 
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Figure 13-22 Animal cells store 
glucose in the form of glycogen to 
provide energy in times of need. 

(A) The structure of glycogen; starch 

in plants is a very similar branched 
polymer of glucose, but has many 
fewer branch points. (B) An electron 
micrograph showing glycogen granules 
in the cytoplasm of a liver cell; each 
granule contains both glycogen and 
the enzymes required for glycogen 
synthesis and breakdown. (C) The 
enzyme glycogen phosphorylase 
breaks down glycogen when cells need 
more glucose. (B, courtesy of Robert 
Fletterick and Daniel S. Friend, by 
permission of E.L. Bearer.) 
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Figure 13-23 Fats are stored in the form 
of lipid droplets in cells. (A) Fat droplets 
(stained red) in the cytoplasm of developing 
adipocytes. (B) Lipid droplets (red) in yeast 
cells, which also use them as a reservoir of 
energy and building blocks for membrane 
lipid biosynthesis. (A, courtesy of Peter 
Tontonoz and Ronald M. Evans; B, courtesy 
of Sepp D. Kohlwein.) 


QUESTION 13-7 


After looking at the structures of 
sugars and fatty acids (discussed 

in Chapter 2), give an intuitive 
explanation as to why oxidation of 

a sugar yields only about half as 
much energy as the oxidation of an 
equivalent dry weight of a fatty acid. 


Figure 13-24 Some plant seeds serve as 
important foods for humans. Corn, nuts, 
and peas all contain rich stores of starch and 
fats, which provide the plant embryo in the 
seed with energy and building blocks for 
biosynthesis. (Courtesy of the John Innes 
Foundation.) 


synthesis when the glucose 6-phosphate concentration is high. The bal- 
ance between glycogen synthesis and breakdown is further regulated 
by intracellular signaling pathways that are controlled by the hormones 
insulin, epinephrine, and glucagon (see Table 16-1, p. 536 and Figure 
16-21). 


Quantitatively, fat is a far more important storage material than glycogen, 
in part because the oxidation of a gram of fat releases about twice as 
much energy as the oxidation of a gram of glycogen. Moreover, glycogen 
binds a great deal of water, producing a sixfold difference in the actual 
mass of glycogen required to store the same amount of energy as fat. 
An average adult human stores enough glycogen for only about a day 
of normal activity, but we store enough fat to last nearly a month. If our 
main fuel reserves had to be carried as glycogen instead of fat, our body 
weight would need to be increased by an average of about 60 pounds 
(nearly 30 kilograms). 


Fats are generally stored as droplets of water-insoluble triacylglycerols 
inside cells (Figure 13-23 and see Figure 13-11). Most animal species 
possess specialized fat-storing cells called adipocytes. In response to 
hormonal signals, fatty acids can be released from these depots into the 
bloodstream for other cells to use as required. Such a need arises after 
a period of not eating. Even a normal overnight fast results in the mobi- 
lization of fat: in the morning, most of the acetyl CoA that enters the 
citric acid cycle is derived from fatty acids rather than from glucose. After 
a meal, however, most of the acetyl CoA entering the citric acid cycle 
comes from glucose derived from food, and any excess glucose is used to 
make glycogen or fat. (Although animal cells can readily convert sugars 
to fats, they cannot convert fatty acids to sugars.) 


The food reserves in both animals and plants form a vital part of the 
human diet. Plants convert some of the sugars they make through photo- 
synthesis during daylight into fats and into starch, a branched polymer 
of glucose very similar to animal glycogen. The fats in plants are triacyl- 
glycerols, as they are in animals, and they differ only in the types of fatty 
acids that predominate (see Figures 2-21 and 2-22). 


The embryo inside a plant seed must live on stored food reserves for a 
long time, until the seed germinates to produce a plant with leaves that 
can harvest the energy in sunlight. The embryo uses these food stores as 
sources of energy and of small molecules to build cell walls and to syn- 
thesize many other biological molecules as it develops. For this reason, 
plant seeds often contain especially large amounts of fats and starch— 
which make them a major food source for animals, including ourselves 
(Figure 13-24). Germinating seeds convert the stored fat and starch into 
glucose as needed. 
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Figure 13-25 Plant cells store both starch and fats in their chloroplasts. 

An electron micrograph of a single chloroplast in a plant cell shows the starch 
granules and fat droplets that have been synthesized in the organelle. (Courtesy 
of K. Plaskitt.) 


In plant cells, fats and starch are both stored in chloroplasts—specialized 
organelles that carry out photosynthesis (Figure 13-25). These energy- 
rich molecules serve as food reservoirs that are mobilized by the cell 
to produce ATP in mitochondria during periods of darkness. In the next 
chapter, we take a closer look at chloroplasts and mitochondria, and 
review the elaborate mechanisms by which they harvest energy from 
sunlight and from food. 


ESSENTIAL CONCEPTS 


e Food molecules are broken down in successive steps, in which energy 
is captured in the form of activated carriers such as ATP and NADH. 


e In plants and animals, these catabolic reactions occur in different 
cell compartments: glycolysis in the cytosol, the citric acid cycle in 
the mitochondrial matrix, and oxidative phosphorylation on the inner 
mitochondrial membrane. 


e During glycolysis, the six-carbon sugar glucose is split to form two 
molecules of the three-carbon sugar pyruvate, producing small 
amounts of ATP and NADH. 


e In the presence of oxygen, eukaryotic cells convert pyruvate into 
acetyl CoA plus CO; in the mitochondrial matrix. The citric acid cycle 
then converts the acetyl group in acetyl CoA to CO2 and H20, cap- 
turing much of the energy released as high-energy electrons in the 
activated carriers NADH and FADH2. 


e Fatty acids produced from the digestion of fats are also imported into 
mitochondria and converted to acetyl CoA molecules, which are then 
further oxidized through the citric acid cycle. 


e Inthe mitochondrial matrix, NADH and FADH; pass their high-energy 
electrons to an electron-transport chain in the inner mitochondrial 
membrane, where a series of electron transfers is used to drive the 
formation of ATP. Most of the energy captured during the breakdown 
of food molecules is harvested during this process of oxidative phos- 
phorylation (described in detail in Chapter 14). 


e Many intermediates of glycolysis and the citric acid cycle are 
starting points for the anabolic pathways that lead to the synthesis 
of proteins, nucleic acids, and the many other organic molecules of 
the cell. 
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e The thousands of different reactions carried out simultaneously by a 
cell are regulated and coordinated by positive and negative feedback, 
enabling the cell to adapt to changing conditions; such feedback reg- 
ulation, for example, allows a cell to switch from glucose breakdown 
to glucose synthesis when food is scarce. 


e Glucose is stored in animal cells as glycogen, whereas plant cells 
store glucose as starch; both animal and plant cells store fatty acids 
as fats (triacylglycerols). The food reserves stored by plants are major 
sources of food for animals, including humans. 


KEY TERMS 


acetyl CoA fermentation 
ADP, ATP GDP, GTP 
anabolic pathway gluconeogenesis 
catabolism glucose 
cell respiration glycogen 
citric acid cycle glycolysis 
electron-transport chain NAD*, NADH 
FAD, FADH2 oxidative phosphorylation 
fat pyruvate 
feedback regulation starch 
QUESTIONS 
QUESTION 13-8 which of the two conditions could you expect the cells to 


: 3 : 
The complete oxidation of sugar molecules by the cell proliferate bettert Explain pout answer 


takes place according to the general reaction 
C6H120¢ (glucose) + 602 — 6CO2 + 6H20 + energy. 
Which of the following statements are correct? Explain During movement, muscle cells require large amounts of 
your answers. ATP to fuel their contractile apparatus. These cells contain 
high levels of creatine phosphate (Figure Q13-11), which 
has a standard free-energy change (AG) for hydrolysis of 
B. None of the produced energy is in the form of heat. its phosphate bond of -43 kJ/mole. Why is this a useful 
compound to store energy? Justify your answer with the 
information shown in Figure 13-8. 


QUESTION 13-11 


A. All of the energy produced is in the form of heat. 


C. The energy is produced by a process that involves the 
oxidation of carbon atoms. 


D. The reaction supplies the cell with essential water. 


E. In cells, the reaction takes place in more than one step. O | I 


F. Many steps in the oxidation of sugar molecules involve 7 
reaction with oxygen gas. ne) H CH; H 
G. Some organisms carry out the reverse reaction. 

creatine phosphate 


H. Some cells that grow in the absence of O2 produce CO2. 
Figure Q13-11 


QUESTION 13-9 


An exceedingly sensitive instrument (yet to be devised) 
shows that one of the carbon atoms in Charles Darwin’s last QUESTION 13-12 
breath is resident in your bloodstream, where it forms part 
of a hemoglobin molecule. Suggest how this carbon atom 
might have traveled from Darwin to you, and list some of 
the molecules it could have entered en route. 


Identical pathways that make up the complicated sequence 
of reactions of glycolysis, shown in Panel 13-1 (pp. 436- 
437), are found in most living cells, from bacteria to humans. 
One could envision, however, countless alternative chemical 
reaction mechanisms that would allow the oxidation of 
sugar molecules and that could, in principle, have evolved to 
Yeast cells can proliferate both in the presence of O2 take the place of glycolysis. Discuss this fact in the context 
(aerobically) and in its absence (anaerobically). Under of evolution. 


QUESTION 13-10 


QUESTION 13-13 


An animal cell, roughly cubical in shape with a side length 
of 10 um, uses 10? ATP molecules every minute. Assume 
that the cell replaces this ATP by the oxidation of glucose 
according to the overall reaction 602 + C4H1204 > 

6CO2 + 6H20 and that complete oxidation of each glucose 
molecule produces 30 ATP molecules. How much oxygen 
does the cell consume every minute? How long will it take 
before the cell has used up an amount of oxygen gas equal 
to its own volume? (Recall that one mole of a gas has a 
volume of 22.4 liters.) 


QUESTION 13-14 


Under the conditions existing in the cell, the free energies 
of the first few reactions in glycolysis (in Panel 13-1, 
pp. 436-437) are: 


step 1 AG = -33.5 kJ/mole 
step 2 AG = -2.5 kJ/mole 
step 3 AG = -22.2 kJ/mole 
step 4 AG = -1.3 kJ/mole 


Are these reactions energetically favorable? Using these 
values, draw to scale an energy diagram (A) for the overall 
reaction and (B) for the pathway composed of the four 
individual reactions. 
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QUESTION 13-15 


The chemistry of most metabolic reactions was deciphered 
by synthesizing metabolites containing atoms that are 
different isotopes from those occurring naturally. The 
products of reactions starting with isotopically labeled 
metabolites can be analyzed to determine precisely which 
atoms in the products are derived from which atoms in 
the starting material. The methods of detection exploit, 
for example, the fact that different isotopes have different 
masses that can be distinguished using biophysical 
techniques such as mass spectrometry. Moreover, some 
isotopes are radioactive and can therefore be readily 
recognized with electronic counters or photographic film 
that becomes exposed by radiation. 


A. Assume that pyruvate containing radioactive 14C 

in its carboxyl group is added to a cell extract that can 
support oxidative phosphorylation. Which of the molecules 
produced should contain the vast majority of the '4C that 
was added? 


B. Assume that oxaloacetate containing radioactive 14C in 
its keto group (refer to Panel 13-2, pp. 442-443) is added 
to the extract. Where should the '4C atom be located after 
precisely one turn of the citric acid cycle? 


QUESTION 13-16 


In cells that can proliferate both aerobically and 
anaerobically, fermentation is inhibited in the presence of 
O2. Suggest a reason for this observation. 
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Energy Generation in 
Mitochondria and Chloroplasts 


The fundamental need to generate energy efficiently has had a profound 
influence on the history of life on Earth. Much of the structure, function, 
and evolution of cells and organisms can be traced to their quest for 
energy. Oxygen did not appear in the atmosphere until more than a bil- 
lion years after the first cells appeared on Earth. It is therefore thought 
that the earliest cells may have produced ATP by breaking down organic 
molecules that had been generated by geochemical processes. Such fer- 
mentation reactions, discussed in Chapter 13, can occur in the cytosol 
of present-day cells, when they use the energy derived from the partial 
oxidation of energy-rich food molecules to form ATP. 


But very early in the history of life, a much more efficient mechanism 
for generating energy and synthesizing ATP appeared—one based on the 
transport of electrons along membranes. This mechanism is so central 
to the survival of life on Earth that we devote this entire chapter to it. 
Membrane-based electron transport first appeared in bacteria more than 
3 billion years ago, and the progeny of these pioneering cells currently 
crowd every crevice of our planet's land and oceans in a wild menagerie 
of living forms. Perhaps most remarkably, remnants of these energy- 
generating electron-transport systems can be found in the bacterial 
descendants that labor within living eukaryotic cells: chloroplasts and 
mitochondria. 


In this chapter, we consider the molecular mechanisms that enable elec- 
tron-transport systems to generate the energy that cells need to survive. 
We begin with a brief overview of the general principles central to the 
generation of energy in all living things: the use of a membrane to har- 
ness the energy of moving electrons. We describe how such processes 
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Figure 14-1 Membrane-based 
mechanisms use the energy provided 
by food or sunlight to generate ATP. In 

oxidative phosphorylation, which occurs in 

mitochondria, an electron-transport system energy 
uses energy derived from the oxidation of from food 
food to generate a proton (H+) gradient 
across a membrane. In photosynthesis, 
which occurs in chloroplasts, an electron- 
transport system uses energy derived from 
the sun to generate a proton gradient 
across a membrane. In both cases, this 
proton gradient is then used to drive ATP 
synthesis. ELECTRON TRANSFER 


PUMPS PROTONS 
ACROSS MEMBRANE 


energy from 


H sunlight 


photosynthesis membrane 
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operate in both mitochondria and chloroplasts, and we review the chemi- 
cal principles that allow the transfer of electrons to release large amounts 
of energy. Finally, we trace the evolutionary pathways that most likely 
gave rise to these marvelous mechanisms. 


Cells Obtain Most of Their Energy by a Membrane- 
based Mechanism 


The main chemical energy currency in cells is ATP (see Figure 3-31). 
Although small amounts of ATP are generated during glycolysis in the cell 
cytosol (discussed in Chapter 13), most of the ATP needed by cells is pro- 
duced by oxidative phosphorylation. The generation of ATP by oxidative 
phosphorylation differs from the way ATP is produced during glycolysis, 
in that it requires a membrane-bound compartment. In eukaryotic cells, 
oxidative phosphorylation takes place in mitochondria, and it depends 
on an electron-transport process that drives the transport of protons (H*) 
across the inner mitochondrial membrane. A related membrane-based 
process produces ATP during photosynthesis in plants, algae, and photo- 
synthetic bacteria (Figure 14-1). 


This membrane-based process for making ATP consists of two linked 
stages: one sets up an electrochemical proton gradient, and the other 
uses that gradient to generate ATP. Both stages are carried out by special 
protein complexes embedded in a membrane. 


1. In stage 1, high-energy electrons—derived from the oxidation of 
food molecules (discussed in Chapter 13) or from sunlight or other 
chemical sources (discussed later)—are transferred along a series 
of electron carriers, called an electron-transport chain, embedded 
in a membrane. These electron transfers release energy that is used 
to pump protons, derived from the water that is ubiquitous in cells, 
across the membrane and thus generate an electrochemical proton 
gradient (Figure 14-2A). An ion gradient across a membrane is a 
form of stored energy that can be harnessed to do useful work when 
the ions are allowed to flow back across the membrane, down their 
electrochemical gradient (discussed in Chapter 12). 

2. In stage 2, protons flow back down their electrochemical gradient 
through a membrane-embedded protein complex called ATP synthase, 
which catalyzes the energy-requiring synthesis of ATP from ADP and 
inorganic phosphate (Pj). This ubiquitous enzyme functions like a 
turbine that couples the movement of protons across the membrane 
to the production of ATP (Figure 14—2B). 
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STAGE 1: ENERGY OF ELECTRON 


When it was first proposed in 1961, this mechanism for generating energy 
was called the chemiosmotic hypothesis, because it linked the chemical 
bond-forming reactions that synthesize ATP (“chemi-”) with the mem- 
brane transport processes that pump protons (“osmotic,” from the Greek 
osmos, “to push”). Thanks to this chemiosmotic mechanism, now known 
as chemiosmotic coupling, cells can harness the energy of electron 
transfers in much the same way that the energy stored in a battery can be 
harnessed to do useful work (Figure 14-3). 


Chemiosmotic Coupling Is an Ancient Process, 
Preserved in Present-Day Cells 


The membrane-based, chemiosmotic mechanism for making ATP arose 
very early in life’s history, more than 3 billion years ago. The exact same 
type of ATP-generating processes occur in the plasma membrane of 
modern bacteria and archaea. The mechanism was so successful that its 
essential features have been retained in the long evolutionary journey 
from early prokaryotes to present-day cells. 


The remarkable resemblance of the mechanism in prokaryotes and 
eukaryotes can be attributed in part to the fact that the organelles that 
produce ATP in eukaryotic cells—the chloroplasts and mitochondria— 
evolved from bacteria that were engulfed by ancestral cells more than a 
billion years ago (see Figures 1-19 and 1-21). As evidence of their bacte- 
rial ancestry, both chloroplasts and mitochondria reproduce in a manner 


Figure 14-3 Batteries can use the energy of electron transfer to 
perform work. (A) If a battery's terminals are directly connected to 
each other, the energy released by electron transfer is all converted 
into heat. (B) If the battery is connected to a pump, much of the energy 
released by electron transfer can be harnessed to do work instead 

(in this case, to pump water). Cells can similarly harness the energy 

of electron transfer to do work—for example, pumping H* across a 
membrane (see Figure 14—2A). 


Figure 14-2 Membrane-based 

systems use the energy stored in an 
electrochemical proton gradient to 
synthesize ATP. The process occurs in two 
stages. (A) In the first stage, a proton pump 
harnesses the energy of electron transfer 
(described later in the chapter) to pump 
protons (H*) across a membrane, creating 
a proton gradient. The blue arrow shows 
the direction of electron movement. These 
high-energy electrons can come from 
organic or inorganic molecules, or they 
can be produced by the action of light on 
special molecules such as chlorophyll. The 
protons are derived from water, which is 
ubiquitous in the aqueous environment of 
the cell. (B) The proton gradient produced 
in (A) serves as a versatile energy store 
that can be used to drive a variety of 
energy-requiring reactions in mitochondria, 
chloroplasts, and prokaryotes—most 
importantly, the synthesis of ATP by ATP 
synthase. 


QUESTION 14-1 


Dinitrophenol (DNP) is a small 
molecule that renders membranes 
permeable to protons. In the 
1940s, small amounts of this highly 
toxic compound were given to 
patients to induce weight loss. 

DNP was effective in melting away 
the pounds, especially promoting 
the loss of fat reserves. Can you 
explain how it might cause such 
loss? As an unpleasant side reaction, 
however, patients had an elevated 
temperature and sweated profusely 
during the treatment. Provide an 
explanation for these symptoms. 


all chemical 
energy from 
electron transfer 
is converted to 
heat energy 


A chemical energy 
from electron 
transfer is converted 
to the potential 
energy stored ina 
difference in water 
levels; less energy 
is therefore lost 

as heat energy 
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(A) 


Figure 14-4 A mitochondrion can divide like a bacterium. (A) It 
undergoes a fission process that is conceptually similar to bacterial 
division. (B) An electron micrograph of a dividing mitochondrion in Aes. PES 
a liver cell. (B, courtesy of Daniel S. Friend, by permission of E.L. Bearer.) (B) l 
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similar to that of most prokaryotes (Figure 14-4). The organelles also har- 
bor bacterial-like biosynthetic machinery for making RNA and proteins, 
and they possess DNA-based genomes (Figure 14-5). Not surprisingly, 
many chloroplast genes are strikingly similar to those of cyanobacteria— 
the photosynthetic bacteria from which these organelles were derived. 


Although mitochondria and chloroplasts retain their own genomes, the 
bacteria from which they arose gave up many of the genes required for 
independent living as they developed their symbiotic relationships with 
eukaryotic animal and plant cells. Many of these jettisoned genes were 
not lost, however; they were relocated to the cell nucleus, where they 
continue to direct the production of proteins that mitochondria and chlo- 
roplasts import to carry out their specialized functions—including the 
generation of ATP. 
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Figure 14-5 Mitochondria and chloroplasts share many of the features of 
their bacterial ancestors. Both organelles contain their own DNA-based genome 
and the machinery to replicate this DNA and to make RNA and protein. The inner 
compartments of these organelles—the mitochondrial matrix and the chloroplast 
stroma—contain the DNA (red) and a special set of ribosomes. Membranes in 
both organelles—the mitochondrial inner membrane and the chloroplast thylakoid 
membrane—contain the protein complexes involved in ATP production. 


Mitochondria and Oxidative Phosphorylation 


MITOCHONDRIA AND OXIDATIVE 
PHOSPHORYLATION 


Mitochondria are present in nearly all eukaryotic cells, where they pro- 
duce the bulk of the cell’s ATP. Without mitochondria, eukaryotes would 
have to rely on the relatively inefficient process of glycolysis for all of 
their ATP production. When glucose is converted to pyruvate by glyco- 
lysis in the cytosol, the net result is that only two molecules of ATP are 
produced per glucose molecule, which is less than 10% of the total free 
energy potentially available from oxidizing the sugar. By contrast, about 
30 molecules of ATP are produced when mitochondria are recruited to 
complete the oxidation of glucose that begins in glycolysis. Had ances- 
tral cells not established the relationship with the bacteria that gave rise 
to modern mitochondria, it seems unlikely that complex multicellular 
organisms could have evolved. 


The importance of mitochondria is further highlighted by the dire conse- 
quences of mitochondrial dysfunction. Defects in the proteins required for 
electron transport, for example, are responsible for an inherited disorder 
called myoclonic epilepsy and ragged red fiber disease (MERRF). Because 
muscle and nerve cells need large amounts of ATP to function normally, 
individuals with this condition typically experience muscle weakness, 
heart problems, epilepsy, and often dementia. 


MERFF, like many of the disorders that affect mitochondrial function, 
stems from mutations that disable genes present in mitochondrial DNA 
(see Figure 14-5). Because mitochondria are passed down from mother 
to child (sperm mitochondria are lost after fertilization), such mutations 
are transmitted by the egg. To prevent the transmission of these life- 
threatening defects, reproductive biologists have developed methods for 
removing the nucleus from an egg that carries faulty mitochondria and 
transferring it to a donor egg that has healthy mitochondria. Although a 
baby boy produced using this form of mitochondrial replacement therapy 
was born in 2016, the approach remains controversial, in part because 
the effects of having genetic material from three “parents’—mother, 
father, and mitochondrial donor—are unknown. 


In this section, we review the structure and function of mitochondria. 
We outline how this organelle makes use of an electron-transport chain, 
embedded in its inner membrane, to generate the proton gradient needed 
to drive the synthesis of ATP. And we consider the overall efficiency with 
which this membrane-based system converts the energy stored in food 
molecules into the energy contained in the phosphate bonds of ATP. 


Mitochondria Are Dynamic in Structure, Location, and 
Number 


Isolated mitochondria are generally similar in appearance to their bac- 
terial ancestors. Inside a cell, however, mitochondria are remarkably 
adaptable and can adjust their location, shape, and number to suit that 
particular cell’s needs. In some cell types, mitochondria remain fixed in 
one location, where they supply ATP directly to a site of unusually high 
energy consumption. In a heart muscle cell, for example, mitochondria 
are located close to the contractile apparatus, whereas in a sperm they 
are wrapped tightly around the motile flagellum (Figure 14-6). In other 
cells, mitochondria fuse to form elongated, tubular networks, which are 
diffusely distributed through the cytoplasm (Figure 14-7). These networks 
are dynamic, continually breaking apart by fission (see Figure 14-4) and 
fusing again (Movie 14.1 and Movie 14.2). 
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Figure 14-6 Mitochondria are located near sites of high ATP utilization. (A) In a cardiac muscle cell, mitochondria 
are located close to the contractile apparatus, where ATP hydrolysis provides the energy for contraction. The 
structure of the contractile apparatus is discussed in Chapter 17. (B) An electron micrograph of cardiac muscle shows 
a preponderance of mitochondria. (C) In a sperm, mitochondria are located in the tail, wrapped around a portion of 
the motile flagellum that requires ATP for its movement. The internal structure of the flagellar core is discussed in 
Chapter 17. (D) Micrograph showing a flagellum that has been thinly sliced to reveal the internal core structure as 
well as the surrounding mitochondria. (B, Keith Porter papers, Center for Biological Sciences Archives, University of 
Maryland, Baltimore Country; D, from W. Bloom and D.W. Fawcett, A Textbook of Histology, 10th ed. Philadelphia: 
W.B. Saunders Company, 1975. Reprinted with permission from the Estate of D.W. Fawcett.) 


Mitochondria are present in large numbers—1000 to 2000 in a liver cell, 
for example. But their numbers vary depending on the cell type and can 
change with the energy needs of the cell. In skeletal muscle cells that 
have been repeatedly stimulated to contract, mitochondria can divide 
until their numbers increase five- to tenfold. Marathon runners, for exam- 
ple, may have twice the volume of mitochondria in their leg muscles than 
do individuals who are more sedentary. 


Regardless of their varied appearance, location, and number, however, 
all mitochondria have the same basic internal structure—a design that 
supports the efficient production of ATP, as we see next. 


A Mitochondrion Contains an Outer Membrane, an 
Inner Membrane, and Two Internal Compartments 
An individual mitochondrion is bounded by two highly specialized 


membranes—one inside the other. These membranes, called the 
inner and outer mitochondrial membranes, create two mitochondrial 


mitochondria nucleus continuous mitochondrion 


\ 


PA 


Figure 14-7 Mitochondria often form 
elongated, tubular networks, which can 
extend throughout the cytoplasm. 

(A) Mitochondria (red) are fluorescently 
labeled in this cultured mouse fibroblast. 
(B) In a yeast cell, the mitochondria (red) 
form a continuous network, tucked 

against the plasma membrane. (A, courtesy 
of Carl Zeiss Microscopy, LLC; B, from 

J. Nunnari et al., Mol. Biol. Cell 8:1233-1242, 
1997. With permission from The American (A) (B) 

Society for Cell Biology.) 5 um 5 um 
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Matrix. This space contains a highly 
concentrated mixture of hundreds of 
enzymes, including those required for the 
oxidation of pyruvate and fatty acids and 
for the citric acid cycle. 


Inner membrane. Folded into numerous 
cristae, the inner membrane contains the 
proteins that carry out oxidative 
phosphorylation, including the 
electron-transport chain and the ATP 
synthase that makes ATP. It also contains 
transport proteins that move selected 
molecules into and out of the matrix. 4 


Outer membrane. Because it contains large, 
channel-forming proteins (called porins), 
the outer membrane is permeable to all 
molecules of 5000 daltons or less. 


Intermembrane space. This space contains 
several enzymes that use the ATP passing 
out of the matrix to phosphorylate other 
nucleotides. It also contains proteins that 
are released during apoptosis (discussed 
in Chapter 18). 


(A) 


compartments: a large internal space called the matrix and a much nar- 
rower intermembrane space (Figure 14-8). When purified mitochondria 
are gently fractionated into separate components and their contents 
analyzed (see Panel 4-3, pp. 164-165), each of the membranes, and the 
spaces they enclose, are found to contain a unique collection of proteins. 


The outer membrane contains many molecules of a transport protein 
called porin, which forms wide, aqueous channels through the lipid bilayer 
(described in Chapter 11). As a result, the outer membrane is like a sieve 
that is permeable to all molecules of 5000 daltons or less, including small 
proteins. This makes the intermembrane space chemically equivalent to 
the cytosol with respect to the small molecules and inorganic ions it con- 
tains. In contrast, the inner membrane, like other membranes in the cell, 
is impermeable to the passage of ions and most small molecules, except 
where a path is provided by the specific membrane transport proteins 
that it contains. The mitochondrial matrix thus contains only those mol- 
ecules that are selectively transported into the matrix across the inner 
membrane, and its contents are highly specialized. 


The inner mitochondrial membrane is the site of oxidative phosphoryla- 
tion, and it is here that the proteins of the electron-transport chain and 
the ATP synthase required for ATP production are concentrated. This 
membrane is highly convoluted, forming a series of infoldings—known 
as cristae—that project into the matrix space (see Figure 14-8 and Movie 
14.3). These folds greatly increase the surface area of the membrane. In 
a liver cell, the inner membranes of all the mitochondria make up about 
one-third of the total membranes of the cell. 


The Citric Acid Cycle Generates High-Energy Electrons 
Required for ATP Production 


The generation of ATP is powered by the flow of electrons that are 
derived from the burning of carbohydrates, fats, and other foodstuffs dur- 
ing glycolysis and the citric acid cycle. These “high-energy” electrons are 
provided by activated carriers generated during these two sets of cata- 
bolic reactions, with the majority being churned out by the citric acid 
cycle that operates in the mitochondrial matrix (discussed in Chapter 13). 


Figure 14-8 A mitochondrion is organized 
into four separate compartments. 

(A) A schematic drawing and (B) an 
electron micrograph of a mitochondrion. 
Each compartment contains a unique 

set of proteins, enabling it to perform its 
distinct functions. In liver mitochondria, an 
estimated 67% of the total mitochondrial 
protein is located in the matrix, 21% in 

the inner membrane, 6% in the outer 
membrane, and 6% in the intermembrane 
space. (B, courtesy of Daniel S. Friend, by 
permission of E.L. Bearer.) 


QUESTION 14-2 


Electron micrographs show that 
mitochondria in heart muscle have a 
much higher density of cristae than 
mitochondria in skin cells. Suggest 
an explanation for this observation. 
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Figure 14-9 Acetyl CoA is produced in 
the mitochondria. In animal cells and other 
eukaryotes, pyruvate produced during 
glycolysis and fatty acids derived from the 
breakdown of fats enter the mitochondrion 
from the cytosol. Once inside the 
mitochondrial matrix, both of these food- 
derived molecules are converted to acetyl 
CoA and then oxidized to CO2. 


Figure 14-10 NADH donates its 
“high-energy” electrons to an electron- 
transport chain. A hydride ion (a hydrogen 
atom with two electrons, red) is removed 
from NADH and is converted into a proton 
and two electrons (blue). Only the part 

of NADH that carries these high-energy 
electrons is shown; for the complete 
structure and the conversion of NAD* back 
to NADH, see the structure of the closely 
related NADPH in Figure 3-34. Electrons 
are also carried in a similar way by FADH2, 
whose structure is shown in Figure 13-13B. 


CYTOSOL 


The citric acid cycle gets the fuel it needs to produce these activated car- 
riers from food-derived molecules that make their way into mitochondria 
from the cytosol. Both the pyruvate produced by glycolysis and the fatty 
acids derived from the breakdown of fats (see Figure 13-3) can enter 
the mitochondrial intermembrane space through the porins in the outer 
mitochondrial membrane. These fuel molecules are then transported 
across the inner mitochondrial membrane into the matrix, where they 
are converted into the crucial metabolic intermediate, acetyl CoA (Figure 
14-9). The acetyl groups in acetyl CoA are then oxidized to CO2 via the 
citric acid cycle (see Figure 13-12). Some of the energy derived from this 
oxidation is saved in the form of high-energy electrons, held by the acti- 
vated carriers NADH and FADH2. These two activated carriers can then 
donate their electrons to the electron-transport chain in the inner mito- 
chondrial membrane (Figure 14-10). 


The Movement of Electrons Is Coupled to the Pumping 
of Protons 


The chemiosmotic generation of energy begins when the activated car- 
riers NADH and FADH, donate their electrons to the electron-transport 
chain in the inner mitochondrial membrane, becoming oxidized to NAD* 
and FAD, respectively, in the process (see Figure 14-10). The electrons are 
quickly passed along the chain to molecular oxygen (O2) to form water 
(H20). The stepwise movement of these electrons through the compo- 
nents of the electron-transport chain releases energy that can then be 
used to pump protons across the inner mitochondrial membrane (Figure 
14-11). The resulting proton gradient, in turn, is used to drive the synthe- 
sis of ATP. The full sequence of reactions is shown in Figure 14-12. The 
inner mitochondrial membrane thus serves as a device that converts the 
energy contained in the high-energy electrons of NADH (and FADHz2) into 
the phosphate bond of ATP molecules (Figure 14-13). This chemiosmotic 
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mechanism for ATP synthesis is called oxidative phosphorylation 
because it involves both the consumption of O2 and the addition of a 
phosphate group to ADP to form ATP. 


The source of the high-energy electrons that power the proton pump- 
ing differs widely between different organisms and different processes. 
During cell respiration—the energy-generating process that takes place 
in both mitochondria and aerobic bacteria—these electrons are ultimately 
derived from sugars or fats. In photosynthesis, the high-energy electrons 
come from the organic green pigment chlorophyll, which captures energy 
from sunlight. And many single-celled organisms (archaea and bacteria) 
use inorganic substances such as hydrogen, iron, and sulfur as the source 
of the high-energy electrons that they need to make ATP (see, for exam- 
ple, Figure 1-13). 


inner mitochondrial membrane outer mitochondrial membrane 


citric 
acid 
cycle 
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Figure 14-11 As electrons are transferred 
from activated carriers to oxygen, 
protons are pumped across the inner 
mitochondrial membrane. This is stage 1 
of chemiosmotic coupling (see Figure 14-2). 
The path of electron flow is indicated by 
blue arrows. Only the pathway for NADH is 
shown here. 


Figure 14-12 Activated carriers 
generated during the citric acid cycle 
power the production of ATP. Pyruvate 
and fatty acids enter the mitochondrial 
matrix (bottom), where they are converted 
to acetyl CoA. The acetyl CoA is then 
metabolized by the citric acid cycle, which 
produces NADH (and FADHz, not shown). 
During oxidative phosphorylation, high- 
energy electrons donated by NADH (and 
FADHz) are then passed along the electron- 
transport chain in the inner membrane 

and ultimately handed off to oxygen (Oz); 
this electron transport generates a proton 
gradient across the inner membrane, 

which is used to drive the production of 
ATP by ATP synthase. The exact ratios 

of “reactants” and “products” are not 
indicated in this diagram: for example, we 
will see shortly that it requires four electrons 
from NADH molecules to convert O2 to two 
H20 molecules. 
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Figure 14-13 To produce ATP, mitochondria 
catalyze a major conversion of energy. 

In oxidative phosphorylation, the energy 
released by the oxidation of NADH to NAD* 
is harnessed—through energy-conversion 
processes in the inner mitochondrial 
membrane—to drive the energy-requiring 
phosphorylation of ADP to form ATP. The 
net equation for this process, in which four 
electrons pass from NADH to oxygen, is 
2NADH + O; + 2H* — 2NAD* + 2H20. A 
smaller amount of ATP is similarly generated 
from energy released by the oxidation of 
FADH»? to FAD (not shown). 


Figure 14-14 High-energy electrons are 
transferred through three respiratory 
enzyme complexes in the inner 
mitochondrial membrane. The relative 
size and shape of each complex are 
indicated, but the numerous individual 
protein components that form each 
complex are not. During the transfer of 
electrons from NADH to oxygen (blue 
lines), protons derived from water are 
pumped across the membrane from the 
matrix into the intermembrane space 

by each of the complexes (Movie 14.4). 
Ubiquinone (Q) and cytochrome c (c) serve 
as mobile carriers that ferry electrons from 
one complex to the next. 
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Regardless of the electron source, the vast majority of living organisms 
use a chemiosmotic mechanism to generate ATP. In the following sec- 
tions, we describe in detail how this process occurs. 


Electrons Pass Through Three Large Enzyme Complexes 
in the Inner Mitochondrial Membrane 


The electron-transport chain—or respiratory chain—that carries out 
oxidative phosphorylation is present in many copies in the inner mito- 
chondrial membrane. Each chain contains over 40 proteins, grouped into 
three large respiratory enzyme complexes. These complexes each con- 
tain multiple individual proteins, including transmembrane proteins that 
anchor the complex firmly in the inner mitochondrial membrane. 


The three respiratory enzyme complexes, in the order in which they 
receive electrons, are (1) NADH dehydrogenase complex, (2) cytochrome c 
reductase complex, and (3) cytochrome c oxidase complex (Figure 14-14). 
Each complex contains metal ions and other chemical groups that act 
as stepping stones to enable the passage of electrons through the com- 
plex. The movement of electrons through these respiratory complexes is 
accompanied by the pumping of protons from the mitochondrial matrix 
to the intermembrane space. Thus each complex can be thought of as a 
proton pump. 


The first respiratory complex in the chain, NADH dehydrogenase, accepts 
electrons from NADH. These electrons are extracted from NADH in the 
form of a hydride ion (H7), which is then converted into a proton and two 
high-energy electrons (see Figure 14-10). That reaction, H — H* + 26, 
is catalyzed by the NADH dehydrogenase complex itself. After passing 
through this complex, the electrons move along the chain to each of the 
other enzyme complexes in turn, using mobile electron carriers to ferry 
them between the complexes (see Figure 14-14). This transfer of elec- 
trons is energetically favorable: the electrons are passed from electron 
carriers with a weaker electron affinity to those with a stronger electron 
affinity, until they combine with a molecule of O2 to form water. The final 
electron transfer is the only oxygen-requiring step in cell respiration, and 
it consumes nearly all of the oxygen that we breathe. 


Proton Pumping Produces a Steep Electrochemical Proton 
Gradient Across the Inner Mitochondrial Membrane 


Without a mechanism for harnessing the energy released by the ener- 
getically favorable transfer of electrons from NADH to Oz, this energy 
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would simply be liberated as heat. Cells are able to recover much of 
this energy because each of the respiratory enzyme complexes in the 
electron-transport chain uses it to pump protons across the inner mito- 
chondrial membrane, from the matrix into the intermembrane space (see 
Figure 14-14). Later, we will outline the molecular mechanisms involved. 
For now, we focus on the consequences of this nifty maneuver. First, the 
pumping of protons generates an H+ gradient—or pH gradient—across 
the inner membrane. As a result, the pH in the matrix (around 7.9) is 
about 0.7 unit higher than it is in the intermembrane space (which is 7.2, 
the same pH as the cytosol). Second, proton pumping generates a volt- 
age gradient—or membrane potential—across the inner membrane; as 
H+ flows outward, the matrix side of the membrane becomes negative 
and the side facing the intermembrane space becomes positive. 


As discussed in Chapter 12, the force that drives the passive flow of an ion 
across a membrane is proportional to the ion’s electrochemical gradient. 
The strength of that electrochemical gradient depends both on the volt- 
age across the membrane, as measured by the membrane potential, and 
on the ion’s concentration gradient (see Figure 12-5). Because protons 
are positively charged, they will more readily cross a membrane if there 
is an excess of negative charge on the other side. In the case of the inner 
mitochondrial membrane, the pH gradient and membrane potential work 
together to create a steep electrochemical proton gradient that makes it 
energetically very favorable for H* to flow back into the mitochondrial 
matrix. The membrane potential contributes significantly to this proton- 
motive force, which pulls H* back across the membrane; the greater the 
membrane potential, the more energy is stored in the proton gradient 
(Figure 14-15). 


ATP Synthase Uses the Energy Stored in the 
Electrochemical Proton Gradient to Produce ATP 


If protons in the intermembrane space were simply allowed to flow back 
into the mitochondrial matrix, the energy stored in the electrochemi- 
cal proton gradient would be lost as heat. Such a seemingly wasteful 
process allows hibernating bears to stay warm, as we discuss further in 
How We Know (pp. 476-477). In most cells, however, the electrochemi- 
cal proton gradient across the inner mitochondrial membrane is used to 
drive the synthesis of ATP from ADP and Pj (see Figure 2-27). The device 
that makes this possible is ATP synthase, a large, multisubunit protein 
embedded in the inner mitochondrial membrane. 


ATP synthase is of ancient origin; the same enzyme generates ATP in the 
mitochondria of animal cells, the chloroplasts of plants and algae, and 


Figure 14-15 The electrochemical H* 
gradient across the inner mitochondrial 
membrane includes a large force due 

to the membrane potential (AV) and a 
smaller force due to the H* concentration 
gradient—that is, the pH gradient (ApH). 
The intermembrane space is slightly more 
acidic than the matrix, because the higher 
the concentration of protons, the more 
acidic the solution (see Panel 2-2, 

pp. 68-69). Both the membrane potential 
and the pH gradient combine to generate 
the proton-motive force, which pulls H+ 
back into the mitochondrial matrix. The 
exact, mathematical relationship between 
these forces is expressed by the Nernst 
equation (see Figure 12-24). 


QUESTION 14-3 


When the drug dinitrophenol (DNP) 
is added to mitochondria, the inner 
membrane becomes permeable 

to protons (H*). In contrast, when 
the drug nigericin is added to 
mitochondria, the inner membrane 
becomes permeable to K*. (A) How 
does the electrochemical proton 
gradient change in response to 
DNP? (B) How does it change in 
response to nigericin? 


465 


466 


CHAPTER 14 


Figure 14-16 ATP synthase 

acts like a motor to convert the 
energy of protons flowing down 
their electrochemical gradient 

to chemical-bond energy in ATP. 
(A) The multisubunit protein is 
composed of a stationary head, 
called the Fı ATPase, and a rotating 
portion called Fo. Both Fy and Fo 
are formed from multiple subunits. 
Driven by the electrochemical 
proton gradient, the Fo part of 

the protein—which consists of the 
transmembrane H* carrier (blue) plus 
a central stalk (dark purple)—spins 
rapidly within the stationary head of 
the F, ATPase (green), causing it to 
generate ATP from ADP and Pi. The 
stationary head is secured to the 
inner membrane by an elongated 
protein “arm” called the peripheral 
stalk (orange). The F; ATPase is so 
named because it can carry out the 
reverse reaction—the hydrolysis of 
ATP to ADP and P;—when detached 
from the Fo portion of the complex. 
(B) The three-dimensional structure 
of ATP synthase, as determined by 
x-ray crystallography. The peripheral 
stalk is anchored to the membrane 
with the help of an additional 
subunit (light purple). At its other 
end, this stalk is attached to the F4 
ATPase head via a small red subunit. 
Movie 14.6 shows how the ATP 
synthase proteins are organized into 
mitochondrial cristae. 
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the plasma membrane of bacteria and archaea. The part of the protein 
that catalyzes the phosphorylation of ADP is shaped like a lollipop head 
that projects into the mitochondrial matrix; it is penetrated by a central 
stalk that is attached to a transmembrane H* carrier (Figure 14-16). The 
passage of protons through the carrier causes the carrier and its stalk to 
spin rapidly, like a tiny motor. As the stalk rotates, it rubs against proteins 
in the enzyme'’s stationary head, altering their conformation and causing 
them to produce ATP. In this way, a mechanical deformation gets con- 
verted into the chemical-bond energy of ATP (Movie 14.5). This fine-tuned 
sequence of interactions allows ATP synthase to produce more than 100 
molecules of ATP per second—3 molecules of ATP per revolution. 


ATP synthase can also operate in reverse—using the energy of ATP 
hydrolysis to pump protons “uphill,” against their electrochemical gra- 
dient (Figure 14-17). In this mode, ATP synthase functions like the H+ 
pumps described in Chapter 12. Whether ATP synthase primarily makes 
ATP—or consumes it to pump protons—depends on the magnitude of 
the electrochemical proton gradient across the membrane in which the 
enzyme is embedded. In many bacteria that can grow either aerobically or 
anaerobically, the direction in which the ATP synthase works is routinely 
reversed when the bacterium runs out of O2. Under these conditions, the 
ATP synthase uses some of the ATP generated inside the cell by glycolysis 
to pump protons out of the cell, creating the proton gradient that the bac- 
terial cell needs to import its essential nutrients by coupled transport. A 
proton gradient is similarly used to drive the transport of small molecules 
in and out of the mitochondrial matrix, as we discuss next. 


The Electrochemical Proton Gradient Also Drives 
Transport Across the Inner Mitochondrial Membrane 


The synthesis of ATP is not the only process driven by the electrochemical 
proton gradient in mitochondria. Many small, charged molecules, such 
as pyruvate, ADP, and inorganic phosphate (Pj), are imported into the 
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mitochondrial matrix from the cytosol, while others, such as ATP, must be 
transported in the opposite direction. Carrier proteins that bind some of 
these molecules couple their transport to the energetically favorable flow 
of H* into the matrix (see the “coupled transporters” in Figure 12-15). 
Pyruvate and Pj, for example, are each co-transported inward, along with 
protons, as the protons move down their electrochemical gradient into 
the matrix. 


Other transporters take advantage of the membrane potential generated 
by the electrochemical proton gradient, which makes the matrix side of 
the inner mitochondrial membrane more negatively charged than the 
side that faces the intermembrane space (see Figure 14-15). A special 
antiport carrier protein exploits this voltage gradient to export ATP from 
the mitochondrial matrix and to bring ADP in. This exchange allows the 
ATP synthesized in the mitochondrion to be exported rapidly, which is 
important for energizing the rest of the cell (Figure 14-18). 


The electrochemical proton gradient is also required for the transloca- 
tion of proteins across the inner mitochondrial membrane and into the 
matrix. As mentioned earlier, although mitochondria retain their own 
genome—and synthesize some of their own proteins—most of the pro- 
teins required for mitochondrial function are made in the cytosol and 
must be actively imported into the organelle. We discuss this transport 
process—which requires energy supplied by the electrochemical proton 
gradient as well as ATP hydrolysis—in Chapter 15. 


In eukaryotic cells, therefore, the electrochemical proton gradient is 
used to drive both the generation of ATP and the transport of selected 
metabolites and proteins across the inner mitochondrial membrane. In 
bacteria, the proton gradient across the plasma membrane is similarly 
used to drive ATP synthesis and metabolite transport. But it also serves 
as an important source of directly usable energy: in motile bacteria, for 
instance, the flow of protons into the cell drives the rapid rotation of the 
bacterial flagellum, which propels the bacterium along (Movie 14.8). 


The Rapid Conversion of ADP to ATP in Mitochondria 
Maintains a High ATP/ADP Ratio in Cells 
As a result of the nucleotide exchange shown in Figure 14-18, ADP mol- 


ecules—produced by hydrolysis of ATP in the cytosol—are rapidly drawn 
back into mitochondria for recharging, while the bulk of the ATP molecules 


Figure 14-17 ATP synthase is a reversible 
coupling device. The protein can either 

(A) synthesize ATP by harnessing the 
electrochemical H* gradient or (B) pump 
protons against this gradient by hydrolyzing 
ATP. The direction of operation (and of 
rotation) at any given instant depends on 
the net free-energy change (AG, discussed 
in Chapter 3) for the coupled processes 

of H* translocation across the membrane 
and the synthesis of ATP from ADP and Pi. 
For example, if the electrochemical proton 
gradient falls below a certain level, the 

AG for H* transport into the matrix will 

no longer be large enough to drive ATP 
production; instead, ATP will be hydrolyzed 
by the ATP synthase to rebuild the proton 
gradient. A tribute to the activity of this 
all-important protein complex is shown in 
Movie 14.7. 


QUESTION 14—4 


The remarkable properties that 
allow ATP synthase to run in either 
direction allow the interconversion 
of energy stored in the H* gradient 
and energy stored in ATP to proceed 
in either direction. (A) If ATP 
synthase making ATP can be likened 
to a water-driven turbine producing 
electricity, what would be an 
appropriate analogy when it works 
in the opposite direction? (B) Under 
what conditions would one expect 
the ATP synthase to stall, running 
neither forward nor backward? 

(C) What determines the direction in 
which the ATP synthase operates? 
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Figure 14-18 The electrochemical proton 
gradient across the inner mitochondrial 
membrane is used to drive some coupled 
transport processes. The charge on each 
of the transported molecules is indicated for 
comparison with the membrane potential, 
which is negative inside, as shown. Pyruvate 
and inorganic phosphate (Pi) are moved 
into the matrix along with protons, as the 
protons move down their electrochemical 
gradient. Both are negatively charged, 

so their movement is opposed by the 
negative membrane potential; however, 
the H* concentration gradient—the pH 
gradient—is harnessed in a way that 
nevertheless drives their inward transport. 
ADP is pumped into the matrix and ATP 

is pumped out by an antiport process 

that uses the voltage gradient across the 
membrane to drive the exchange. The 
outer mitochondrial membrane is freely 
permeable to all of these compounds due 
to the presence of porins in the membrane 
(not shown). The active transport of 
molecules across membranes by carrier 
proteins and the generation of a membrane 
potential are discussed in Chapter 12. 
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produced in mitochondria are exported into the cytosol, where they are 
most needed. (A small amount of ATP is used within the mitochondrion 
itself to power DNA replication, protein synthesis and translocation, and 
other energy-consuming reactions that occur there.) With all of this back- 
and-forth, a typical ATP molecule in a human cell will shuttle out of a 
mitochondrion, then back in as ADP, more than once every minute. 


As discussed in Chapter 3, most biosynthetic enzymes drive energetically 
unfavorable reactions by coupling them to the energetically favorable 
hydrolysis of ATP (see Figure 3-32). The pool of ATP in a cell is thus used 
to drive a huge variety of cell processes in much the same way that a 
battery is used to drive an electric engine. To serve as a readily available 
energy source, the concentration of ATP in the cytosol must be kept about 
10 times higher than that of ADP. When the activity of mitochondria is 
halted, ATP levels fall dramatically and the cell's battery runs down. 
Eventually, energetically unfavorable reactions can no longer take place 
and the cell dies. The poison cyanide, which blocks electron transport in 
the inner mitochondrial membrane, causes cell death in exactly this way. 


Cell Respiration Is Amazingly Efficient 


The oxidation of sugars to produce ATP may seem unnecessarily com- 
plex. Surely the process could be accomplished more directly—perhaps 
by eliminating the citric acid cycle or some of the steps in the respiratory 
chain. Such simplification would certainly make the chemistry easier to 
learn—but it would not be as helpful for the cell. As discussed in Chapter 
13, the oxidative pathways that allow cells to extract energy from food 
in a usable form involve many intermediates, each differing only slightly 
from its predecessor. In this way, the huge amount of energy locked up in 
food molecules can be parceled out into small packets that can be cap- 
tured in activated carriers such as NADH and FADH; (see Figure 13-1). 


Much of the energy carried by NADH and FADH; is ultimately converted 
into the bond energy of ATP. How much ATP each of these activated carri- 
ers can produce depends on several factors, including where its electrons 
enter the respiratory chain. The NADH molecules produced in the mito- 
chondrial matrix during the citric acid cycle pass their high-energy 
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electrons to the NADH dehydrogenase complex—the first complex in the 
chain. As the electrons pass from one enzyme complex to the next, they 
promote the pumping of protons across the inner mitochondrial mem- 
brane. In this way, each NADH molecule provides enough net energy to 
generate about 2.5 molecules of ATP (see Question 14-5 and its answer). 


FADH? molecules, on the other hand, bypass the NADH dehydrogenase 
complex and pass their electrons to the membrane-embedded mobile 
carrier ubiquinone (see Figure 14-14). Because these electrons enter 
further down the respiratory chain than those donated by NADH, they 
promote the pumping of fewer protons: each molecule of FADH; thus 
produces only 1.5 molecules of ATP. Table 14—1 provides a full account- 
ing of the ATP produced by the complete oxidation of glucose. 


Although the biological oxidation of glucose to CO2 and H20 consists 
of many interdependent steps, the overall process—known as cell 
respiration—is remarkably efficient. Almost 50% of the total energy that 
could be released by burning sugars or fats is captured and stored in 
the phosphate bonds of ATP during cell respiration. That might not seem 
impressive, but it is considerably better than most nonbiological energy- 
conversion devices. Electric motors and gasoline engines operate at about 
10-20% efficiency. If cells operated at this efficiency, an organism would 
have to eat voraciously just to maintain itself. Moreover, because the 
wasted energy is liberated as heat, large organisms (including humans) 
would need far better mechanisms for cooling themselves. It is hard to 
imagine how animals could have evolved without the elaborate yet eco- 
nomical mechanisms that allow cells to extract a maximum amount of 
energy from food. 


MOLECULAR MECHANISMS OF ELECTRON 
TRANSPORT AND PROTON PUMPING 


For many years, biochemists struggled to understand why electron- 
transport chains had to be embedded in membranes to function in ATP 
production. The puzzle was essentially solved in the 1960s, when it was 
discovered that transmembrane proton gradients drive the process. The 
concept of chemiosmotic coupling was so novel, however, that it was not 
widely accepted until more than a decade later, when experiments with 
artificial energy-generating systems put the power of proton gradients to 
the test, as described in How We Know (pp. 476-477). 


Although investigators are still unraveling some of the details of chemi- 
osmotic coupling at the atomic level, the fundamentals are now clear. In 
this section, we examine the basic principles that underlie the movement 
of electrons, and we explain in molecular detail how electron transport 
can drive the generation of a proton gradient. Because very similar mech- 
anisms are used by mitochondria, chloroplasts, and prokaryotes, these 
principles apply to nearly all living things. 


Protons Are Readily Moved by the Transfer of Electrons 


Although protons resemble other positive ions such as Na* and K* in the 
way they move across membranes, in some respects they are unique. 
Hydrogen atoms are by far the most abundant atom in living organisms: 
they are plentiful not only in all carbon-containing biological molecules 
but also in the water molecules that surround them. The protons in water 
are highly mobile: by rapidly dissociating from one water molecule and 
then associating with its neighbor, they can quickly flit through a hydro- 
gen-bonded network of water molecules (see Figure 2—15B). Thus water, 
which is everywhere in cells, serves as a ready reservoir for the donation 


TABLE 14-1 PRODUCT YIELDS 
FROM GLUCOSE OXIDATION 


Direct Final ATP 
Product Yield per 
Glucose 
Glycolysis 2 NADH 34 
(cytosolic) 
2 ATP 2 
Pyruvate 2 NADH 5 
oxidation (mitochondrial 
to acetyl matrix) 
CoA 
(two per 
glucose) 
Complete 6 NADH 15 
oxidation (mitochondrial 
of the matrix) 
acetyl 
group 2 FADH2 3 
of acetyl 
CoA AGP 2 
(two per 
glucose) 
TOTAL 30 


*NADH produced in the cytosol 

yields fewer ATP molecules than 

NADH produced in the mitochondrial 
matrix because the mitochondrial 

inner membrane is impermeable to 
NADH. Transporting NADH into the 
mitochondrial matrix—where it can pass 
electrons to NADH dehydrogenase— 
thus requires energy. 


QUESTION 14-5 


Calculate the number of usable 
ATP molecules produced per pair 
of electrons transferred from 
NADH to oxygen if (i) five protons 
are pumped across the inner 
mitochondrial membrane for each 
electron passed through the three 
respiratory enzyme complexes, 

(ii) three protons must pass through 
the ATP synthase for each ATP 
molecule that it produces from ADP 
and inorganic phosphate inside the 
mitochondrion, and (iii) one proton 
is used to produce the voltage 
gradient needed to transport 

each ATP molecule out of the 
mitochondrion to the cytosol 

where it is used. 
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Figure 14-19 Electron transfers can 
cause the movement of entire hydrogen 
atoms, because protons are readily 
accepted from or donated to water. In 
these examples, (A) an oxidized electron 
carrier molecule, X, picks up an electron 
plus a proton when it is reduced, and 

(B) a reduced electron carrier molecule, Y, 
loses an electron plus a proton when it is 
oxidized. 
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and acceptance of protons. These nomadic protons often accompany the 
electrons that are transferred during oxidation and reduction. An isolated 
electron (e~) bears a negative charge. But when a molecule is reduced by 
acquiring an electron, in many cases, this charge is immediately neutral- 
ized by the addition of a proton from water. Thus the net effect of the 
reduction is to transfer an entire hydrogen atom, H* + e (Figure 14-19A). 


Similarly, when a molecule is oxidized, it often loses an electron belong- 
ing to one of its hydrogen atoms: in most instances, when this electron is 
transferred to an electron carrier, the proton that is left behind is passed 
on to water (Figure 14-19B). Therefore, ina membrane in which electrons 
are being passed along an electron-transport chain, it is a relatively sim- 
ple matter, in principle, to move protons from one side of the membrane 
to the other. All that is required is that the electron carrier be oriented in 
the membrane in such a way that it accepts an electron—along with a 
proton from water—on one side of the membrane, and then releases a 
proton on the other side of the membrane when it passes an electron on 
to the next electron carrier molecule in the chain (Figure 14-20). 


The Redox Potential Is a Measure of Electron Affinities 


The proteins of the respiratory chain guide the electrons so that they 
move sequentially from one enzyme complex to the next. Each of these 
electron transfers is an oxidation-reduction reaction: as described in 
Chapter 3, the molecule or atom donating the electron becomes oxidized, 
while the receiving molecule or atom becomes reduced (see pp. 87-88). 
These reactions are necessarily coupled: electrons removed from one 
molecule are always passed to another, so that whenever one molecule 
is oxidized, another is reduced. 


Like any other chemical reaction, the tendency of such oxidation-reduc- 
tion reactions, or redox reactions, to proceed spontaneously depends 
on the free-energy change (AG) for the electron transfer, which in turn 
depends on the relative electron affinities of the participating molecules. 
Electrons will pass spontaneously from molecules that have a relatively 
low affinity for some of their electrons, and thus lose them easily, to mol- 
ecules that have a higher affinity for electrons. For example, NADH has a 
low electron affinity, so that its electrons are readily passed to the NADH 
dehydrogenase complex (see Figure 14-14). The batteries that power our 
electronic gadgets are based on similar electron transfers between chem- 
ical substances with different electron affinities. 


Because electron transfers provide most of the energy in living things, 
it is worth taking time to understand them. We saw in Chapter 2 that 
molecules that donate protons are known as acids; those that accept pro- 
tons are called bases (see Panel 2-2, pp. 68-69). These molecules exist 
in conjugate acid-base pairs, in which the acid is readily converted into 
the base by the loss of a proton. For example, acetic acid (CH3COOH) is 
converted into its conjugate base (CH3COO)) in the reaction 


CH3COOH = CH3COO- + H* 


Figure 14-20 The orientation of a membrane-embedded electron 
carrier allows electron transfer to drive proton pumping. As 

an electron passes along an electron-transport chain, it can bind 

and release a proton at each step. In this schematic diagram, the 
electron carrier, protein B, picks up a proton (H*) from one side of the 
membrane when it accepts an electron (e~) from protein A; protein 

B releases the proton to the other side of the membrane when it 
donates its electron to the electron carrier, protein C. In this example, 
the transfer of a single electron thereby pumps the equivalent of one 
proton across a membrane. 
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In a similar way, pairs of compounds such as NADH and NAD¢* are called 
redox pairs, because NADH is converted to NAD* by the loss of electrons 
in the reaction 


NADH = NAD+ + H+ + 2e 


NADH is a strong electron donor. Its electrons can be said to be held at 
“high energy” because the AG for passing them to many other molecules 
is highly favorable. Conversely, because it is difficult to produce the high- 
energy electrons in NADH, its partner, NAD*, is a weak electron acceptor. 


The tendency for a redox pair such as NADH/NAD* to donate or accept 
electrons can be determined experimentally by measuring its redox 
potential (Panel 14-1, p. 472). The lower the redox potential, the lower 
the molecules’ affinity for electrons—and the more likely they are to act 
as electron donors. Redox potentials are expressed in units of volts, as 
for a standard battery. 


Electrons will move spontaneously from a redox pair with a low redox 
potential (or low affinity for electrons), such as NADH/NAD*, to a redox 
pair with a high redox potential (or high affinity for electrons), such as 
O2/H20. Thus, NADH is an excellent molecule to donate electrons to the 
respiratory chain, while Oz is well suited to act as an electron “sink” at the 
end of the pathway. As explained in Panel 14-1, the difference in redox 
potential, AE}, is a direct measure of the standard free-energy change 
(AG°) for the transfer of an electron from one molecule to another. 


Electron Transfers Release Large Amounts of Energy 


The amount of energy that can be released by an electron transfer can 
be determined by comparing the redox potentials of the molecules 
involved. Again, let’s look at the transfer of electrons from NADH and 
to O2. As shown in Panel 14-1, a 1:1 mixture of NADH and NAD+ has 
a redox potential of -320 mV, indicating that NADH has a weak affinity 
for electrons—and a strong tendency to donate them; a 1:1 mixture of 
H20 and '402 has a redox potential of +820 mV, indicating that O2 has 
a strong affinity for electrons—and a strong tendency to accept them. 
The difference in redox potential between these two pairs is 1.14 volts 
(1140 mV), which means that the transfer of each electron from NADH 
to Oz under these standard conditions is enormously favorable: the AG° 
for that electron transfer is -109.6 kJ/mole per electron—or -219.2 kJ/ 
mole for the two electrons that are donated from each NADH molecule 
(see Panel 14-1). If we compare this free-energy change with that needed 
for the formation of the terminal phosphoanhydride bond of ATP in cells 
(about 54 kJ/mole), we see that enough energy is released by the oxidiza- 
tion of one NADH molecule to synthesize several molecules of ATP. 


Living systems could have evolved enzymes that would allow NADH to 
donate electrons directly to O2 to make water. But because of the huge 
drop in free energy, this reaction would proceed with almost explosive 
force and nearly all of the energy would be released as heat. Instead, as 
we have seen, the transfer of electrons from NADH to O2 is made in many 
small steps along the electron-transport chain, enabling nearly half of the 
released energy to be stored in the proton gradient across the inner mito- 
chondrial membrane rather than getting lost to the environment as heat. 


Metals Tightly Bound to Proteins Form Versatile Electron 
Carriers 


Each of the three respiratory enzyme complexes includes metal atoms 
that are tightly bound to the proteins. Once an electron has been donated 
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HOW REDOX POTENTIALS ARE MEASURED 


One beaker (/eft) contains substance A with an equimolar 
mixture of the reduced (Areducea) and oxidized (Aoxidized) 
members of its redox pair. The other beaker contains the 
hydrogen reference standard (2H+ + 2e- = H2), whose redox 
eT potential is arbitrarily assigned as zero by international 
, O; a agreement. (A salt bridge formed from a concentrated KCI 
solution allows K+ and Cl- to move between the beakers as 
salt bridge required to neutralize the charges when electrons flow 
between the beakers.) The metal wire (dark blue) provides a 
resistance-free path for electrons, and a voltmeter then 
measures the redox potential of substance A. If electrons flow 
from Areduced to H+, as indicated here, the redox pair formed 
by substance A is said to have a negative redox potential. If 
they instead flow from H2 to Aoxidized, the redox pair is said to 
Areir end Beles 1 M H+ and have a positive redox potential. 


in equimolar amounts 1 atmosphere H2 gas 


voltmeter 


: standard redox 
examples of redox reactions 


. 1 
THE STANDARD REDOX POTENTIAL, E; potential £, 
. i NADH = NAD* + H* + 2e -320 mV 

The standard redox potential for a redox pair, 
defined as E,, is measured for a standard state reduced oxidized + z 

i chs ey epee + 2H* + 2e +30 mV 
where all of the reactants are at a concentration of ubiquinone ubiquinone 
1 M, including H*. Since biological reactions occur at reduced _, oxidized, N 
pH 7, biologists instead define the standard state as cytochrome c ~ cytochrome c +230 m 
Areduced = Aoxidized and H+ = 1077 M. This standard 
redox potential is designated by the symbol Eù, in HO = 20, + 2H" + 2e° +820 mV 
place of Eù. 


CALCULATION OF AG” FROM EFFECT OF CONCENTRATION CHANGES 


REDOX POTENTIALS 
To determine the energy change for an electron As explained in Chapter 3 (see p. 93), the actual 
transfer, the AG° of the reaction (kJ/mole) is free-energy change for a reaction, AG, depends on the 
calculated as follows: concentrations of the reactants and generally will be 
AG? = —n(0.096) AE’, where n is the number of different from the standard free-energy change, AG°. 
electrons seanstericd across a redox potential The standard redox potentials are for a 1:1 mixture of 
change of AE’, millivolts (mV), and the redox pair. For example, the standard redox 
0 ' 


potential of -320 mV is for a 1:1 mixture of NADH and 


AE; = E (acceptor) - E,(donor) NAD+. But when there is an excess of NADH over NADF, 


EXAMPLE: electron transfer from NADH to an electron acceptor 
o becomes more favorable. This is reflected by a more 
Va (A) ~ negative redox potential and a more negative AG for 


electron transfer. 


oxidized 
ubiquinone 
reduced 
biguinoné excess NADH standard 1:1 excess NAD 
Meld mixture 
1:1 mixture of 1:1 mixture of oxidized ( 7 
NADH and NAD* and reduced ubiquinone e o © O e O O O O 
For the transfer of one electron from NADH to e O e © e O Oo © O O e O 
ubiquinone: , a eO o © e O 2” O O 
AE; = +30 - (-320) = +350 mV O 
+ 
AG? = -n(0.096)AE; = -1 (0.096)(350) = -34 kJ/mole ae NAD db T 
A similar calculation reveals that the transfer of one 
electron from ubiquinone to oxygen has an even more ager es electron standard be ee electron 
t tenti t 
favorable AG? of -76 kJ/mole. The AG° value for the (dere negate E') ise ore inore positive E') 


transfer of one electron from NADH to oxygen is the 
sum of these two values, -110 kJ/mole. 
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to a respiratory complex, it can move within the complex by skipping 
from one embedded metal ion to another ion with an even greater affin- 
ity for electrons. 


When electrons pass from one respiratory complex to the next, in con- 
trast, they are ferried by electron carriers that can diffuse freely within or 
along the lipid bilayer. These mobile molecules pick up electrons from 
one complex and deliver them to the next in line. In the mitochondrial 
respiratory chain, for example, a small, hydrophobic molecule called 
ubiquinone picks up electrons from the NADH dehydrogenase complex 
and delivers them to the cytochrome c reductase complex (see Figure 
14-14). A related quinone functions similarly during electron transport in 
photosynthesis. A ubiquinone molecule can accept or donate either one 
or two electrons, and it picks up one H+ from water with each electron 
that it carries (Figure 14-21). Its redox potential of +30 mV places ubiqui- 
none between the NADH dehydrogenase complex and the cytochrome 
c reductase complex in terms of its tendency to gain or lose electrons— 
which explains why ubiquinone receives electrons from the former and 
donates them to the latter (Figure 14-22). Ubiquinone also serves as the 
entry point for electrons donated by the FADH; that is generated both 
during the citric acid cycle and from fatty acid oxidation (see Figures 
13-11 and 13-12). 


The redox potentials of different metal complexes influence where they 
are used along the electron-transport chain. Iron-sulfur centers have rel- 
atively low affinities for electrons and thus are prominent in the electron 
carriers that operate in the early part of the chain. An iron-sulfur center 
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Figure 14-21 Quinones carry electrons 
within the lipid bilayer. The quinone in the 
mitochondrial electron-transport chain is 
called ubiquinone. It picks up one H* from 
the aqueous environment for every electron 
it accepts, and it can carry two electrons as 
part of its hydrogen atoms (red). When this 
reduced ubiquinone donates its electrons to 
the next carrier in the chain, the protons are 
released. Its long, hydrophobic hydrocarbon 
tail confines ubiquinone to the inner 
mitochondrial membrane. 


QUESTION 14-6 


At many steps in the electron- 
transport chain, Fe ions are used 
as part of heme or FeS clusters to 
bind the electrons in transit. Why 
do these functional groups that 
carry out the chemistry of electron 
transfer need to be bound to 
proteins? Provide several reasons 
why this is necessary. 


Figure 14-22 Redox potential increases 
along the mitochondrial electron- 
transport chain. The biggest increases in 
redox potential occur across each of the 
three respiratory enzyme complexes, which 
allows each of them to pump protons. 
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Figure 14-23 The iron in a heme group 
can serve as an electron acceptor. 

(A) Ribbon structure showing the position 
of the heme group (red) associated with 


cytochrome c (green). (B) The porphyrin ring 


of the heme group (light red) is attached 
covalently to side chains in the protein. 
The heme groups of different cytochromes 
have different electron affinities because 
they differ slightly in structure and are held 
in different local environments within each 
protein. 


QUESTION 14-7 


Two different diffusible electron 
carriers, ubiquinone and cytochrome 
c, shuttle electrons between the 
three protein complexes of the 
electron-transport chain. Could the 
same diffusible carrier, in principle, 
be used for both steps? Explain your 
answer. 


COOH COOH 


(A) (B) 


in the NADH dehydrogenase complex, for example, passes electrons to 
ubiquinone. Later in the pathway, iron atoms that are held in the heme 
groups bound to cytochrome proteins are commonly used as electron 
carriers (Figure 14-23). These heme groups give cytochromes, such as 
the cytochrome c reductase and cytochrome c oxidase complexes, their 
color (“cytochrome” from the Greek chroma, “color”). Like other electron 
carriers, the cytochrome proteins increase in redox potential the further 
down the mitochondrial electron-transport chain they are located. For 
example, cytochrome c, a small protein that accepts electrons from the 
cytochrome c reductase complex and transfers them to the cytochrome c 
oxidase complex, has a redox potential of +230 mV—a value about mid- 
way between those of the cytochromes with which it interacts (see Figure 
14-22). 


Cytochrome c Oxidase Catalyzes the Reduction of 
Molecular Oxygen 


Cytochrome c oxidase, the final electron carrier in the respiratory chain, 
has the highest redox potential of all. This protein complex removes 
electrons from cytochrome c, thereby oxidizing it—hence the name 
“cytochrome c oxidase.” The exceptionally high electron affinity stems in 
part from a special oxygen-binding site within cytochrome c oxidase that 
contains a heme group plus a copper atom (Figure 14-24). It is here that 
nearly all the oxygen we breathe is consumed, when the electrons that 
had been donated by NADH at the start of the electron-transport chain 
are handed off to O2 to produce H20. 


In total, four electrons donated by cytochrome c and four protons 
extracted from the aqueous environment are added to each O2 mole- 
cule in the reaction 46 + 4H* + O2 — 2H20. In addition to the protons 
that combine with O2, four other protons are pumped across the mem- 
brane during the transfer of the four electrons from cytochrome c to O2. 
This pumping occurs because the transfer of electrons drives allosteric 
changes in the conformation of cytochrome c oxidase that cause protons 
to be ejected from the mitochondrial matrix (Figure 14-25). 


Oxygen is useful as an electron sink because of its very high affinity for 
electrons. However, once Oz picks up one electron, it forms the superox- 
ide radical O7; this radical is dangerously reactive and will avidly take up 
another three electrons wherever it can find them, a tendency that can 
cause serious damage to nearby DNA, proteins, and lipid membranes. 
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Figure 14-24 Cytochrome c oxidase is a finely tuned protein machine. The protein is a dimer formed from a 
monomer with 13 different protein subunits. (A) The entire protein is shown positioned in the inner mitochondrial 
membrane. The three colored subunits that form the functional core of the complex are encoded by the 
mitochondrial genome; the remaining subunits are encoded by the nuclear genome. (B) As electrons pass through 
this protein on the way to its bound O2 molecule, they cause the protein to pump protons across the membrane. 
As indicated, a heme and a copper atom (Cu) form the site where a tightly bound O2 molecule will receive four 
electrons to produce H20. Only two of the 13 subunits are shown. 


The active site of cytochrome c oxidase therefore holds on tightly to 
an oxygen molecule until it receives all four of the electrons needed to 
convert it to two molecules of H20. This retention is critical, because it 
prevents superoxide radicals from attacking macromolecules through- 
out the cell—a type of damage that has been postulated to contribute to 
human aging. 


The evolution of cytochrome c oxidase allowed cells to use O2 as an elec- 
tron acceptor, and this protein complex is essential for all aerobic life. 
Poisons such as cyanide are extremely toxic because they bind tightly to 
cytochrome c oxidase complexes, thereby halting electron transport and 
the production of ATP. 
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Figure 14-25 Proton pumping is coupled to electron transport. This type of mechanism is thought to be used by the NADH 
dehydrogenase complex and by cytochrome c oxidase, as well as by many other proton pumps. The protein is driven through a cycle 
of three conformations. In one of these conformations, the protein has a high affinity for H+, causing it to pick up an H* on the matrix 
side of the membrane. In another conformation, the protein has a low affinity for H*, causing it to release an H* on the other side of the 
membrane. As indicated, the cycle goes only in one direction—releasing the proton into the intermembrane space—because one of 
the steps is driven by allosteric change in conformation driven by the energetically favorable transport of electrons. 


” HOW WE KNOW 


HOW CHEMIOSMOTIC COUPLING DRIVES ATP SYNTHESIS 


In 1861, Louis Pasteur discovered that yeast cells grow 
and divide more vigorously when air is present—the first 
demonstration that aerobic metabolism is more efficient 
than anaerobic metabolism. His observations make 
sense now that we know that oxidative phosphorylation 
is a much more efficient means of generating ATP than 
is glycolysis, producing about 30 molecules of ATP for 
each molecule of glucose oxidized, compared with the 2 
ATPs generated by glycolysis alone. But it took another 
hundred years for researchers to determine that it is 
the process of chemiosmotic coupling—using proton 
pumping to power ATP synthesis—that allows cells to 
generate energy with such efficiency. 


Imaginary intermediates 


In the 1950s, many researchers believed that the oxi- 
dative phosphorylation that takes place in mitochondria 
generated ATP via a mechanism similar to that used in 
glycolysis. During glycolysis, ATP is produced when a 
molecule of ADP receives a phosphate group directly 
from a “high-energy” intermediate. Such substrate-level 
phosphorylation occurs in steps 7 and 10 of glyco- 
lysis, where the high-energy phosphate groups from 
1,3-bisphosphoglycerate and _ phosphoenolpyruvate, 
respectively, are transferred to ADP to form ATP (see 
Panel 13-1, pp. 436-437). It was assumed that the elec- 
tron-transport chain in mitochondria would similarly 
generate some phosphorylated intermediate that could 
then donate its phosphate group directly to ADP. This 
assumption inspired a long and frustrating search for 
this mysterious high-energy intermediate. Investigators 
occasionally claimed to have discovered the missing 
intermediate, but the compounds turned out to be either 
unrelated to electron transport or, as one researcher put 
it in a review of the history of bioenergetics, “products of 
high-energy imagination.” 
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Harnessing the force 


It wasn’t until 1961 that Peter Mitchell suggested that the 
“high-energy intermediate” his colleagues were seeking 
was, in fact, the electrochemical proton gradient gen- 
erated by the electron-transport system. His proposal, 
dubbed the chemiosmotic hypothesis, stated that the 
energy of an electrochemical proton gradient formed 
during the transfer of electrons through the electron- 
transport chain could be tapped to drive ATP synthesis. 


Several lines of evidence offered support for Mitchell’s 
proposed mechanism. First, it was known that mito- 
chondria do generate an electrochemical proton gradient 
across their inner membrane. But what does this gradi- 
ent—also called the proton-motive force—actually do? 
If the gradient is required to drive ATP synthesis, as the 
chemiosmotic hypothesis posits, then either disrupting 
the inner membrane or eliminating the proton gradient 
across it should inhibit ATP production. In fact, research- 
ers found both these predictions to be true. Physical 
disruption of the inner mitochondrial membrane halts 
ATP synthesis in that organelle. Similarly, dissipation of 
the proton gradient by a chemical “uncoupling” agent 
such as 2,4-dinitrophenol (DNP) also inhibits mitochon- 
drial ATP production. Such gradient-busting chemicals 
carry H* across the inner mitochondrial membrane, 
forming a shuttle system for the movement of H* that 
bypasses the ATP synthase that generates ATP (Figure 
14-26). In this way, compounds such as DNP uncouple 
electron transport from ATP synthesis. As a result of this 
short-circuiting, the proton-motive force is dissipated 
completely, and the organelle can no longer make ATP. 


Such uncoupling occurs naturally in some specialized 
fat cells. In these cells, called brown fat cells, most of 
the energy from the oxidation of fat is dissipated as 
heat rather than being converted into ATP. The inner 


Figure 14-26 Uncoupling agents are H* carriers that can insert into the inner mitochondrial membrane. They render the 
membrane permeable to protons, allowing H* to flow into the mitochondrial matrix without passing through ATP synthase. This 
short-circuit effectively uncouples electron transport from ATP synthesis. 


Molecular Mechanisms of Electron Transport and Proton Pumping 


membranes of the mitochondria in these cells contain a 
carrier protein that allows protons to move down their 
electrochemical gradient, circumventing ATP synthase. 
As a result, the cells oxidize their fat stores at a rapid 
rate and produce much more heat than ATP. Tissues 
containing brown fat serve as biological heating pads, 
helping to revive hibernating animals and to protect 
sensitive areas of newborn human babies (such as the 
backs of their necks) from the cold. 


Artificial ATP generation 


If disrupting the electrochemical proton gradient across 
the mitochondrial inner membrane terminates ATP syn- 
thesis, then, conversely, generating an artificial proton 
gradient should stimulate ATP synthesis. Again, this 
is exactly what happens. When a proton gradient is 
imposed artificially by lowering the pH on the outside of 
the mitochondrial inner membrane, out pours ATP. 


How does the electrochemical proton gradient drive 
ATP production? This is where the ATP synthase comes 
in. In 1974, Efraim Racker and Walther Stoeckenius 
demonstrated that they could assemble an artificial 
ATP-generating system by combining an ATP synthase 
isolated from the mitochondria of cow heart muscle with 
a proton pump purified from the purple membrane of 
the archaean Halobacterium halobium. As discussed in 
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Chapter 11, the plasma membrane of this prokaryote is 
packed with bacteriorhodopsin, a protein that pumps H+ 
out of the cell in response to sunlight (see Figure 11-28). 


When bacteriorhodopsin alone was reconstituted 
into artificial lipid vesicles (liposomes), Racker and 
Stoeckenius showed that, in the presence of light, the 
protein pumps Ht into the vesicles, generating a proton 
gradient. (The orientation of the protein is reversed in 
these membranes, so that protons are transported into 
the vesicles; in the organism, protons are pumped out.) 
When the bovine ATP synthase was then incorporated 
into these vesicles, much to the amazement of many bio- 
chemists, the system catalyzed the synthesis of ATP from 
ADP and inorganic phosphate in response to light. This 
ATP formation showed an absolute dependence on an 
intact proton gradient, as either eliminating bacteriorho- 
dopsin from the system or adding uncoupling agents 
such as DNP abolished ATP synthesis (Figure 14-27). 


This remarkable experiment demonstrated without a 
doubt that a proton gradient can cause ATP synthase 
to make ATP. Thus, although biochemists had initially 
hoped to discover a high-energy intermediate involved 
in oxidative phosphorylation, the experimental evidence 
eventually convinced them that their search was in vain 
and that the chemiosmotic hypothesis was correct. 
Mitchell was awarded a Nobel Prize in 1978. 


ATP synthase 


Figure 14-27 Experiments in which 
bacteriorhodopsin and bovine 
mitochondrial ATP synthase were 
introduced into liposomes provided 
direct evidence that proton gradients 

{HF | can power ATP production. (A) When 
bacteriorhodopsin is added to artificial lipid 
vesicles (liposomes), the protein generates 
a proton gradient in response to light. 

(B) In artificial vesicles containing both 
bacteriorhodopsin and an ATP synthase, 

a light-generated proton gradient drives 
the formation of ATP from ADP and P;, 

(C) Artificial vesicles containing only ATP 
synthase do not on their own produce 

ATP in response to light. (D) In vesicles 
containing both bacteriorhodopsin and ATP 
synthase, uncoupling agents that abolish 
the proton gradient eliminate light-induced 
ATP synthesis. 
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Figure 14-28 Microorganisms that carry 
out oxygen-producing photosynthesis 
changed Earth’s atmosphere. (A) Living 
stromatolites from a lagoon in Western 
Australia. These structures are formed in 
specialized environments by large colonies 
of oxygen-producing photosynthetic 
cyanobacteria, which form mats that 

trap sand or minerals in thin layers. 

(B) Cross section of a modern stromatolite, 
showing its stratification. (C) A similar, 
layered structure can be seen in a fossilized 
stromatolite. These ancient accretions, some 
more than 3.5 billion years old, contain the 
remnants of the photosynthetic bacteria 
whose O2-liberating activities ultimately 
transformed the Earth's atmosphere. 

(A, courtesy of Cambridge Carbonates 
Ltd.; B, courtesy of Roger Perkins, Virtual 
Fossil Museum, https://creativecommons 
.org/licenses/by-nc/4.0/; C, courtesy of S.M. 
Awramik, University of California/Biological 
Photo Service.) 
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CHLOROPLASTS AND PHOTOSYNTHESIS 


Virtually all the organic material in present-day cells is produced by 
photosynthesis—the series of light-driven reactions that creates organic 
molecules from atmospheric carbon dioxide (CO2). Plants, algae, and 
photosynthetic bacteria such as cyanobacteria use electrons from water 
and the energy of sunlight to perform this chemical feat. In the process, 
water molecules are split, releasing vast quantities of O2 gas into the 
atmosphere. This oxygen in turn supports oxidative phosphorylation— 
not only in animals but also in plants and aerobic bacteria. As we discuss 
in detail at the end of the chapter, it was the activity of photosynthetic 
bacteria that eventually filled the atmosphere with oxygen, enabling the 
subsequent evolution of the myriad life-forms that today use aerobic 
metabolism to make their ATP (Figure 14-28). 


For most plants, photosynthesis occurs mainly in the leaves. There, spe- 
cialized intracellular organelles called chloroplasts capture light energy 
and use it to produce ATP and NADPH. These activated carriers are used 
to convert CO2 into organic molecules that serve as the precursors for 
sugars—a process called carbon fixation. 


Given the chloroplast’s central role in photosynthesis, we begin this 
section by describing the structure of this highly specialized organelle. 
We then provide an overview of photosynthesis, followed by a detailed 
accounting of the mechanism by which chloroplasts harvest energy from 
sunlight to produce huge amounts of ATP and NADPH. Finally, we explain 
how plants use these two activated carriers to synthesize the sugars and 
other food molecules that sustain them, as well as the huge number of 
organisms that subsequently consume plants as part of their diet. 


Chloroplasts Resemble Mitochondria but Have an Extra 
Compartment—the Thylakoid 


Chloroplasts are larger than mitochondria, but both are organized along 
structurally similar principles. Chloroplasts have a highly permeable 
outer membrane and a much less permeable inner membrane, in which 
various membrane transport proteins are embedded. Together, these two 
membranes form the chloroplast envelope, separated by a narrow, inter- 
membrane space. The inner membrane surrounds a large space called 
the stroma, which contains many metabolic enzymes and is analogous 
to the mitochondrial matrix (see Figure 14-5). 


There is, however, an important difference between the organization 
of mitochondria and that of chloroplasts. The inner membrane of the 
chloroplast does not contain the molecular machinery needed to pro- 
duce energy. Instead, the light-capturing systems, electron-transport 
chain, and ATP synthase that convert light energy into ATP during 
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photosynthesis are all contained in the thylakoid membrane. This third 
membrane is folded to form a set of flattened, disclike sacs, called the 
thylakoids, which are arranged in stacks called grana (Figure 14-29). 
The interior of each thylakoid is thought to be connected with that of 
other thylakoids, creating the thylakoid space—a compartment that is 
separate from the chloroplast stroma. 


Photosynthesis Generates—and Then Consumes—ATP 
and NADPH 


The chemistry carried out by photosynthesis can be summarized in one 
simple equation: 


light energy + CO2 + H2O — sugars + O2 + heat energy 


On its surface, the equation accurately represents the process by 
which light energy drives the production of sugars from CO2. But this 
superficial accounting leaves out two of the most important players in 
photosynthesis: the activated carriers ATP and NADPH. In the first stage 
of photosynthesis, the energy from sunlight is used to produce ATP and 
NADPH; in the second stage, these activated carriers are consumed to 
fuel the synthesis of sugars. 


1. Stage 1 of photosynthesis resembles the oxidative phosphorylation 
that takes place on the mitochondrial inner membrane. In this stage, 
an electron-transport chain in the thylakoid membrane harnesses 
the energy of electron transport to pump protons into the thylakoid 
space; the resulting proton gradient then drives the synthesis of ATP 
by ATP synthase. What makes photosynthesis very different is that 
the high-energy electrons donated to the photosynthetic electron- 
transport chain come from a molecule of chlorophyll that has 
absorbed energy from sunlight. Thus the energy-producing reactions 
of stage 1 are sometimes called the light reactions (Figure 14-30). 
Another major difference between photosynthesis and oxidative 
phosphorylation is where the high-energy electrons ultimately wind 
up: those that make their way down the photosynthetic electron- 
transport chain in chloroplasts are donated not to O2 but to NADP*, 
to produce NADPH. 


Chloroplasts and Photosynthesis 


Figure 14-29 Chloroplasts, like 
mitochondria, are composed of a 

set of specialized membranes and 
compartments. (A) Light micrograph 
showing chloroplasts (green) in the cell of 
a flowering plant. (B) Drawing of a single 
chloroplast showing the organelle’s three 
sets of membranes, including the thylakoid 
membrane (dark green), which contains 
the light-capturing and ATP-generating 
systems. (C) A high-magnification view 

of an electron micrograph shows the 
thylakoids arranged in stacks called grana; 
a single thylakoid stack is called a granum 
(Movie 14.9). (A, courtesy of Preeti Dahiya; 
C, courtesy of K. Plaskitt.) 


QUESTION 14-8 


Chloroplasts have a third internal 
compartment, the thylakoid 

space, bounded by the thylakoid 
membrane. This membrane contains 
the photosystems, reaction centers, 
electron-transport chain, and ATP 
synthase. In contrast, mitochondria 
use their inner membrane for 
electron transport and ATP 
synthesis. In both organelles, 
protons are pumped out of the 
largest internal compartment 

(the matrix in mitochondria and 

the stroma in chloroplasts). The 
thylakoid space is completely sealed 
off from the rest of the cell. Why 
does this arrangement allow a larger 
H* gradient in chloroplasts than can 
be achieved for mitochondria? 


479 


Figure 14-30 Both stages of 
photosynthesis depend on the 
chloroplast. In stage 1, a series 
of photosynthetic electron- 
transfer reactions produce ATP 
and NADPH; in the process, 
electrons are extracted from 
water and oxygen is released 

as a by-product, as we discuss 
shortly. In stage 2, carbon dioxide 
is assimilated (fixed) to produce 
sugars and a variety of other 
organic molecules. Stage 1 occurs 
in the thylakoid membrane, 
whereas stage 2 begins in the 
chloroplast stroma (as shown) and 
continues in the cytosol. 
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Figure 14-31 Chlorophylls absorb light 
of blue and red wavelengths. As shown 
in this absorption spectrum, one form of 
chlorophyll preferentially absorbs light 
around wavelengths of 430 nm (blue) and 
660 nm (red). Green light, in contrast, is 
absorbed poorly by this pigment. Other 
chlorophylls can absorb light of slightly 
different wavelengths. 
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2. In stage 2 of photosynthesis, the ATP and the NADPH produced by 
the photosynthetic electron-transfer reactions of stage 1 are used to 
drive the manufacture of sugars from CO; (see Figure 14-30). These 
carbon-fixation reactions, which do not directly require sunlight, begin 
in the chloroplast stroma. There they generate a three-carbon sugar 
called glyceraldehyde 3-phosphate. This simple sugar is exported to 
the cytosol, where it is used to produce a large number of organic 
molecules in the leaves of the plant, including the disaccharide 
sucrose, which is exported from the leaves to nourish the rest of the 
plant. 

Although the formation of ATP and NADPH during stage 1, and the conver- 
sion of CO; to carbohydrate during stage 2, are mediated by two separate 
sets of reactions, they are linked by elaborate feedback mechanisms that 
allow a plant to manufacture sugars only when it is appropriate to do 
so. Several of the enzymes required for carbon fixation, for example, 
are inactivated in the dark and reactivated by light-stimulated electron 
transport. 


Chlorophyll Molecules Absorb the Energy of Sunlight 


Visible light is a form of electromagnetic radiation composed of many 
wavelengths, ranging from violet (wavelength 400 nm) to deep red 
(700 nm). Most chlorophylls absorb light best in the blue and red wave- 
lengths, and they absorb green light poorly (Figure 14-31). Plants look 
green to us because the green light that is not absorbed is reflected back 
to our eyes. 


Chlorophyll’s ability to harness energy derived from sunlight stems from 
its unique structure. The electrons in a chlorophyll molecule are dis- 
tributed in a decentralized cloud around the molecule’s light-absorbing 
porphyrin ring (Figure 14-32). When light of an appropriate wavelength 
hits a molecule of chlorophyll, it excites electrons within this diffuse 
network. This high-energy state is unstable, and an excited chlorophyll 
molecule will rapidly release this excess energy and return to its more 
stable, unexcited state. 


A molecule of chlorophyll, on its own in solution, would simply release 
its absorbed energy in the form of light or heat—accomplishing nothing 
useful. However, the chlorophyll molecules in a chloroplast are able to 
convert light energy into a form of energy useful to the cell because they 
are associated with a special set of photosynthetic proteins in the thyla- 
koid membrane, as we see next. 


Figure 14-32 Chlorophyll’s structure allows it to absorb energy 
from light. Each chlorophyll molecule contains a porphyrin ring with a 
magnesium atom (pink) at its center. This porphyrin ring is structurally 
similar to the one that binds iron in heme (see Figure 14-25). Light is 
absorbed by electrons within the bond network shown in blue, while 
the long, hydrophobic tail (gray) helps hold the chlorophyll in the 
thylakoid membrane. 


Excited Chlorophyll Molecules Funnel Energy into a 
Reaction Center 


In the thylakoid membrane of plants—and the plasma membrane of 
photosynthetic bacteria—chlorophyll molecules are held in large multi- 
protein complexes called photosystems. Each photosystem consists of 
a set of antenna complexes, which capture light energy, and a reaction 
center, which converts that light energy into chemical energy. 


In an antenna complex, hundreds of chlorophyll molecules are arranged 
so that the light energy captured by one chlorophyll molecule can be 
transferred to a neighboring chlorophyll molecule in the network. In this 
way, energy jumps randomly from one chlorophyll molecule to the next— 
either within the same antenna or in an adjacent antenna. At some point, 
this wandering energy will encounter a chlorophyll dimer called the spe- 
cial pair, which holds its electrons at a slightly lower energy than do the 
other chlorophyll molecules. When energy is accepted by this special 
pair, it becomes effectively trapped there. 


The chlorophyll special pair is not located in an antenna complex. Instead, 
it is part of the reaction center—a transmembrane complex of proteins 
and pigments that is thought to have first evolved more than 3 billion 
years ago in primitive photosynthetic bacteria (Movie 14.10). Within the 
reaction center, the special pair is positioned directly next to a set of 
electron carriers that are poised to accept a high-energy electron from 
the excited chlorophyll special pair (Figure 14-33). This electron transfer 
converts the light energy that entered the special pair into the chemical 
energy of a transferable electron—a transformation that lies at the heart 
of photosynthesis. 


As soon as a high-energy electron is passed from chlorophyll to an elec- 
tron carrier, the chlorophyll special pair becomes positively charged, and 
the electron carrier that accepts the electron becomes negatively charged. 
The rapid movement of this electron along a set of intermediary electron 
carriers within the reaction center then creates a charge separation that 
sets in motion the flow of high-energy electrons from the reaction center 
to the electron-transport chain (Figure 14-34). 
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Figure 14-33 A photosystem consists 

of a reaction center surrounded 

by chlorophyll-containing antenna 
complexes. Once light energy has been 
captured by a chlorophyll molecule in an 
antenna complex, it will pass randomly from 
one chlorophyll molecule to another (red 
lines), until it gets trapped by a chlorophyll 
dimer called the special pair, located in 

the reaction center. The chlorophyll special 
pair holds its electrons at a somewhat lower 
energy than the antenna chlorophylls, so the 
energy transferred to it from the antenna 
gets trapped there. Note that in the 
antenna complex, it is energy that moves 
from one chlorophyll molecule to another, 
not electrons. 
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Figure 14-34 In a reaction center, a high- 
energy electron is transferred from the 
chlorophyll special pair to a carrier that 
becomes part of an electron-transport 
chain. Not shown is the set of intermediary 
carriers, embedded within the reaction 
center, that provides a rapid path (blue 
arrows) from the special pair to a mobile 
electron carrier (orange). As illustrated, the 
transfer of the high-energy electron from 
the excited chlorophyll special pair leaves 
behind a positive charge that creates a 
charge-separated state, thereby converting 
light energy to chemical energy. Once 

the electron in the special pair has been 
replaced (an event we will discuss in detail 
shortly), the mobile carrier diffuses away 
from the reaction center, transferring the 
high-energy electron to the transport chain. 


Figure 14-35 Photosystem Il feeds 
electrons to a photosynthetic proton 
pump, leading to the generation of 

ATP by ATP synthase. When light energy 
is captured by photosystem Il, a high- 
energy electron is transferred to a mobile 
electron carrier called plastoquinone (Q), 
which closely resembles the ubiquinone 

of mitochondria. This carrier transfers its 
electrons to a proton pump called the 
cytochrome b¢-f complex, which resembles 
the cytochrome c reductase complex of 
mitochondria and is the sole site of active 
proton pumping in the chloroplast electron- 
transport chain. As in mitochondria, an ATP 
synthase embedded in the membrane then 
uses the energy of the electrochemical 
proton gradient to produce ATP. 
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A Pair of Photosystems Cooperate to Generate both 
ATP and NADPH 


Photosynthesis is ultimately a biosynthetic process. Building organic mol- 
ecules from CO: requires a huge input of energy, in the form of ATP, anda 
very large amount of reducing power, in the form of the activated carrier 
NADPH (see Figure 3-34). To generate both ATP and NADPH, plant cells— 
and free-living photosynthetic organisms such as cyanobacteria—make 
use of two different photosystems, which operate in series. Although they 
are similar in structure, these two photosystems do different things with 
the high-energy electrons that leave their reaction-center chlorophylls. 


When the first photosystem (which, paradoxically, is called photosystem 
II for historical reasons) absorbs light energy, its reaction center passes 
electrons to a mobile electron carrier called plastoquinone, which is part 
of the photosynthetic electron-transport chain. This carrier transfers the 
high-energy electrons to a proton pump, which—like the proton pumps in 
the mitochondrial inner membrane—uses the movement of electrons to 
generate an electrochemical proton gradient. The electrochemical proton 
gradient then drives the production of ATP by an ATP synthase located in 
the thylakoid membrane (Figure 14-35). 


At the same time, a second, nearby photosystem—called photosystem 
I~has been also busy capturing the energy from sunlight. The reaction 
center of this photosystem passes its high-energy electrons to a different 
mobile electron carrier, called ferredoxin, which brings them to an enzyme 
that uses the electrons to reduce NADP* to NADPH (Figure 14-36). It is 
the combined action of these two photosystems that produces both the 
ATP (photosystem II) and the NADPH (photosystem I) required for carbon 
fixation in stage 2 of photosynthesis (see Figure 14-30). 
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Figure 14-36 Photosystem I transfers 

high-energy electrons to an enzyme that 

produces NADPH. When light energy 
oes is captured by photosystem |, a high- 


energy electron is passed to a mobile 
thylakoid | electron carrier called ferredoxin (Fd), a 
membrane a small protein that contains an iron-sulfur 
STROMA >) FNR | center. Ferredoxin carries its electrons to 


photosystem | Qa 


`~ ferredoxin- 
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ferredoxin-NADP* reductase (FNR), the final 
protein in the electron-transport chain that 
catalyzes the production of NADPH. 


Oxygen Is Generated by a Water-Splitting Complex 
Associated with Photosystem II 


The scheme that we have thus far described for photosynthesis has 
ignored a major chemical conundrum. When a mobile electron carrier 
removes an electron from a reaction center (whether in photosystem I or 
photosystem II), it leaves behind a positively charged chlorophyll special 
pair (see Figure 14-34). To reset the system and allow photosynthesis to 


QUESTION 14-9 


Both NADPH and the related carrier 
molecule NADH are strong electron 


proceed, this missing electron must be replaced. 


For photosystem II, the missing electron is replaced by a special manga- 
nese-containing protein complex that removes the electrons from water. 
The cluster of manganese atoms in this water-splitting enzyme holds 
onto two water molecules from which electrons are extracted one at a 
time. Once four electrons have been removed from these two water mol- 
ecules—and used to replace the electrons lost by four excited chlorophyll 
special pairs—Oz is released (Figure 14-37). It is by this means that all of 
the O2 in our atmosphere—all of the O2 we breathe—is produced. Life on 
Earth would be a very different affair without the water-splitting enzyme 
of photosystem II. 
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Figure 14-37 The reaction center of photosystem II includes a water-splitting enzyme that catalyzes the extraction of electrons 
from water. (A) Schematic diagram showing the flow of electrons through the reaction center of photosystem II. When light energy 
excites the chlorophyll special pair, an electron is passed to the mobile electron carrier plastoquinone (Q). An electron is then returned 
to the special pair by a water-splitting enzyme that extracts electrons from water. The manganese (Mn) cluster that participates in the 
electron extraction is shown as a red spot. Once four electrons have been withdrawn from two water molecules, O2 is released into 
the atmosphere. (B) The structure and position of some of the electron carriers involved. (C) Structure of a membrane-embedded 
photosystem II (PSII) complex, including a reaction center and several light-harvesting antenna complexes. This structure, obtained 
from spinach, was determined by cryoelectron microscopy (see Panel 4-6, pp. 168-169). Note that this complex exists as a dimer in the 


membrane, and thus contains two copies of the water-splitting enzyme. 
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Figure 14-38 The serial movement of 
electrons through two photosystems 
powers the production of both ATP 

and NADPH. Electrons are supplied to 
photosystem II by a water-splitting enzyme 
that extracts four electrons from two 
molecules of water, producing O2 as a 
by-product. Their energy is raised by the 
absorption of light to power the pumping 


of protons by the cytochrome bs-f complex. 


Electrons that pass through this complex 
are then donated to a copper-containing 
protein, the mobile electron carrier 
plastocyanin (pC), which ferries them to 
the reaction center of photosystem I. After 
a second energy boost from light, these 
electrons are used to generate NADPH. 
An overview of these reactions is shown 

in Movie 14.11. 
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The “waiting for four electrons” maneuver executed by the water-splitting 
enzyme ensures that no partly oxidized water molecules are released as 
dangerous, highly reactive chemicals. As we discussed earlier, that same 
trick is used by the cytochrome c oxidase that catalyzes the reverse reac- 
tion—the transfer of electrons to O2 to produce water—during oxidative 
phosphorylation (see Figure 14-24). 


The Special Pair in Photosystem | Receives its Electrons 
from Photosystem II 


We have seen that photosystem II replaces electrons lost by its chloro- 
phyll special pair with electrons extracted from water. But where does 
photosystem I get the electrons it needs to reset its special pair? These 
electrons come from photosystem II: the two photosystems work in series, 
such that the chlorophyll special pair in photosystem I serves as the final 
electron acceptor for the electron-transport chain that carries electrons 
from photosystem II. The overall flow of electrons through this linked sys- 
tem is shown in Figure 14-38. In sum, electrons removed from water by 
photosystem II are passed, through a proton pump (the cytochrome b¢-f 
complex), to a mobile electron carrier called plastocyanin. Plastocyanin 
then carries these electrons to photosystem I, to replace the electrons 
lost by its excited chlorophyll special pair. When light is again absorbed 
by this photosystem, the electrons will be boosted to the very high energy 
level needed to reduce NADP* to NADPH. 


Having these two photosystems operating in series effectively couples 
their two electron-energizing steps. This extra boost of energy—pro- 
vided by the light harvested by both photosystems—allows an electron 
to be transferred from water, which normally holds onto its electrons 
very tightly (redox potential = +820 mV), to NADPH, which normally 
holds onto its electrons loosely (redox potential = -320 mV). In addition 
to powering this chemistry, there is enough energy left over to enable 
the electron-transport chain that links the two photosystems to pump 
H* across the thylakoid membrane, so that the ATP synthase embedded 
in this membrane can also harness light-derived energy to produce ATP 
(Figure 14-39). 


Carbon Fixation Uses ATP and NADPH to Convert CO2 
into Sugars 
The light reactions of photosynthesis generate ATP and NADPH in the 


chloroplast stroma, as we have just seen. But the inner membrane of the 
chloroplast is impermeable to both of these compounds, which means 


Chloroplasts and Photosynthesis 485 


Figure 14-39 The combined 
actions of photosystems | and Il 
boost electrons to the energy level 
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that they cannot be exported directly to the cytosol. To provide energy and 
reducing power for the rest of the cell, the ATP and NADPH are instead 
used within the chloroplast stroma to produce a simple three-carbon 
sugar that can be exported to the cytosol by specific carrier proteins in 
the chloroplast inner membrane. This production of sugar from CO2 and 
water, which occurs during stage 2 of photosynthesis, is called carbon 
fixation. 


In the central reaction of photosynthetic carbon fixation, CO2 from 
the atmosphere is attached to a five-carbon sugar derivative, ribulose 
1,5-bisphosphate, to yield two molecules of the three-carbon compound 
3-phosphoglycerate. This carbon-fixing reaction, which was discovered in 
1948, is catalyzed in the chloroplast stroma by a large enzyme called ribu- 
lose bisphosphate carboxylase or Rubisco (Figure 14-40). Rubisco works 
much more slowly than most other enzymes: it processes about three 
molecules of substrate per second—compared with 1000 molecules per 
second for a typical enzyme. To compensate for this sluggish behavior, 
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QUESTION 14-10 
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A. How do cells in plant roots 
survive, since they contain no 
chloroplasts and are not exposed to 
light? 

B. Unlike mitochondria, 
chloroplasts do not have a 
transporter that allows them to 
export ATP to the cytosol. How, 
then, do plant cells obtain the ATP 
that they need to carry out energy- 
requiring metabolic reactions in the 
cytosol? 


plants maintain a surplus of Rubisco to ensure the efficient production 
of sugars. The enzyme generally represents more than 50% of the total 
chloroplast protein, and it is widely claimed to be the most abundant 
protein on Earth. 


Although the production of carbohydrates from CO3 and H20 is extremely 
energetically unfavorable, the fixation of CO2 catalyzed by Rubisco is 
actually an energetically favorable reaction. That’s because a continuous 
supply of energy-rich ribulose 1,5-bisphosphate is fed into the reac- 
tion. As this compound is consumed—by the addition of CO2 (see Figure 
14-40)—it must be replenished. The energy and reducing power needed 
to regenerate ribulose 1,5-bisphosphate come from the ATP and NADPH 
produced by the photosynthetic light reactions. 


The elaborate series of reactions in which CO2 combines with ribulose 
1,5-bisphosphate to produce a simple three-carbon sugar—a portion of 
which is used to regenerate the ribulose 1,5-bisphosphate that’s con- 
sumed—forms a cycle, called the carbon-fixation cycle, or the Calvin cycle 
(Figure 14-41). For every three molecules of CO? that enter the cycle, one 
molecule of glyceraldehyde 3-phosphate is ultimately produced, at the 
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Figure 14-41 The carbon-fixation cycle consumes ATP and NADPH to form 
glyceraldehyde 3-phosphate from CO2 and H20. In the first stage of the cycle 
(highlighted in yellow), CO2 is added to ribulose 1,5-bisphosphate (as shown 

in Figure 14-40). In the second stage (highlighted in red), ATP and NADPH are 
consumed to convert 3-phosphoglycerate to glyceraldehyde 3-phosphate. In the 
final stage (highlighted in blue), most of the glyceraldehyde 3-phosphate produced 
is used to regenerate ribulose 1,5-bisphosphate; the rest is transported out of 
the chloroplast stroma into the cytosol. The number of carbon atoms in each 
type of molecule is indicated in yellow. There are many intermediates between 
glyceraldehyde 3-phosphate and ribulose 1,5-bisphosphate, but they have been 
omitted here for clarity. The entry of water into the cycle is also not shown. 
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expense of nine molecules of ATP and six molecules of NADPH, which are 
consumed in the process. Glyceraldehyde 3-phosphate, the three-carbon 
sugar that is the final product of the cycle, provides the starting material 
for the synthesis of the many other sugars and other organic molecules 
that the plant needs. 


Sugars Generated by Carbon Fixation Can Be Stored 
as Starch or Consumed to Produce ATP 


The glyceraldehyde 3-phosphate generated by carbon fixation in the chlo- 
roplast stroma can be used in a number of ways, depending on the needs 
of the plant. During periods of excess photosynthetic activity, much of the 
sugar is retained in the chloroplast stroma and converted to starch. Like 
glycogen in animal cells, starch is a large polymer of glucose that serves 
as a carbohydrate reserve, and it is stored as large granules in the chloro- 
plast stroma. Starch forms an important part of the diet of all animals that 
eat plants. Other glyceraldehyde 3-phosphate molecules are converted to 
fat in the stroma. This material, which accumulates as fat droplets, like- 
wise Serves as an energy reserve (Figure 14-42). 


At night, this stored starch and fat can be broken down to sugars and 
fatty acids, which are exported to the cytosol to help support the meta- 
bolic needs of the plant. Some of the exported sugar enters the glycolytic 
pathway (see Figure 13-5), where it is converted to pyruvate. Most of that 
pyruvate, along with the fatty acids, enters the plant cell mitochondria 
and is fed into the citric acid cycle, ultimately leading to the production 
of ATP by oxidative phosphorylation (Figure 14-43). Plants use this ATP 
to power a huge variety of metabolic reactions, just as animal cells and 
other nonphotosynthetic organisms do. 
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Figure 14-42 Chloroplasts often 

contain large stores of carbohydrates 
and fatty acids. An electron micrograph 

of a thin section of a single chloroplast 
shows the chloroplast envelope and the 
starch granules and fat droplets that have 
accumulated in the stroma as a result of 
the biosynthetic processes that occur there. 
(Courtesy of K. Plaskitt.) 


Figure 14-43 In plants, the chloroplasts 
and mitochondria collaborate to 

supply cells with metabolites and ATP. 
The chloroplast’s inner membrane is 
impermeable to the ATP and NADPH that 
are produced in the stroma during the 
light reactions of photosynthesis. These 
molecules are funneled into the carbon- 
fixation cycle, where they are used to make 
sugars. The resulting sugars and their 
metabolites are either stored within the 
chloroplast—in the form of starch or fat— 
or exported to the rest of the plant cell. 
There, they can enter the energy-generating 
pathway that ends in ATP synthesis in the 
mitochondria. Unlike those chloroplasts, 
mitochondrial membranes are permeable 
to ATP, as indicated. Note that some of 
the O2 released to the atmosphere by 
photosynthesis in chloroplasts is used for 
oxidative phosphorylation in mitochondria; 
similarly, some of the COz released by the 
citric acid cycle in mitochondria is used for 
carbon fixation in chloroplasts. 
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The glyceraldehyde 3-phosphate exported from chloroplasts into the 
cytosol can also be converted into many other metabolites, including the 
disaccharide sucrose. Sucrose is the major form in which sugar is trans- 
ported between the cells of a plant: just as glucose is transported in the 
blood of animals, so sucrose is exported from the leaves via the vascular 
system to provide carbohydrate to the rest of the plant. 


THE EVOLUTION OF ENERGY-GENERATING 
SYSTEMS 


The ability to sequence the genomes of microorganisms that are difficult, 
if not impossible, to grow in culture has made it possible to identify a 
huge variety of previously mysterious life-forms. Some of these unicel- 
lular organisms thrive in the most inhospitable habitats on the planet, 
including sulfurous hot springs and hydrothermal vents that lie deep on 
the ocean floor. In these remarkable microbes, we are finding clues to 
life's history. Like fingerprints left at the scene of a crime, the proteins and 
small molecules these organisms produce provide evidence that allows 
us to trace the history of ancient biological events, including those that 
gave rise to the ATP-generating systems present in the mitochondria and 
chloroplasts of modern eukaryotic cells. We therefore end this chapter 
with a brief review of what has been learned about the origins of present- 
day energy-harvesting systems, which have played such a critical part in 
fueling the evolution of life on Earth. 


Oxidative Phosphorylation Evolved in Stages 


As we mentioned earlier, the first living cells on Earth may have con- 
sumed geochemically produced organic molecules and generated ATP 
by fermentation. Because oxygen was not yet present in the atmosphere, 
such anaerobic fermentation reactions would have dumped organic 
acids—such as lactic or formic acids, for example—into the environment 
(see Figure 13-6A). 


A buildup of such acids would have lowered the pH of the environment, 
favoring the survival of cells that evolved transmembrane proteins that 
could pump Ht out of the cytosol, preventing the cell interior from becom- 
ing too acidic. Some of these pumps may have used the energy available 
from ATP hydrolysis to eject H* from the cell (stage 1 in Figure 14-44). 
Such a proton pump could have been the ancestor of present-day ATP 
synthases. Other pumps, like those in modern respiratory chain com- 
plexes, eventually evolved to use the movement of electrons between 
molecules of different redox potentials as a source of energy for pumping 
H+ across the plasma membrane (stage 2 in Figure 14-44). Indeed, some 
present-day bacteria that grow on formic acid use the small amount of 
redox energy derived from the transfer of electrons from formic acid to 
fumarate to pump Ht. 


When these H*t-pumping electron-transport systems became efficient 
enough, cells could harvest more redox energy than they needed to 


Figure 14-44 Chemiosmotic processes most likely evolved in 
stages. The first stage might have involved the evolution of an ATPase 
that pumped protons out of the cell using the energy of ATP hydrolysis. 
Stage 2 could have involved the evolution of a different proton pump, 
driven by an electron-transport chain. Stage 3 could then link these two 
systems together to generate an ATP synthase that uses the protons 
pumped by the electron-transport chain to synthesize ATP. An early cell 
with this final system would have had a large selective advantage over 
cells with neither of the systems or only one. 
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maintain their internal pH. These cells could then generate large elec- 
trochemical proton gradients, which they could couple to the production 
of ATP (stage 3 in Figure 14-44). Because such cells would require much 
less of the dwindling supply of fermentable nutrients, they would have 
proliferated at the expense of their neighbors. 


Photosynthetic Bacteria Made Even Fewer Demands on 
Their Environment 


The major evolutionary breakthrough in energy metabolism, however, 
was almost certainly the formation of photochemical reaction cent- 
ers that could use the energy of sunlight to produce molecules such as 
NADPH. It is thought that this development occurred early in the pro- 
cess of evolution—more than 3 billion years ago, in the ancestors of 
green sulfur bacteria. Present-day green sulfur bacteria use light energy 
to transfer hydrogen atoms (as an electron plus a proton) from H2S to 
NADPH, thereby creating the strong reducing power required for carbon 
fixation (Figure 14-45). 


The next step is thought to have involved the evolution of organisms 
capable of using water instead of H2S as the electron source for photo- 
synthesis. This entailed the evolution of a water-splitting enzyme and the 
addition of a second photosystem, acting in conjunction with the first, to 
bridge the enormous gap in redox potential between H2O and NADPH 
(see Figure 14-39). 


The biological consequences of this evolutionary step were far-reaching. 
For the first time, there were organisms that made only minimal chemical 
demands on their environment. These cells—including the first cyano- 
bacteria (see Figure 14-28)—could spread and evolve in ways denied to 
the earlier photosynthetic bacteria, which needed H2S, organic acids, or 
other sources of electrons. Consequently, large amounts of fermentable 
organic materials—produced by these cells and their ancestors—began 
to accumulate. Moreover, O2 began to enter the atmosphere in large 
amounts (Figure 14-46). 


The availability of O2 made possible the development of bacteria that 
relied on aerobic metabolism to make their ATP. As explained previously, 
these organisms could harness the large amount of energy released when 
carbohydrates and other reduced organic molecules are broken down all 
the way to CO? and H20. 


Figure 14-45 Photosynthesis in green 
sulfur bacteria uses hydrogen sulfide 
(H2S) as an electron donor rather than 
water. Electrons are easier to extract from 
H2S than from H2O, because H2S has a 
much higher redox potential (compare 
with Figure 14-39). Therefore, only one 
photosystem is needed to produce NADPH, 
and elemental sulfur is formed as a by- 
product instead of O2. The photosystem 

in green sulfur bacteria resembles 
photosystem | in plants and cyanobacteria. 
These photosystems all use a series of 
iron-sulfur centers as the electron carriers 
that eventually donate their high-energy 
electrons to ferredoxin (Fd). A bacterium of 
this type is Chlorobium tepidum, which can 
thrive at high temperatures and low light 
intensities in hot springs. 
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Figure 14-46 Oxygen entered Earth’s 
atmosphere billions of years ago. With the 
evolution of photosynthesis in prokaryotes 
more than 3 billion years ago, organisms 
would have no longer depended on 
preformed organic chemicals: they could 
make their own organic molecules from 
CO2. Note that there was a delay of about 
a billion years between the appearance 

of photosynthetic bacteria that split water 
and released O2 and the accumulation of 
high levels of O2 in the atmosphere. This 
delay is thought to have been due to the 
initial reaction of the Oz with abundant 
ferrous iron (Fe2*) dissolved in the early 
oceans. Only when the iron was used 

up could large amounts of Oz begin to 
accumulate in the atmosphere. In response 
to rising amounts of O2 in the atmosphere, 
nonphotosynthetic, aerobic organisms 
appeared, and the concentration of O2 in 
the atmosphere eventually leveled out. 


As organic materials accumulated as a by-product of photosynthesis, 
some photosynthetic bacteria—including the ancestors of the bacterium 
Escherichia coli—lost their ability to survive on light energy alone and 
came to rely entirely on cell respiration. Mitochondria arose when a pre- 
eukaryotic cell engulfed such an aerobic bacterium (see Figure 1-19). 
Plants arose somewhat later, when a descendant of this early aerobic 
eukaryote captured a photosynthetic bacterium, which became the pre- 
cursor of chloroplasts (see Figure 1-21). Once eukaryotes had acquired 
the bacterial symbionts that became mitochondria and chloroplasts, they 
could then embark on the spectacular pathway of evolution that eventu- 
ally led to complex multicellular organisms, including ourselves. 


The Lifestyle of Methanococcus Suggests That 
Chemiosmotic Coupling Is an Ancient Process 


The conditions today that most resemble those under which cells are 
thought to have lived 3.5-3.8 billion years ago may be those near deep- 
ocean hydrothermal vents. These vents represent places where the 
Earth’s molten mantle is breaking through the overlying crust, expanding 
the width of the ocean floor. Indeed, the modern organisms that appear 
to be most closely related to the hypothetical cells from which all life 
evolved live at 75°C to 95°C, temperatures approaching that of boiling 
water. This ability to thrive at such extreme temperatures suggests that 
life's common ancestor—the cell that gave rise to bacteria, archaea, and 
eukaryotes—lived under very hot, anaerobic conditions. 


One of the archaea that live in this environment today is Methanococcus 
jannaschii. Originally isolated from a hydrothermal vent more than a 
mile beneath the ocean surface, the organism grows in the complete 
absence of light and gaseous oxygen, using as nutrients the inorganic 
gases—hydrogen (H2), CO2, and nitrogen (N2)—that bubble up from the 
vent (Figure 14-47). Its mode of existence gives us a hint of how early 
cells might have used electron transport to derive energy and to extract 
carbon molecules from inorganic materials that were freely available on 
the hot early Earth. 


Methanococcus relies on N2 gas as its source of nitrogen for making 
organic molecules such as amino acids. The organism reduces N2 to 
ammonia (NH3) by the addition of hydrogen, a process called nitrogen 
fixation. Nitrogen fixation requires a large amount of energy, as does the 
carbon-fixation process that converts CO2 and H2O into sugars. Much 


of the energy that Methanococcus requires for both processes is derived 
from the transfer of electrons from Hz to CO2, with the release of large 
amounts of methane (CH4) as a waste product (thus producing natu- 
ral gas and giving the organism its name). Part of this electron transfer 
occurs in the plasma membrane and results in the pumping of protons 
(H*) across it. The resulting electrochemical proton gradient drives an 
ATP synthase in the same membrane to make ATP. 


The fact that such chemiosmotic coupling exists in an organism like 
Methanococcus suggests that the storage of energy in a proton gradient 
derived from electron transport is an extremely ancient process. Thus, 
chemiosmotic coupling may have fueled the evolution of nearly all life- 
forms on Earth. 


ESSENTIAL CONCEPTS 


e Mitochondria, chloroplasts, and many prokaryotes generate energy 
by a membrane-based mechanism known as chemiosmotic coupling, 
which involves using an electrochemical proton gradient to drive the 
synthesis of ATP. 


e In animal cells, mitochondria produce most of the ATP, using energy 
derived from the oxidation of sugars and fatty acids. 


e Mitochondria have an inner and an outer membrane. The inner mem- 
brane encloses the mitochondrial matrix; there, the citric acid cycle 
produces large amounts of NADH and FADH: from the oxidation of 
acetyl CoA derived from sugars and fats. 


e Inthe inner mitochondrial membrane, high-energy electrons donated 
by NADH and FADH2 move along an electron-transport chain and 
eventually combine with molecular oxygen (O2) to form water. 


e Much of the energy released by electron transfers along the electron- 
transport chain is harnessed to pump protons (H+) out of the matrix, 
creating an electrochemical proton gradient. The proton pumping is 
carried out by three large respiratory enzyme complexes embedded 
in the inner membrane. 


e The electrochemical proton gradient across the inner mitochondrial 
membrane is harnessed to make ATP when protons move back into 
the matrix through an ATP synthase located in the inner membrane. 


e The electrochemical proton gradient also drives the active transport 
of selected metabolites into and out of the mitochondrial matrix. 


e During photosynthesis in chloroplasts and photosynthetic bacteria, 
the energy of sunlight is captured by chlorophyll molecules embed- 
ded in large protein complexes known as photosystems; in plants, 
these photosystems are located in the thylakoid membranes of chlo- 
roplasts in leaf cells. 
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Figure 14-47 Methanococcus represents 
life-forms that might have existed 

early in Earth’s history. (A) Scanning 
electron micrograph showing individual 
Methanococcus cells. These deep-sea 
archaea use the hydrogen gas (Hz) that 
bubbles from deep-sea vents (B) as the 
source of reducing power to generate 
energy via chemiosmotic coupling. 

(A, from C.B. Park & D.S. Clark, Appl. 
Environ. Microbiol. 68:1458-1463, 2002. 
With permission from the American Society 
for Microbiology; B, National Oceanic and 
Atmospheric Administration's Pacific Marine 
Environmental Laboratory Vents Program.) 
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e Electron-transport chains associated with photosystems transfer 
electrons from water to NADP* to form NADPH, which produces O2 
as a by-product. 


e The photosynthetic electron-transport chains in chloroplasts also 
generate a proton gradient across the thylakoid membrane, which 
is used by an ATP synthase embedded in that membrane to generate 
ATP. 


e The ATP and the NADPH made by photosynthesis are used within 
the chloroplast stroma to drive the carbon-fixation cycle, which pro- 
duces carbohydrate from CO2 and water. 


e Carbohydrate is exported from the stroma to the plant cell cytosol; 
there it provides the starting material used for the synthesis of many 
other organic molecules and for the production of the materials used 
by plant cell mitochondria to produce ATP. 


e Both mitochondria and chloroplasts are thought to have evolved 
from bacteria that were endocytosed by other cells. Each retains its 
own genome and divides by processes that resemble bacterial cell 
division. 

e Chemiosmotic coupling mechanisms are of ancient origin. Modern 
microorganisms that live in environments similar to those thought to 
have been present on the early Earth also use chemiosmotic coupling 
to produce ATP. 
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QUESTIONS 


QUESTION 14-11 


Which of the following statements are correct? Explain your 
answers. 


A. After an electron has been removed by light, the 
positively charged chlorophyll in the reaction center of the 
first photosystem (photosystem II) has a greater affinity for 
electrons than O2 has. 


B. Photosynthesis is the light-driven transfer of an electron 
from chlorophyll to a second molecule that normally has a 
much lower affinity for electrons. 


C. Because it requires the removal of four electrons to 
release one O2 molecule from two H20 molecules, the 


respiratory enzyme complex 
stroma 


thylakoid 


water-splitting enzyme in photosystem II has to keep the 
reaction intermediates tightly bound so as to prevent partly 
reduced, and therefore hazardous, superoxide radicals from 
escaping. 


QUESTION 14-12 


Which of the following statements are correct? Explain your 
answers. 


A. Many, but not all, electron-transfer reactions involve 
metal ions. 


B. The electron-transport chain generates an electrical 
potential across the membrane because it moves electrons 
from the intermembrane space into the matrix. 


C. The electrochemical proton gradient consists of two 
components: a pH difference and an electrical potential. 


D. Ubiquinone and cytochrome c are both diffusible 
electron carriers. 


E. Plants have chloroplasts and therefore can live without 
mitochondria. 


F. Both chlorophyll and heme contain an extensive system 
of double bonds that allows them to absorb visible light. 


G. The role of chlorophyll in photosynthesis is equivalent to 


that of heme in mitochondrial electron transport. 


H. Most of the dry weight of a tree comes from the 
minerals that are taken up by the roots. 


QUESTION 14-13 


A single proton moving down its electrochemical 

gradient into the mitochondrial matrix space liberates 

19.2 kJ/mole of free energy (AG). How many protons 

have to flow across the inner mitochondrial membrane to 
synthesize one molecule of ATP if the AG for ATP synthesis 
under intracellular conditions is between 46 and 54 kJ/ 
mole? (AG is discussed in Chapter 3, pp. 88-98.) Why is a 
range given for this latter value and not a precise number? 
Under which conditions would the lower value apply? 


QUESTION 14-14 


In the following statement, choose the correct one of the 
alternatives in italics and justify your answers. “If no O2 is 
available, all components of the mitochondrial electron- 
transport chain will accumulate in their reduced/oxidized 
form. If O2 is suddenly added again, the electron carriers 
in cytochrome c oxidase will become reduced/oxidized 
before/after those in NADH dehydrogenase.” 


QUESTION 14-15 


Assume that the conversion of oxidized ubiquinone to 
reduced ubiquinone by NADH dehydrogenase occurs on 
the matrix side of the inner mitochondrial membrane and 
that its oxidation by cytochrome c reductase occurs on the 
intermembrane-space side of the membrane (see Figures 
14-14 and 14-21). What are the consequences of this 
arrangement for the generation of the H* gradient across 
the membrane? 


QUESTION 14-16 


If a voltage is applied to two platinum wires (electrodes) 
immersed in water, then water molecules become split into 
H2 and O2 gas. At the negative electrode, electrons are 
donated and H2 gas is released; at the positive electrode, 
electrons are accepted and O2 gas is produced. When 
photosynthetic bacteria and plant cells split water, they 
produce O2 but no H2. Why? 


QUESTION 14-17 


In an insightful experiment performed in the 1960s, 
chloroplasts were first soaked in an acidic solution at pH 4, 
so that the stroma and thylakoid space became acidified 
(Figure Q14-17). They were then transferred to a basic 
solution (pH 8). This quickly increased the pH of the stroma 
to 8, while the thylakoid space temporarily remained at 
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pH 4. A burst of ATP synthesis was observed, and the pH 
difference between the thylakoid and the stroma then 
disappeared. 


A. Explain why these conditions lead to ATP synthesis. 
B. Is light needed for the experiment to work? 


C. What would happen if the solutions were switched, 
so that the first incubation is in the pH 8 solution and the 
second one in the pH 4 solution? 


D. Does the experiment support or question the 
chemiosmotic model? 


Explain your answers. 


QUESTION 14-18 


As your first experiment in the laboratory, your adviser 
asks you to reconstitute purified bacteriorhodopsin, a 
light-driven H* pump from the plasma membrane of 
photosynthetic bacteria, and purified ATP synthase from 
ox-heart mitochondria together into the same membrane 
vesicles—as shown in Figure Q14-18. You are then asked 
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Figure Q14-18 
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to add ADP and P; to the external medium and shine light 
into the suspension of vesicles. 


A. What do you observe? 


B. What do you observe if not all the detergent is removed 
and the vesicle membrane therefore remains leaky to ions? 


C. You tell a friend over dinner about your new 
experiments, and he questions the validity of an approach 
that utilizes components from so widely divergent, 
unrelated organisms: “Why would anybody want to mix 
vanilla pudding with brake fluid?” Defend your approach 
against his critique. 


QUESTION 14-19 


FADH? is produced in the citric acid cycle by a 
membrane-embedded enzyme complex, called succinate 
dehydrogenase, that contains bound FAD and carries out 
the reactions 


succinate + FAD — fumarate + FADH2 
and 
FADH2 — FAD + 2Ht + 2e 


The redox potential of FADH2, however, is only -220 mV. 
Referring to Panel 14-1 (p. 472) and Figure 14-22, suggest a 
plausible mechanism by which its electrons could be fed into 
the electron-transport chain. Draw a diagram to illustrate 
your proposed mechanism. 


QUESTION 14-20 


Some bacteria have become specialized to live in an 
environment of high pH (pH ~10). Do you suppose that 
these bacteria use a proton gradient across their plasma 
membrane to produce their ATP? (Hint: all cells must 
maintain their cytoplasm at a pH close to neutrality.) 


QUESTION 14-21 


Figure Q14-21 summarizes the circuitry used by 
mitochondria and chloroplasts to interconvert different 
forms of energy. Is it accurate to say 


A. that the products of chloroplasts are the substrates for 
mitochondria? 


B. that the activation of electrons by the photosystems 
enables chloroplasts to drive electron transfer from H20 to 
carbohydrate, which is the opposite direction of electron 
transfer in the mitochondrion? 


C. that the citric acid cycle is the reverse of the normal 
carbon-fixation cycle? 


QUESTION 14-22 


A manuscript has been submitted for publication to a 
prestigious scientific journal. In the paper, the authors 
describe an experiment in which they have succeeded in 
trapping an individual ATP synthase molecule and then 
mechanically rotating its head by applying a force to it. 

The authors show that upon rotating the head of the 

ATP synthase, ATP is produced, in the absence of an H* 
gradient. What might this mean about the mechanism 
whereby ATP synthase functions? Should this manuscript be 
considered for publication in one of the best journals? 


Energy Generation in Mitochondria and Chloroplasts 


QUESTION 14-23 


You mix the following components in a reconstituted 
membrane-bound system. Assuming that the electrons 
must follow the path specified in Figure 14-14, in which 
experiments would you expect a net transfer of electrons to 
cytochrome c? Discuss why electron transfer does not occur 
in the other experiments. 


A. reduced ubiquinone and oxidized cytochrome c 
oxidized ubiquinone and oxidized cytochrome c 
reduced ubiquinone and reduced cytochrome c 


oxidized ubiquinone and reduced cytochrome c 


moo p 


. reduced ubiquinone, oxidized cytochrome c, and 
cytochrome c reductase complex 


F. oxidized ubiquinone, oxidized cytochrome c, and 
cytochrome c reductase complex 


G. reduced ubiquinone, reduced cytochrome c, and 
cytochrome c reductase complex 


H. oxidized ubiquinone, reduced cytochrome c, and 
cytochrome c reductase complex 


(A) MITOCHONDRION 
H+ gradient 


citric 
acid 
cycle 


products 


(B) CHLOROPLAST 
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fixation 
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Figure 014-21 


Intracellular Compartments 
and Protein Transport 


At any one time, a typical eukaryotic cell carries out thousands of differ- 
ent chemical reactions, many of which are mutually incompatible. One 
series of reactions makes glucose, for example, while another breaks it 
down; some enzymes synthesize peptide bonds, whereas others hydro- 
lyze them, and so on. Indeed, if the cells of an organ such as the liver are 
broken apart and their contents are mixed together in a test tube, the 
result is chemical chaos, and the cells’ enzymes and other proteins are 
quickly degraded by their own proteolytic enzymes. For a cell to operate 
effectively, the different intracellular processes that occur simultaneously 
must somehow be segregated. 


Cells have evolved several strategies for isolating and organizing their 
chemical reactions. One strategy used by both prokaryotic and eukary- 
otic cells is to aggregate the different enzymes required to catalyze a 
particular sequence of reactions into large, multicomponent complexes. 
Such complexes—which can form large biochemical subcompartments 
with distinct functions—are involved in many important cell processes, 
including the synthesis of DNA and RNA, and the assembly of ribosomes 
(as discussed in Chapter 4, pp. 155-158). A second strategy, which is most 
highly developed in eukaryotic cells, is to confine different metabolic 
processes—and the proteins required to perform them—within different 
membrane-enclosed compartments. As discussed in Chapters 11 and 12, 
cell membranes provide selectively permeable barriers through which 
the transport of most molecules can be controlled. In this chapter, we 
consider this strategy of membrane-dependent compartmentalization. 


In the first section, we describe the principal membrane-enclosed com- 
partments, or membrane-enclosed organelles, of eukaryotic cells and 
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Figure 15-1 In eukaryotic cells, internal 
membranes create enclosed compartments 
that segregate different metabolic 
processes. Examples of many of the major 
membrane-enclosed organelles can be 
identified in this electron micrograph of 

part of a liver cell, seen in cross section. 

The small, black granules between the 
compartments are aggregates of glycogen 
and the enzymes that control its synthesis 
and breakdown. (By permission of E.L. Bearer 
and Daniel S. Friend.) 
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briefly consider their main functions. In the second section, we discuss 
how the protein composition of the different compartments is set up and 
maintained. Each compartment contains a unique set of proteins that 
have to be transferred selectively from the cytosol, where they are made, 
to the compartment where they will be used. This transfer process, called 
protein sorting, depends on signals built into the amino acid sequence of 
the proteins. In the third section, we describe how certain membrane- 
enclosed compartments in a eukaryotic cell communicate with one 
another by forming small, membrane-enclosed sacs, or vesicles. These 
vesicles pinch off from one compartment, move through the cytosol, and 
fuse with another compartment in a process called vesicular transport. In 
the last two sections, we discuss how this constant vesicular traffic also 
provides the main routes for releasing proteins from the cell by the pro- 
cess of exocytosis and for importing them by the process of endocytosis. 


MEMBRANE-ENCLOSED ORGANELLES 


Whereas a prokaryotic cell usually consists of a single compartment 
enclosed by the plasma membrane, eukaryotic cells are elaborately sub- 
divided by internal membranes. When a cross section through a plant or 
an animal cell is examined in the electron microscope, numerous small, 
membrane-enclosed sacs, tubes, spheres, and irregularly shaped struc- 
tures can be seen, often arranged without much apparent order (Figure 
15-1). Most of these structures are membrane-enclosed organelles, or 
parts of such organelles, each of which contains a unique set of large and 
small molecules and carries out a specialized function. In this section, we 
review these functions and discuss how different membrane-enclosed 
organelles may have evolved. 


Eukaryotic Cells Contain a Basic Set of Membrane- 
enclosed Organelles 


The major membrane-enclosed organelles of an animal cell are illus- 
trated in Figure 15-2, and their functions are summarized in Table 15-1. 
These organelles are surrounded by the cytosol, which is enclosed by 
the plasma membrane. The nucleus is generally the most prominent 
organelle in eukaryotic cells. It is surrounded by a double membrane, 
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known as the nuclear envelope, and communicates with the cytosol via 
nuclear pores that perforate the envelope. The outer nuclear membrane 
is continuous with the membrane of the endoplasmic reticulum (ER), a 
system of interconnected membranous sacs and tubes that often extends 
throughout most of the cell. The ER is the major site of synthesis of new 
membranes in the cell. Large areas of the ER have ribosomes attached 
to the cytosolic surface and are designated rough endoplasmic reticulum 
(rough ER). The ribosomes are actively synthesizing proteins that are 
inserted into the ER membrane or delivered to the ER interior, a space 
called the Jumen. The smooth endoplasmic reticulum (smooth ER) lacks 
ribosomes. It is scanty in most cells but is highly developed for perform- 
ing particular functions in others: for example, it is the site of steroid 
hormone synthesis in some endocrine cells of the adrenal gland and the 
site where a variety of organic molecules, including alcohol, are detoxi- 
fied in liver cells. In many eukaryotic cells, the smooth ER also sequesters 
Ca?* from the cytosol; the release and reuptake of Ca?* from the ER is 
involved in muscle contraction and other responses to extracellular sig- 
nals (discussed in Chapters 16 and 17). 


TABLE 15-1 THE MAIN FUNCTIONS OF MEMBRANE-ENCLOSED 
ORGANELLES OF A EUKARYOTIC CELL 
comparen [Wei Futon U 


Cytosol contains many metabolic pathways (Chapters 3 and 13); 
protein synthesis (Chapter 7); the cytoskeleton (Chapter 17) 


Nucleus contains main genome (Chapter 5); DNA and RNA synthesis 
(Chapters 6 and 7) 

Endoplasmic synthesis of most lipids (Chapter 11); synthesis of proteins for 

reticulum (ER) distribution to many organelles and to the plasma membrane 


(this chapter) 


Golgi apparatus modification, sorting, and packaging of proteins and lipids for 
either secretion or delivery to another organelle (this chapter) 


Lysosomes intracellular degradation (this chapter) 
Endosomes sorting of endocytosed material (this chapter) 
Mitochondria ATP synthesis by oxidative phosphorylation (Chapter 14) 


Chloroplasts (in ATP synthesis and carbon fixation by photosynthesis 
plant cells) (Chapter 14) 


Peroxisomes oxidative breakdown of toxic molecules (this chapter) 


Membrane-Enclosed Organelles 


Figure 15-2 A cell from the lining of 

the intestine contains the basic set 

of membrane-enclosed organelles 

found in most animal cells. The nucleus, 
endoplasmic reticulum (ER), Golgi 
apparatus, lysosomes, endosomes, 
mitochondria, and peroxisomes are distinct 
compartments separated from the cytosol 
by at least one selectively permeable 
membrane. Ribosomes are shown bound 
to the cytosolic surface of portions of the 
ER, called the rough ER; the ER that lacks 
ribosomes is called smooth ER. Additional 
ribosomes can be found free in the cytosol. 
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The Golgi apparatus, which is usually situated near the nucleus, receives 
proteins and lipids from the ER, modifies them, and then dispatches them 
to other destinations in the cell. Small sacs of digestive enzymes called 
lysosomes degrade worn-out organelles, as well as macromolecules and 
particles taken into the cell by endocytosis. On their way to lysosomes, 
endocytosed materials must first pass through a series of compartments 
called endosomes, which sort the ingested molecules and recycle some 
of them back to the plasma membrane. Peroxisomes are small organelles 
that contain enzymes that break down lipids and destroy toxic molecules, 
producing hydrogen peroxide. Mitochondria and (in plant cells) chloro- 
plasts are each surrounded by a double membrane and are the sites of 
oxidative phosphorylation and photosynthesis, respectively (discussed in 
Chapter 14); both contain internal membranes that are highly specialized 
for the production of ATP. 


Many of the membrane-enclosed organelles, including the ER, Golgi 
apparatus, mitochondria, and chloroplasts, are positioned in the cell by 
attachment to the cytoskeleton, especially to microtubules. Cytoskeletal 
filaments provide tracks for moving the organelles around and for direct- 
ing the traffic of vesicles between one organelle and another. These 
movements are driven by motor proteins that use the energy of ATP 
hydrolysis to propel the organelles and vesicles along the filaments, as 
discussed in Chapter 17. 


On average, the membrane-enclosed organelles together occupy nearly 
half the volume of a eukaryotic cell (Table 15-2), and the total amount of 
membrane associated with them is enormous. In a typical mammalian 
cell, for example, the area of the endoplasmic reticulum membrane is 
20-30 times greater than that of the plasma membrane. In terms of its 
area and mass, the plasma membrane is only a minor membrane in most 
eukaryotic cells. 


Much can be learned about the composition and function of an organelle 
once it has been isolated from other cell structures. For the most part, 
organelles are far too small to be isolated by hand, but it is possible to 
separate one type of organelle from another by differential centrifugation 
(described in Panel 4-3, pp. 164-165). Once a purified sample of one type 
of organelle has been obtained, the organelle’s proteins can be identified. 
In many cases, the organelle itself can be incubated in a test tube under 
conditions that allow its functions to be studied. Isolated mitochondria, 
for example, can produce ATP from the oxidation of pyruvate to CO2 and 
water, provided they are adequately supplied with ADP, inorganic phos- 
phate, and O2. 


TABLE 15-2 THE RELATIVE VOLUMES AND NUMBERS OF THE MAJOR 
MEMBRANE-ENCLOSED ORGANELLES IN A LIVER CELL (HEPATOCYTE) 


Intracellular Compartment Percentage of Approximate 
Total Cell Volume Number per Cell 


Cytosol 54 
Mitochondria 22 1700 
Endoplasmic reticulum 12 
Nucleus 6 
Golgi apparatus 3 
Peroxisomes 1 400 
Lysosomes 1 300 


Endosomes 1 200 
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Membrane-enclosed Organelles Evolved in Different Ways 


In trying to understand the relationships between the different compart- 
ments of a modern eukaryotic cell, it is helpful to consider how they 
evolved. The precursors of the first eukaryotic cells are thought to have 
been simple microorganisms, resembling present-day archaea, which had 
a plasma membrane but no internal membranes. The plasma membrane 
in such cells would have provided all membrane-dependent functions, 
including ATP synthesis and lipid synthesis, as does the plasma mem- 
brane in most modern prokaryotes. Archaea and bacteria can get by with 
this arrangement because of their small size, which gives them a high 
surface-to-volume ratio: their plasma membrane area is thus sufficient to 
sustain all the vital functions for which membranes are required. Present- 
day eukaryotic cells, by contrast, have volumes that are 1000 to 10,000 
times greater. Such a large cell has a small surface-to-volume ratio and 
presumably could not survive with a plasma membrane as its only mem- 
brane. Thus, the increase in size typical of eukaryotic cells probably could 
not have occurred without the development of internal membranes. 


Membrane-enclosed organelles are thought to have arisen in evolu- 
tion in stages. The nuclear membranes and the membranes of the ER, 
Golgi apparatus, endosomes, and lysosomes most likely originated by 
invagination of the plasma membrane, as illustrated for the nuclear and 
ER membranes in Figure 15-3. The ER, Golgi apparatus, peroxisomes, 
endosomes, and lysosomes are all part of what is collectively called the 
endomembrane system. As we discuss later, the interiors of these organ- 
elles communicate extensively with one another and with the outside 
of the cell by means of small vesicles that bud off from one of these 
organelles and fuse with another. Consistent with this proposed evolu- 
tionary origin, the interiors of these organelles are treated by the cell in 
many ways as “extracellular,” as we will see. The hypothetical scheme 
shown in Figure 15-3 also explains why the nucleus is surrounded by two 
membranes. 


Mitochondria and chloroplasts are thought to have originated in a differ- 
ent way. They differ from all other organelles in that they possess their 
own small genomes and can make some of their own proteins, as dis- 
cussed in Chapter 14. The similarity of their genomes to those of bacteria 
and the close resemblance of some of their proteins to bacterial proteins 
strongly suggest that both these organelles evolved from bacteria that 
were engulfed by primitive eukaryotic cells with which they initially lived 
in symbiosis (Figure 15-4). As might be expected from their origins, mito- 
chondria and chloroplasts remain isolated from the extensive vesicular 
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Figure 15-3 Nuclear membranes and 

the ER may have evolved through 
invagination of the plasma membrane. 

In modern bacteria and archaea, a single 
DNA molecule is typically attached to the 
plasma membrane. It is possible that, in 

a very ancient anaerobic archaeon, the 
plasma membrane, with its attached DNA, 
could have invaginated and, in subsequent 
generations, formed a two-layered envelope 
of membrane completely surrounding the 
DNA. This envelope is presumed to have 
eventually pinched off completely from the 
plasma membrane, ultimately producing 

a nuclear compartment penetrated by 
channels called nuclear pores, which enable 
communication with the cytosol. Other 
portions of the invaginated membrane may 
have formed the ER, which would explain 
why the space between the inner and outer 
nuclear membranes is continuous with the 
ER lumen. 


QUESTION 15-1 


As shown in the drawings in 
Figure 15-3, the lipid bilayer 

of the inner and outer nuclear 
membranes forms a continuous 
sheet, joined around the nuclear 
pores. As membranes are two- 
dimensional fluids, this would 
imply that membrane proteins can 
diffuse freely between the two 
nuclear membranes. Yet each of 
these two nuclear membranes has 
a different protein composition, 
reflecting different functions. How 
could you reconcile this apparent 
contradiction? 
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Figure 15-4 Mitochondria are thought a aa 
to have originated when an aerobic eukaryotic cell eukaryotic cell 
bacterium was engulfed by a larger internal 
anaerobic eukaryotic cell. Chloroplasts membranes 
are thought to have originated later in a 
similar way, when a eukaryotic cell with 
mitochondria engulfed a photosynthetic 
bacterium. This theory would explain why 
these organelles have two membranes, 


possess their own genomes, and do 
not participate in the vesicular traffic 
that connects the compartments of the 
endomembrane system. 


nucleus 
= = O => 
plasma O 
membrane / 


bacterial plasma membrane DEGRADATION OF mitochondria with 
bacterial outer membrane PLASMA MEMBRANE double membrane 


aerobic DERIVED FROM 
bacterium EUKARYOTIC CELL 


traffic that connects the interiors of most of the other membrane-enclosed 
organelles to one another and to the outside of the cell. 


PROTEIN SORTING 


Before a eukaryotic cell divides, it must duplicate its membrane-enclosed 
organelles. As cells grow, membrane-enclosed organelles enlarge by 
incorporation of new molecules; the organelles then divide and, during 
cell division, are distributed between the two daughter cells. Organelle 
growth requires a supply of new lipids to make more membrane and 
a supply of the appropriate proteins—both membrane proteins and the 
soluble proteins that will occupy the interior of the organelle. Even in 
cells that are not dividing, proteins are being produced continually. These 
newly synthesized proteins must be accurately delivered to their appro- 
priate organelle—some for eventual secretion from the cell and some 
to replace organelle proteins that have been degraded. Directing newly 
made proteins to their correct organelle is therefore necessary for any 
cell to grow and divide, or just to function properly. 


For some organelles, including mitochondria, chloroplasts, and the inte- 
rior of the nucleus, proteins are delivered directly from the cytosol. For 
others, including the Golgi apparatus, lysosomes, endosomes, and the 
inner nuclear membrane, proteins and lipids are delivered indirectly via 
the ER, which is itself a major site of lipid and protein synthesis. Proteins 
enter the ER directly from the cytosol: some are retained there, but most 
are transported by vesicles to the Golgi apparatus and then onward to the 
plasma membrane or to other organelles. Peroxisomes make use of both 
pathways. Although these organelles acquire some of their membrane 
proteins from the ER, the bulk of their digestive enzymes enter directly 
from the cytosol. 


In this section, we discuss the mechanisms by which proteins enter 
membrane-enclosed organelles from the cytosol. Proteins made in the 
cytosol are dispatched to different locations in the cell according to spe- 
cific address labels contained in their amino acid sequence. Once at the 
correct address, the protein enters either the membrane or the interior 
lumen of its designated organelle. 


Proteins Are Transported into Organelles by Three 
Mechanisms 


The synthesis of virtually all proteins in the cell begins on ribosomes in 
the cytosol. The exceptions are the few mitochondrial and chloroplast 


proteins that are synthesized on ribosomes inside these organelles; most 
mitochondrial and chloroplast proteins, however, are made in the cytosol 
and subsequently imported. The fate of any protein molecule synthesized 
in the cytosol depends on its amino acid sequence, which can contain 
a sorting signal that directs the protein to the organelle in which it is 
required. Proteins that lack such signals remain as permanent residents 
of the cytosol; those that possess a sorting signal move from the cytosol 
to the appropriate organelle. Different sorting signals direct proteins into 
the nucleus, mitochondria, chloroplasts (in plants), peroxisomes, and 
the ER. 


When a membrane-enclosed organelle imports a water-soluble protein 
to its interior—either from the cytosol or from another organelle—it faces 
a problem: the protein must be transported across its membrane (or 
membranes), which is normally impermeable to hydrophilic macromol- 
ecules. How this task is accomplished depends on the organelle. 


1. Proteins moving from the cytosol into the nucleus are transported 
through the nuclear pores, which penetrate both the inner and outer 
nuclear membranes. The pores function as selective gates that actively 
transport specific macromolecules but also allow free diffusion of 
smaller molecules (mechanism 1 in Figure 15-5). 

2. Proteins moving from the cytosol into the ER, mitochondria, or 
chloroplasts are transported across the organelle membrane by 
protein translocators located in the membrane. Unlike the transport 
through nuclear pores, the transported protein must usually unfold 
for the translocator to guide it across the hydrophic interior of the 
membrane (mechanism 2 in Figure 15-5). Bacteria have similar 
protein translocators in their plasma membrane, which they use to 
export proteins from the cytosol to the cell exterior. 

3. Proteins moving onward from the ER—and from one compartment 
of the endomembrane system to another—are transported by 
a mechanism that is fundamentally different than the ones just 
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Figure 15-5 Membrane-enclosed 
organelles import proteins by one of 
three mechanisms. All of these processes 
require energy. The protein remains folded 
during transport in mechanisms 1 and 

3 but usually has to be unfolded during 
mechanism 2. 
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TABLE 15-3 SOME TYPICAL SIGNAL SEQUENCES 
Function of Signal Example of Signal Sequence 


Import into ER *H3N-Met-Met-Ser-Phe-Val-Ser-Leu-Leu-Leu-Va|-Gly- 
Ile-Leu-Phe-Trp-Ala-Thr-Glu-Ala-Glu-GIn-Leu-Thr- 
Lys-Cys-Glu-Val-Phe-Gln- 


Retention in lumen of ER — -Lys-Asp-Glu-Leu-COO7 


Import into mitochondria | *H3N-Met-Leu-Ser-Leu-Arg-Gln-Ser-lle-Arg-Phe- 
g g 
Phe-Lys-Pro-Ala-Thr-Arg-Thr-Leu-Cys-Ser-Ser-Arg- 


Tyr-Leu-Leu- 
Import into nucleus -Pro-Pro-Lys-Lys-Lys-Arg-Lys-Val- 
Export from nucleus -Met-Glu-Glu-Leu-Ser-Gln-Ala-Leu-Ala-Ser-Ser-Phe- 
Import into peroxisomes -Ser-Lys-Leu- 


Positively charged amino acids are shown in red and negatively charged amino 
acids in blue. Important hydrophobic amino acids are shown in green. 
*H3N indicates the N-terminus of a protein; COO- indicates the C-terminus. 


described. These proteins are ferried by transport vesicles, which pinch 
off from the membrane of one compartment and then fuse with the 
membrane of a second compartment (mechanism 3 in Figure 15-5). In 
this process, transport vesicles deliver soluble cargo proteins, as well 
as the proteins and lipids that are part of the vesicle membrane. 


Signal Sequences Direct Proteins to the Correct 
Compartment 


The typical sorting signal on a protein is a continuous stretch of amino 
acid sequence, typically 15-60 amino acids long. This signal sequence is 
often (but not always) removed from the finished protein once it has been 
sorted. Some of the signal sequences used to specify different destina- 
tions in the cell are shown in Table 15-3. 


Signal sequences are both necessary and sufficient to direct a protein to 
a particular destination. This has been shown by experiments in which 
the sequence is either deleted or transferred from one protein to another 
by genetic engineering techniques (discussed in Chapter 10). Deleting a 
signal sequence from an ER protein, for example, converts it into a cyto- 
solic protein, while placing an ER signal sequence at the beginning of a 
cytosolic protein redirects the protein to the ER (Figure 15-6). The signal 
sequences specifying the same destination can vary greatly even though 


cytosolic protein ER protein with signal 
(no signal sequence) sequence removed 


ER signal 
sequence removed 
from ER protein 
and attached 


Figure 15-6 Signal sequences direct 
proteins to the correct destination. 

(A) Proteins destined for the ER possess 
an N-terminal signal sequence that directs 
them to that organelle, whereas those 
destined to remain in the cytosol lack any 


to cytosolic 
protein 


such signal sequence. (B) Recombinant 

DNA techniques can be used to change the Neat 

destination of the two proteins: if the signal 

sequence is removed from an ER protein ER protein ER signal sequence cytosolic protein with 


and attached to a cytosolic protein, both 
proteins are reassigned to the expected, 


inappropriate location. (A) NORMAL SIGNAL SEQUENCE 


(B) 


added ER signal sequence 


RELOCATED SIGNAL SEQUENCE 


they have the same function: physical properties such as hydrophobicity 
or the placement of charged amino acids often appear to be more impor- 
tant for the function of these signals than the exact amino acid sequence. 


Proteins Enter the Nucleus Through Nuclear Pores 


The nuclear envelope, which encloses the nuclear DNA and defines the 
nuclear compartment, is formed from two concentric membranes. The 
inner nuclear membrane contains some proteins that act as binding sites 
for the chromosomes (discussed in Chapter 5) and others that provide 
anchorage for the nuclear lamina, a finely woven meshwork of protein fil- 
aments that lines the inner face of this membrane and provides structural 
support for the nuclear envelope (discussed in Chapter 17). The composi- 
tion of the outer nuclear membrane closely resembles the membrane of 
the ER, with which it is continuous (Figure 15-7). 


The nuclear envelope in all eukaryotic cells is perforated by nuclear 
pores that form the gates through which molecules enter or leave the 
nucleus. A nuclear pore is a large, elaborate structure composed of a 
complex of about 30 different proteins, each present in multiple copies 
(Figure 15-8). Many of the proteins that line the nuclear pore contain 
extensive, unstructured regions in which the polypeptide chains are 
largely disordered. These disordered segments form a soft, tangled 
meshwork—like a kelp forest—that fills the center of the channel, pre- 
venting the passage of large molecules but allowing small, water-soluble 
molecules to pass freely and nonselectively between the nucleus and the 
cytosol. 


Selected larger molecules and macromolecular complexes also need to 
pass through nuclear pores. RNA molecules, which are synthesized in 
the nucleus, and ribosomal subunits, which are assembled there, must 
be exported to the cytosol (discussed in Chapter 7). Newly made proteins 
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Figure 15-7 The outer nuclear membrane 
is continuous with the ER membrane. The 
double membrane of the nuclear envelope 
is penetrated by nuclear pores. The 
ribosomes that are normally bound to the 
cytosolic surface of the ER membrane and 
outer nuclear membrane are not shown. 


Figure 15-8 The nuclear pore complex 
forms a gate through which selected 
macromolecules and larger complexes 
enter or exit the nucleus. (A) Drawing 

of a small region of the nuclear envelope 
showing two pores. Protein fibrils protrude 
from both sides of the pore complex; on 
the nuclear side, they converge to form a 
basketlike structure. The spacing between 
the fibrils is wide enough that the fibrils 
do not obstruct access to the pores. 

(B) Electron micrograph of a region of 
nuclear envelope showing a side view of 
two nuclear pores (brackets). (C) Electron 
micrograph showing a face-on view of 
nuclear pore protein complexes; the 
membranes have been extracted with 
detergent. (B, courtesy of Werner W. 
Franke; C, courtesy of Ron Milligan.) 
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prospective — P, — nuclear Figure 15-9 Prospective nuclear proteins are imported from 
nuclear protein localization signal the cytosol through nuclear pores. The proteins contain a nuclear 
(cargo) localization signal that is recognized by nuclear import receptors, 


cn which interact with the cytosolic fibrils that extend from the rim of the 


receptor pore. After being captured, the receptors with their cargo jostle their 
way through the gel-like meshwork formed from the unstructured 
regions of the nuclear pore proteins until nuclear entry triggers cargo 
release. After cargo delivery, the receptors return to the cytosol via 
nuclear pores for reuse. Similar types of transport receptors, operating 
in the reverse direction, export mRNAs from the nucleus (see Figure 
7-25). These sets of import and export receptors have a similar basic 
structure. 
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that are destined for the nucleus must also be imported from the cytosol 
(Movie 15.1). To gain entry to a pore, these large molecules and mac- 
Ñ gel-like romolecular complexes must display an appropriate sorting signal. The 


nuer balet meshwork of signal sequence that directs a protein from the cytosol into the nucleus, 
sone as called a nuclear localization signal, typically consists of one or two short 
S sequences containing several positively charged lysines or arginines (see 

Table 15-3). 


The nuclear localization signal on proteins destined for the nucleus is 

E recognized by cytosolic proteins called nuclear import receptors. These 
receptors help direct a newly synthesized protein to a nuclear pore by 
interacting with the tentacle-like fibrils that extend from the rim of the 
pore into the cytosol (Figure 15-9). Once there, the nuclear import recep- 
tor penetrates the pore by grabbing onto short, repeated amino acid 
sequences within the tangle of nuclear pore proteins that fill the center of 
the pore. When the nuclear pore is empty, these repeated sequences bind 
to one another, forming a loosely packed gel. Nuclear import receptors 
interrupt these interactions, and they open a local passageway through 
the meshwork. The import receptors then bump along from one repeat 
sequence to the next, until they enter the nucleus and deliver their cargo. 
The empty receptor then returns to the cytosol via the nuclear pore for 
reuse (see Figure 15-9). 


The import of nuclear proteins is powered by energy provided by the 
hydrolysis of GTP. This hydrolysis is mediated by a monomeric GTPase 
named Ran. Like other GTPases, Ran exists in two conformations: one 
bearing a molecule of GTP, the other GDP. These forms, however, are 
differently localized: Ran-GTP is present in high concentrations in the 
nucleus, whereas Ran-GDP is produced in the cytosol (Figure 15-10A). 
In the nucleus, Ran-GTP displaces the prospective nuclear protein from 
its receptor, allowing the imported protein to be released. The import 
receptor—now bearing Ran-GTP—returns to the cytosol, where hydroly- 
sis of GTP allows Ran-GDP to dissociate, leaving the receptor free to pick 
up another protein destined for the nucleus. In this way, GTP hydrolysis 
drives nuclear transport in the appropriate direction (Figure 15-10B). 


Nuclear export receptors work in a similar way, driving protein and RNA 
traffic from the nucleus to the cytosol. They recognize nuclear export sig- 
nals, which are different from those specifying import (see Table 15-3), 
and they also use Ran to couple the transport to an energy source. 


Nuclear pore proteins operate this molecular gate at an amazing speed, 

E rapidly pumping macromolecules in both directions through each pore. 

QUESTION 15-2 Proteins are transported into the nucleus in their fully folded conforma- 
tion and ribosomal components as assembled particles. This feature 
nucleus as a separate compartment distinguishes the nuclear transport mechanism from the mechanisms 
when prokaryotic cells can manage that transport proteins into most other organelles. Proteins have to 


perfectly well without? unfold to cross the membranes of mitochondria and chloroplasts, as we 
discuss next. 


Why do eukaryotic cells require a 
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Figure 15-10 Energy supplied by GTP hydrolysis drives nuclear transport. (A) The small monomeric GTPase, 
Ran, exists in two conformations—one carrying GTP, the other GDP (see Figure 4-48 or 16-12). Ran is converted 
from one conformation to the other with the help of accessory proteins that are differently localized. The accessory 
protein that triggers GTP hydrolysis, called Ran-GAP (GTPase-activating protein), is found exclusively in the cytosol, 
where it converts Ran-GTP to Ran-GDP. The accessory protein that causes Ran-GDP to release its GDP and take up 
GTP, called Ran-GEF (guanine nucleotide exchange factor), is found exclusively in the nucleus. The localization of 
these accessory proteins guarantees that the concentration of Ran-GTP is higher in the nucleus, thus driving the 
nuclear import cycle in the desired direction. (B) A nuclear import receptor picks up a prospective nuclear protein 

in the cytosol and enters the nucleus. There it encounters Ran-GTP, which binds to the import receptor, causing it 
to release the nuclear protein. Having discharged its cargo in the nucleus, the receptor—still carrying Ran-GTP—is 
transported back through the pore to the cytosol, where Ran hydrolyzes its bound GTP. Ran-GDP falls off the import 
receptor, which is then free to bind another protein destined for the nucleus. Ran-GDP is carried into the nucleus by 
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its own unique import receptor (not shown). 


Proteins Unfold to Enter Mitochondria and Chloroplasts 


Both mitochondria and chloroplasts are surrounded by inner and outer 
membranes, and both organelles specialize in the synthesis of ATP. 
Chloroplasts also contain a third membrane system, the thylakoid mem- 
brane (discussed in Chapter 14). Although both organelles contain their 
own genomes and make some of their own proteins, most mitochondrial 
and chloroplast proteins are encoded by genes in the nucleus and are 
imported from the cytosol. These proteins usually have a signal sequence 
at their N-terminus that allows them to enter their specific organelle. 
Proteins destined for either organelle are translocated simultaneously 
across both the inner and outer membranes at specialized sites where 
the two membranes are closely apposed. Each protein is unfolded as it 
is transported, and its signal sequence is removed after translocation is 
complete (Figure 15-11). 


Chaperone proteins (discussed in Chapter 4) inside the organelles help to 
pull the protein across the two membranes and to fold it once it is inside. 
Subsequent transport to a particular site within the organelle, such as 
the inner or outer membrane or the thylakoid membrane in chloroplasts, 
usually requires further sorting signals in the protein, which are often 
only exposed after the first signal sequence has been removed. The inser- 
tion of transmembrane proteins into the inner membrane, for example, is 
guided by signal sequences in the protein that start and stop the transfer 
process across the membrane, as we describe later for the insertion of 
transmembrane proteins in the ER membrane. 


The growth and maintenance of mitochondria and chloroplasts require 
not only the import of new proteins but also the incorporation of new 
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Figure 15-11 Mitochondrial precursor proteins are unfolded during import. (A) A mitochondrion has an outer 
and inner membrane, both of which must be crossed for a mitochondrial precursor protein to enter the organelle. 
(B) To initiate transport, the mitochondrial signal sequence on a mitochondrial precursor protein is recognized by 
a receptor in the outer mitochondrial membrane. This receptor is associated with a protein translocator, which 
transports the signal sequence across the outer mitochondrial membrane to the intermembrane space. The 
complex of receptor, precursor protein, and translocator then diffuses laterally in the outer membrane until the 
signal sequence is recognized by a second translocator in the inner membrane. Together, the two translocators 
transport the protein across both the outer and inner membranes, unfolding the protein in the process (Movie 15.2). 
The signal sequence is finally cleaved off by a signal peptidase in the mitochondrial matrix. Proteins are imported 
into chloroplasts by a similar mechanism. The chaperone proteins that help pull the protein across the membranes 
and help it to refold are not shown. Some of the energy needed for this protein translocation comes from the 
hydrolysis of ATP, which allows the chaperones to function. 


lipids into the organelle membranes. Most of their membrane phospholip- 
ids are thought to be imported from the ER, which is the main site of lipid 
synthesis in the cell. Phospholipids are transported to these organelles 
by lipid-carrying proteins that extract a phospholipid molecule from one 
membrane and deliver it into another. Such transport frequently occurs at 
specific junctions where the membranes of different organelles are held 
in close proximity. By controlling which lipids are transported, the dif- 
ferent cell membranes are able to maintain different lipid compositions. 


Proteins Enter Peroxisomes from both the Cytosol and 
the Endoplasmic Reticulum 


Peroxisomes are packed with enzymes that digest toxins and synthe- 
size certain phospholipids, including those present in the myelin sheath 
surrounding nerve cell axons. These organelles acquire the bulk of their 
proteins via selective transport from the cytosol. A short sequence of 
only three amino acids serves as an import signal for many peroxiso- 
mal proteins (see Table 15-3, p. 502). This sequence is recognized by 
receptor proteins in the cytosol, at least one of which escorts its cargo 
protein all the way into the peroxisome before returning to the cytosol. 
Like the membranes of mitochondria and chloroplasts, the peroxisomal 
membrane contains a translocator that aids in protein transport. Unlike 
the mechanism that operates in mitochondria and chloroplasts, however, 
proteins do not need to unfold to enter the peroxisome—and the trans- 
port mechanism is still mysterious. 


Although most peroxisomal proteins come from the cytosol, a few of the 
proteins embedded in the peroxisomal membrane arrive via vesicles that 
bud from the ER. The vesicles either fuse with preexisting peroxisomes 
or import additional peroxisomal proteins from the cytosol to grow into 
mature peroxisomes. 


Mutations that block peroxisomal protein import can cause severe illness. 
Individuals with Zellweger syndrome, for example, are born with severe 
abnormalities in their brain, liver, and kidneys. Most do not survive past 


the first six months of life—a grim reminder of the crucial importance of 
peroxisomes, and peroxisomal protein transport, for proper cell function 
and for the health of the organism. 


Proteins Enter the Endoplasmic Reticulum While Being 
Synthesized 


The endoplasmic reticulum is the most extensive membrane system in a 
eukaryotic cell (Figure 15-12A). Unlike the organelles discussed so far, it 
serves as an entry point for proteins destined for other organelles, as well 
as for the ER itself. Proteins destined for the Golgi apparatus, endosomes, 
and lysosomes, as well as proteins destined for the cell surface, all first 
enter the ER from the cytosol. Once inside the ER lumen, or embedded in 
the ER membrane, individual proteins will not re-enter the cytosol during 
their onward journey. They will instead be ferried by transport vesicles 
from organelle to organelle within the endomembrane system, or to the 
plasma membrane (see Figure 15-5). 


Two kinds of proteins are transferred from the cytosol to the ER: (1) water- 
soluble proteins are completely translocated across the ER membrane 
and are released into the ER lumen; (2) prospective transmembrane pro- 
teins are only partly translocated across the ER membrane and become 
embedded in it. The water-soluble proteins are destined either for secre- 
tion (by release at the cell surface) or for the lumen of an organelle of 
the endomembrane system. The transmembrane proteins are destined 
to reside in the membrane of one of these organelles or in the plasma 
membrane. All of these proteins are initially directed to the ER by an ER 
signal sequence, a segment of eight or more hydrophobic amino acids 
(see Table 15-3, p. 502), which is also involved in the process of translo- 
cation across the membrane. 


Unlike the proteins that enter the nucleus, mitochondria, chloroplasts, 
or peroxisomes, most of the proteins that enter the ER begin to be 
threaded across the ER membrane before the polypeptide chain has been 
completely synthesized. This requires that the ribosome synthesizing 
the protein be attached to the ER membrane. These membrane-bound 
ribosomes coat the surface of the ER, creating regions termed rough 
endoplasmic reticulum because of its characteristic beaded appearance 
when viewed in an electron microscope (Figure 15-12B). 
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Figure 15-12 The endoplasmic reticulum 
is the most extensive membrane network 
in eukaryotic cells. (A) Fluorescence 
micrograph of a living plant cell showing 
the ER as a complex network of tubes. 

The cell shown here has been genetically 
engineered so that it contains a fluorescent 
protein in the ER lumen. Only part of the 

ER network in the cell is shown. (B) An 
electron micrograph showing the rough 

ER in a cell from a dog's pancreas, which 
makes and secretes large amounts of 
digestive enzymes. The cytosol is filled with 
closely packed sheets of ER, studded with 
ribosomes. A portion of the nucleus and its 
nuclear envelope can be seen at the bottom 
left; note that the outer nuclear membrane, 
which is continuous with the ER, is also 
studded with ribosomes. For a dynamic view 
of the ER network, watch Movie 15.3. 

(A, from P. Boevink et al., The Plant Journal 
15:441-447, 1998. With permission from 
John Wiley & Sons; B, courtesy of Lelio Orci.) 
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Figure 15-13 A common pool of mRNA encoding a protein 
ribosomes is used to synthesize all the with no ER signal sequence 
proteins encoded by the nuclear genome. rëmains:tree iricytosol 
Ribosomes that are translating proteins 


with no ER signal sequence remain free in s \ 
the cytosol. Ribosomes that are translating —_ 
proteins containing an ER signal sequence 3’ 5 
(red) on the growing polypeptide chain will 
be directed to the ER membrane. Many 
ribosomes bind to each mRNA molecule, 
forming a polyribosome. At the end of each f , 
round of protein synthesis, the ribosomal P ain. 
subunits are released and rejoin the common 
pool in the cytosol. As we see shortly, how 
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There are, therefore, two separate populations of ribosomes in the cyto- 
sol. Membrane-bound ribosomes are attached to the cytosolic side of the 
ER membrane (and outer nuclear membrane) and are making proteins 
that are being translocated into the ER. Free ribosomes are unattached 
to any membrane and are making all of the other proteins encoded by 
the nuclear DNA. Membrane-bound ribosomes and free ribosomes are 
structurally and functionally identical; they differ only in the proteins they 
are making at any given time. When a ribosome happens to be making a 
protein with an ER signal sequence, the signal sequence directs the ribo- 
some to the ER membrane. Because proteins with an ER signal sequence 
are translocated as they are being made, no additional energy is required 
for their transport; the elongation of each polypeptide provides the thrust 
needed to push the growing chain through the ER membrane. 


As an mRNA molecule is translated, many ribosomes bind to it, form- 
ing a polyribosome (discussed in Chapter 7). In the case of an MRNA 
molecule directing synthesis of a protein with an ER signal sequence, 
the polyribosome becomes riveted to the ER membrane by the growing 
polypeptide chains, which have become inserted into the ER membrane 
(Figure 15-13). 


Soluble Proteins Made on the ER Are Released into the 
ER Lumen 
Two protein components help guide ER signal sequences to the ER mem- 


brane: (1) a signal-recognition particle (SRP), present in the cytosol, binds 
to both the ribosome and the ER signal sequence as it emerges from 
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the ribosome; and (2) an SRP receptor, embedded in the ER membrane, 
recognizes the SRP. Binding of an SRP to a ribosome that displays an 
ER signal sequence slows protein synthesis by that ribosome until the 
SRP engages with an SRP receptor on the ER. Once bound, the SRP is 
released, the receptor passes the ribosome to a protein translocator in 
the ER membrane, and protein synthesis recommences. The polypep- 
tide is then threaded across the ER membrane through a channel in the 
translocator (Figure 15-14). The SRP and SRP receptor thus function as 
molecular matchmakers, bringing together ribosomes that are synthesiz- 
ing proteins with an ER signal sequence and protein translocators within 
the ER membrane. 


In addition to directing proteins to the ER, the signal sequence—which 
for soluble proteins is almost always at the N-terminus, the end synthe- 
sized first—functions to open the protein translocator. This sequence 
remains bound to the translocator, while the rest of the polypeptide chain 
is threaded through the membrane as a large loop. The signal sequence 
is removed by a transmembrane signal peptidase, which has an active 
site facing the lumenal side of the ER membrane. The cleaved signal 
sequence is then released from the protein translocator into the lipid 
bilayer and rapidly degraded. 


Once the C-terminus of a soluble protein has passed through the translo- 
cator, the protein is released into the ER lumen (Figure 15-15). 


Start and Stop Signals Determine the Arrangement of a 
Transmembrane Protein in the Lipid Bilayer 
Not all proteins made by ER-bound ribosomes are released into the ER 


lumen. Some remain embedded in the ER membrane as transmembrane 
proteins. The translocation process for such proteins is more complicated 
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Figure 15-14 An ER signal sequence 

and an SRP direct a ribosome to the ER 
membrane. The SRP (brown) binds to both 
the exposed ER signal sequence and the 
ribosome, thereby slowing protein synthesis 
by the ribosome. The SRP-ribosome 
complex then binds to an SRP receptor 
(dark blue) in the ER membrane. The SRP is 
released, and the ribosome passes from the 
SRP receptor to a protein translocator (light 
blue) in the ER membrane. Protein synthesis 
resumes, and the translocator starts to 
transfer the growing polypeptide across the 
lipid bilayer. 


QUESTION 15-3 


Explain how an mRNA molecule 

can remain attached to the 

ER membrane while individual 
ribosomes translating it are released 
and rejoin the cytosolic pool of 
ribosomes after each round of 
translation. 


Figure 15-15 A soluble protein crosses 
the ER membrane and enters the lumen. 
The protein translocator binds the signal 
sequence and threads the rest of the 
polypeptide across the lipid bilayer as a 
loop. At some point during the translocation 
process, the signal peptide is cleaved 

from the growing protein by a signal 
peptidase (yellow). This cleaved signal 
sequence is ejected into the bilayer, where 
it is degraded. Once protein synthesis is 
complete, the translocated polypeptide is 
released as a soluble protein into the ER 
lumen, and the protein translocator closes. 
The membrane-bound ribosome is omitted 
from this and the following two figures for 
clarity. 
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QUESTION 15-4 


A. Predict the membrane 
orientation of a protein that is 
synthesized with an uncleaved, 
internal signal sequence (shown as 
the red start-transfer sequence in 
Figure 15-17) but does not contain a 
stop-transfer sequence. 

B. Similarly, predict the membrane 
orientation of a protein that is 
synthesized with an N-terminal 
cleaved signal sequence followed by 
a stop-transfer sequence, followed 
by a start-transfer sequence. 

C. What arrangement of signal 
sequences would enable the 
insertion of a multipass protein with 
an odd number of transmembrane 
segments? 
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than it is for soluble proteins, as some parts of the polypeptide chain must 
be translocated completely across the lipid bilayer, whereas other parts 
remain fixed within the membrane. 


In the simplest case, that of a transmembrane protein with a single 
membrane-spanning segment, the N-terminal signal sequence initiates 
translocation—as it does for a soluble protein. But the transfer process 
is then halted by an additional sequence of hydrophobic amino acids, a 
stop-transfer sequence, further along the polypeptide chain. At this point, 
the protein translocator releases the growing polypeptide chain side- 
ways into the lipid bilayer. The N-terminal signal sequence is cleaved off, 
and the stop-transfer sequence remains in the bilayer, where it forms an 
a-helical membrane-spanning segment that anchors the protein in the 
membrane. As a result, the protein ends up as a single-pass transmem- 
brane protein inserted in the membrane with a defined orientation—the 
N-terminus on the lumenal side of the lipid bilayer and the C-terminus 
on the cytosolic side (Figure 15-16). Once inserted into the membrane, 
a transmembrane protein will never change its orientation; its cytosolic 
portion will always remain in the cytosol, even if the protein is subse- 
quently transported to another organelle via vesicle budding and fusion 
(see Figure 11-18). 


In some transmembrane proteins, an internal, rather than an N-terminal, 
signal sequence is used to start the protein transfer; this internal sig- 
nal sequence, called a start-transfer sequence, is never removed from 
the polypeptide. This arrangement occurs in some transmembrane pro- 
teins in which the polypeptide chain passes back and forth across the 
lipid bilayer. In these cases, hydrophobic signal sequences are thought 
to work in pairs: an internal start-transfer sequence serves to initiate 
translocation, which continues until a stop-transfer sequence is reached; 
the two hydrophobic sequences are then released into the bilayer, where 
they remain as membrane-spanning a helices (Figure 15-17). In complex 
multipass proteins, in which many hydrophobic a helices span the bilayer, 
additional pairs of start- and stop-transfer sequences come into play: one 
sequence reinitiates translocation further down the polypeptide chain, 
and the other stops translocation and causes polypeptide release, and 
so on for subsequent starts and stops. In this way, multipass membrane 
proteins are stitched into the lipid bilayer as they are being synthesized, 
by a mechanism resembling the workings of a sewing machine. 
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polypeptide chain is discharged into 
the lipid bilayer. The N-terminal signal 
sequence is cleaved off, leaving the 
transmembrane protein anchored in 
the membrane (Movie 15.4). Protein 
synthesis on the cytosolic side then 
continues to completion. 
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Having considered how proteins enter the ER lumen or become embed- 
ded in the ER membrane, we now discuss how they are carried onward 
by vesicular transport. 


VESICULAR TRANSPORT 


Entry into the ER lumen or membrane is usually only the first step on 
the pathway to another destination. That destination, initially at least, 
is generally the Golgi apparatus; there, proteins and lipids are modified 
and sorted for shipment to other sites. Transport from the ER to the Golgi 
apparatus—and from the Golgi apparatus to other compartments of the 
endomembrane system—is carried out by the continual budding and 
fusion of transport vesicles. This vesicular transport extends outward 
from the ER to the plasma membrane, where it allows proteins and other 
molecules to be secreted by exocytosis, and it reaches inward from the 
plasma membrane to lysosomes, allowing extracellular molecules to be 
imported by endocytosis (Figure 15—18). Together, these pathways thus 
provide routes of communication between the interior of the cell and its 
surroundings. 


In this section, we discuss how vesicles shuttle proteins and membranes 
between intracellular compartments, allowing cells to eat, drink, and 
secrete. We also consider how these transport vesicles are directed to 
their proper destination, be it an organelle of the endomembrane system 
or the plasma membrane. 


Transport Vesicles Carry Soluble Proteins and 
Membrane Between Compartments 


Vesicular transport between membrane-enclosed compartments of the 
endomembrane system is highly organized. A major outward secre- 
tory pathway starts with the synthesis of proteins on the ER membrane 
and their entry into the ER, and it leads through the Golgi apparatus to 
the cell surface; at the Golgi apparatus, a side branch leads off through 
endosomes to lysosomes. A major inward endocytic pathway, which is 
responsible for the ingestion and degradation of extracellular molecules, 
moves materials from the plasma membrane, through endosomes, to 
lysosomes (Figure 15-19). 
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Figure 15-17 A double-pass 
transmembrane protein has an internal 
ER signal sequence. This internal 
sequence (red) not only acts as a start- 
transfer signal, it also helps to anchor 
the final protein in the membrane. Like 
the N-terminal ER signal sequence, the 
internal signal sequence is recognized 
by an SRP, which brings the ribosome to 
the ER membrane (not shown). When a 
stop-transfer sequence (orange) enters 
the protein translocator, the translocator 
discharges both sequences into the lipid 
bilayer. Neither the start-transfer nor the 
stop-transfer sequence is cleaved off, 
and the entire polypeptide chain remains 
anchored in the membrane as a double- 
pass transmembrane protein. Proteins 
that span the membrane more than twice 
contain additional pairs of start- and 
stop-transfer sequences, and the same 
process is repeated for each pair. 
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Figure 15-18 Vesicular transport allows 
materials to exit or enter the cell. 

(A) During exocytosis, a vesicle fuses with 
the plasma membrane, releasing its content 
to the cell's surroundings. (B) During 
endocytosis, extracellular materials are 
captured by vesicles that bud inward from 
the plasma membrane and are carried into 
the cell. 
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Figure 15-19 Transport vesicles bud from 
one membrane and fuse with another, 
carrying membrane components and 
soluble proteins between compartments 
of the endomembrane system and the 
plasma membrane. The membrane of 
each compartment or vesicle maintains its 
orientation, so the cytosolic side always 
faces the cytosol and the noncytosolic 

side faces the lumen of the compartment 
or the outside of the cell (see Figure 
11-18). The extracellular space and each 
of the membrane-enclosed compartments 
(shaded gray) communicate with one 
another by means of transport vesicles, as 
shown. In the inward endocytic pathway 
(green arrows), extracellular molecules 

are ingested (endocytosed) in vesicles 
derived from the plasma membrane and are 
delivered to early endosomes and, usually, 
on to lysosomes via late endosomes. In the 
outward secretory pathway (red arrows), 
protein molecules are transported from 
the ER, through the Golgi apparatus, to 
the plasma membrane or (via early and 
late endosomes) to lysosomes. Note that 
movement through the Golgi apparatus 
occurs by vesicles that shuttle between 

its individual cisternae and by a process 

of maturation, whereby the cisternae 
themselves move through the stack (central 
red arrows). 
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To function optimally, each transport vesicle that buds off from a com- 
partment must take with it only the proteins appropriate to its destination 
and must fuse only with the appropriate target membrane. A vesicle 
carrying cargo from the Golgi apparatus to the plasma membrane, for 
example, must exclude proteins that are to stay in the Golgi apparatus, 
and it must fuse only with the plasma membrane and not with any other 
organelle. While participating in this constant flow of membrane compo- 
nents, each organelle must maintain its own distinct identity; that is, its 
own distinctive protein and lipid composition. All of these recognition 
events depend on proteins displayed on the surface of the transport vesi- 
cle. As we will see, different types of transport vesicles shuttle between 
the various organelles, each carrying a distinct set of molecules. 


Vesicle Budding Is Driven by the Assembly of a Protein 
Coat 


Vesicles that bud from membranes usually have a distinctive protein coat 
on their cytosolic surface and are therefore called coated vesicles. After 
budding from its parent organelle, the vesicle sheds its coat, allowing its 
membrane to interact directly with the membrane to which it will fuse. 
Cells produce several kinds of coated vesicles, each with a distinctive 
protein coat. The coat serves at least two functions: it helps shape the 
membrane into a bud and it captures molecules for onward transport. 


The best-studied vesicles are those that have an outer coat made of the 
protein clathrin. These clathrin-coated vesicles bud from both the Golgi 
apparatus on the outward secretory pathway and from the plasma mem- 
brane on the inward endocytic pathway. At the plasma membrane, for 
example, each vesicle starts off as a clathrin-coated pit. Clathrin mol- 
ecules assemble into a basketlike network on the cytosolic surface of 
the membrane, and it is this assembly process that starts shaping the 
membrane into a vesicle (Figure 15-20). A GTP-binding protein called 
dynamin assembles as a ring around the neck of each deeply invaginated 
clathrin-coated pit. Together with other proteins recruited to the neck of 
the vesicle, the dynamin causes the neck to constrict, thereby pinching off 
the vesicle from its parent membrane. Other kinds of transport vesicles, 
with different coat proteins, are also involved in vesicular transport. They 
form in a similar way and carry their own characteristic sets of molecules 
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Figure 15-20 Clathrin molecules form basketlike cages that help shape membranes into vesicles. 
(A) Electron micrographs showing the sequence of events in the formation of a clathrin-coated vesicle 
from a clathrin-coated pit. The clathrin-coated pits and vesicles shown here are unusually large and are 
being formed at the plasma membrane of a hen oocyte. They are involved in taking up particles made 
of lipid and protein into the oocyte to form yolk. (B) Electron micrograph showing numerous clathrin- 
coated pits and vesicles budding from the inner surface of the plasma membrane of cultured skin cells. 


2004. With permission from Macmillan Publishers Ltd.) 


between the endoplasmic reticulum, the Golgi apparatus, and the plasma 
membrane. But how does a transport vesicle select its particular cargo? 
The mechanism is best understood for clathrin-coated vesicles. 


Clathrin itself plays no part in choosing specific molecules for transport. 
This is the function of a second class of coat proteins called adaptins, 
which both secure the clathrin coat to the vesicle membrane and help 
select cargo molecules for transport. Molecules for onward transport 
carry specific transport signals that are recognized by cargo receptors in 
the Golgi or plasma membrane. Adaptins help capture specific cargo 
molecules by trapping the cargo receptors that bind them. In this way, 
a selected set of cargo molecules, bound to their specific receptors, is 
incorporated into the lumen of each newly formed clathrin-coated vesi- 
cle (Figure 15-21). There are different types of adaptins: the adaptins that 
bind cargo receptors in the plasma membrane, for example, are not the 
same as those that bind cargo receptors in the Golgi apparatus, reflect- 
ing the differences in the cargo molecules to be transported from each of 
these sources. 


Other classes of coated vesicles, called COP-coated vesicles (COP being 
shorthand for “coat protein”), are involved in transporting molecules 
between the ER and the Golgi apparatus and from one part of the Golgi 
apparatus to another (Table 15-4). 


(C) In a test tube, clathrin molecules sometimes self-assemble into basketlike cages. The structure 

of one such clathrin cage was determined by cryoelectron microscopy. The positions of three of the 
constituent clathrin molecules, each of which has a characteristic three-armed shape, are highlighted 
in red, green, and brown. (A, from M.M. Perry and A.B. Gilbert, J. Cell Sci. 39:257-272, 1979. With 
permission from The Company of Biologists Ltd; B, from J. Heuser, J. Cell Biol. 84:560-583, 1980. 
With permission from Rockefeller University Press; C, from A. Fotin et al., Nature 432:573-579, 


TABLE 15-4 SOME TYPES OF COATED VESICLES 


Clathrin-coated clathrin + adaptin 1 Golgi apparatus 
Clathrin-coated clathrin + adaptin 2 plasma membrane 
COPII-coated COPII proteins ER 


COPI-coated COPI proteins Golgi cisterna 


lysosome (via endosomes) 
endosomes 
Golgi cisterna 


ER 
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Figure 15-21 Clathrin-coated vesicles 
transport selected cargo molecules. Here, 
as in Figure 15-20, the vesicles are shown 
budding from the plasma membrane. 

(A) Cargo receptors, with their bound cargo 
molecules, are captured by adaptins, which 
also bind clathrin molecules to the cytosolic 
surface of the budding vesicle (Movie 15.5). 
Dynamin proteins assemble around the neck 
of budding vesicles; once assembled, the 
dynamin molecules hydrolyze their bound 
GTP and, with the help of other proteins 
recruited to the neck (not shown), pinch off 
the vesicle. After budding is complete, the 
coat proteins are removed, and the naked 
vesicle can fuse with its target membrane. 
Functionally similar coat proteins are 

found in other types of coated vesicles. 

(B) In flies that produce a mutant dynamin 
protein, clathrin-coated pits assemble and 
dynamin is recruited around the neck of 
budding vesicles but fail to pinch them off, 
as can be seen in this electron micrograph 
of the plasma membrane in a fly's nerve 
ending. Flies with this mutation become 
paralyzed, because clathrin-mediated 
endocytosis grinds to a halt, preventing 

the recycling of vesicles needed to release 
neurotransmitters (see Figure 12—40). 

(B, from J.H. Koenig and K. Ikeda, 

J. Neurosci. 9:3844—3860, 1989. 

With permission from the Society for 
Neuroscience.) 


VESICLE 


FORMATION VNE OATING 


Vesicle Docking Depends on Tethers and SNARES 


After a transport vesicle buds from a membrane, it must find its way to 
the correct destination to deliver its contents. Often, the vesicle is actively 
transported by motor proteins that move along cytoskeletal fibers, as dis- 
cussed in Chapter 17 (see Figure 17-20). 


Once a transport vesicle has reached its target, it must recognize and 
dock with its specific organelle. Only then can the vesicle membrane fuse 
with the target membrane and unload the vesicle’s cargo. The impressive 
specificity of vesicular transport suggests that each type of transport vesi- 
cle in the cell displays molecular markers on its surface that identify the 
vesicle according to its origin and cargo. These markers must be recog- 
nized by complementary receptors on the appropriate target membrane, 
including the plasma membrane. 


The identification process depends on a diverse family of monomeric 
GTPases called Rab proteins. Specific Rab proteins on the surface of 
each type of vesicle are recognized by corresponding tethering proteins 
on the cytosolic surface of the target membrane. Each organelle and each 
type of transport vesicle carries a unique combination of Rab proteins, 
which serve as molecular markers for each membrane type. The coding 
system of matching Rab and tethering proteins helps to ensure that trans- 
port vesicles fuse only with the correct membrane. 


Additional recognition is provided by a family of transmembrane pro- 
teins called SNAREs. Once the tethering protein has captured a vesicle by 
grabbing hold of its Rab protein, SNAREs on the vesicle (called v-SNAREs) 
interact with complementary SNAREs on the target membrane (called 
t-SNAREs), firmly docking the vesicle in place (Figure 15-22). 


The same SNAREs involved in docking also play a central role in cata- 
lyzing the membrane fusion required for a transport vesicle to deliver 
its cargo. Fusion not only delivers the soluble contents of the vesicle 
into the interior of the target organelle or to the extracellular space, 
but it also adds the vesicle membrane to the membrane of the orga- 
nelle (see Figure 15-22). After vesicle docking, the fusion of a vesicle 
with its target membrane sometimes requires a special stimulatory 
signal. Whereas docking requires only that the two membranes come 
close enough for the SNAREs protruding from the two lipid bilayers to 
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interact, fusion requires a much closer approach: the two bilayers must 
come within 1.5 nanometers (nm) of each other so that their lipids can 
intermix. For this close approach, water must be displaced from the 
hydrophilic surfaces of the membranes—a process that is energetically 
highly unfavorable and thus prevents membranes from fusing randomly. 
All membrane fusions in cells must therefore be catalyzed by special- 
ized proteins that assemble to form a fusion complex that provides the 
means to cross this energy barrier. For vesicle fusion, the SNARE pro- 
teins themselves catalyze the process: when fusion is triggered, the 
v-SNAREs and t-SNAREs wrap around each other tightly, thereby act- 
ing like a winch that pulls the two lipid bilayers into close proximity 
(Figure 15-23). 


SECRETORY PATHWAYS 


Vesicular traffic is not confined to the interior of the cell. It extends to 
and from the plasma membrane. Newly made proteins, lipids, and car- 
bohydrates are delivered from the ER, via the Golgi apparatus, to the cell 
surface by transport vesicles that fuse with the plasma membrane in the 
process of exocytosis (see Figure 15-19). Each molecule that travels 
along this secretory pathway passes through a fixed sequence of mem- 
brane-enclosed compartments and is often chemically modified en route. 


transport 
vesicle > TRANSPORT 
VESICLE MEMBRANES LIPID BILAYERS 
DOCKS COALESCE FUSE 
/ 
v-SNARE 
CYTOSOL 
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Figure 15-22 Rab proteins, tethering 
proteins, and SNAREs help direct 
transport vesicles to their target 
membranes. A filamentous tethering 
protein (green) on a membrane binds to 

a Rab protein (yellow) on the surface of a 
vesicle. This interaction allows the vesicle 

to dock on its particular target membrane. 
A v-SNARE (red) on the vesicle then binds 
to a complementary t-SNARE (blue) on 

the target membrane. Whereas Rab and 
tethering proteins provide the initial 
recognition between a vesicle and its target 
membrane, complementary SNARE proteins 
ensure that transport vesicles dock at their 
appropriate target membranes. These 
SNARE proteins also catalyze the final fusion 
of the two membranes (see Figure 15-23). 


QUESTION 15-5 


The budding of clathrin-coated 
vesicles from eukaryotic plasma 
membrane fragments can be 
observed when adaptins, clathrin, 
and dynamin-GTP are added to 
the membrane preparation. What 
would you observe if you omitted 
(A) adaptins, (B) clathrin, or 

(C) dynamin? (D) What would you 
observe if the plasma membrane 
fragments were from a prokaryotic 
cell? 


Figure 15-23 Following vesicle 
docking, SNARE proteins can catalyze 
the fusion of the vesicle and target 
membranes. Once appropriately 
triggered, the tight pairing of v-SNAREs 
and t-SNAREs draws the two lipid bilayers 
into close apposition. The force of the 
SNAREs winding together squeezes 

out any water molecules that remain 
trapped between the two membranes, 
allowing their lipids to flow together to 
form a continuous bilayer. In a cell, other 
proteins recruited to the fusion site help 
to complete the fusion process. After 
fusion, the SNAREs are pried apart so 
that they can be used again. 
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QUESTION 15-6 


Why might it be advantageous to 
add a preassembled block of 14 
sugar residues to a protein in the 
ER, rather than building the sugar 
chains step-by-step on the surface 
of the protein by the sequential 
addition of sugars by individual 
enzymes? 


In this section, we follow the outward path of proteins as they travel from 
the ER, where they are made and modified, through the Golgi apparatus, 
where they are further modified and sorted, to the plasma membrane. 
As a protein passes from one compartment to another, it is monitored 
to check that it has folded properly and assembled with its appropriate 
partners, so that only correctly built proteins make it to the cell surface. 
Incorrect assemblies, which are often in the majority, are degraded inside 
the cell. Quality, it seems, is more important than economy when it comes 
to the production and transport of proteins via the secretory pathway. 


Most Proteins Are Covalently Modified in the ER 


Most proteins that enter the ER are chemically modified there. Disulfide 
bonds are formed by the oxidation of pairs of cysteine side chains (see 
Figure 4-30), a reaction catalyzed by an enzyme that resides in the ER 
lumen. The disulfide bonds help to stabilize the structure of proteins 
that will encounter degradative enzymes and changes in pH outside the 
cell—either after they are secreted or once they have been incorporated 
into the plasma membrane. Disulfide bonds do not form in the cytosol 
because the environment there is reducing. 


Many of the proteins that enter the ER lumen or ER membrane are con- 
verted to glycoproteins in the ER by the covalent attachment of short, 
branched oligosaccharide side chains composed of multiple sugars. This 
process of glycosylation is carried out by glycosylating enzymes present 
in the ER but not in the cytosol. Very few proteins in the cytosol are gly- 
cosylated, and those that are have only a single sugar attached to them. 
The oligosaccharides on proteins can serve various functions. They can 
protect a protein from degradation, hold it in the ER until it is properly 
folded, or help guide it to the appropriate organelle by serving as a trans- 
port signal for packaging the protein into appropriate transport vesicles. 
When displayed on the cell surface, oligosaccharides form part of the 
cell's outer carbohydrate layer or glycocalyx (see Figure 11-33) and can 
function in the recognition of one cell by another. 


In the ER, individual sugars are not added one-by-one to the protein to 
create an oligosaccharide side chain. Instead, a preformed, branched oli- 
gosaccharide containing a total of 14 sugars is attached en bloc to all 
proteins that carry the appropriate site for glycosylation. The oligosac- 
charide is originally attached to a specialized lipid, called dolichol, in 
the ER membrane; it is then transferred to the amino (NH2) group of an 
asparagine side chain on the protein, immediately after a target aspar- 
agine emerges in the ER lumen during protein translocation (Figure 
15-24). The addition takes place in a single enzymatic step that is cata- 
lyzed by amembrane-bound enzyme (an oligosaccharyl transferase) that 
has its active site exposed on the lumenal side of the ER membrane— 
which explains why cytosolic proteins are not glycosylated in this way. 
A simple sequence of three amino acids, of which the target aspara- 
gine is one, defines which sites in a protein receive the oligosaccharide. 
Oligosaccharide side chains linked to an asparagine NH2 group in a pro- 
tein are said to be N-linked and this is by far the most common type of 
linkage found on glycoproteins. 


The addition of the 14-sugar oligosaccharide in the ER is only the first 
step in a series of further modifications before the mature glycoprotein 
reaches the cell surface. Despite their initial similarity, the N-linked oli- 
gosaccharides on mature glycoproteins are remarkably diverse. All of 
the diversity results from extensive modification of the original precursor 
structure shown in Figure 15-24. This oligosaccharide processing begins 
in the ER and continues in the Golgi apparatus. 
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Exit from the ER Is Controlled to Ensure Protein Quality 


Some proteins made in the ER are destined to function there. They are 
retained in the ER (and are returned to the ER should they manage to 
escape to the Golgi apparatus) by a C-terminal sequence of four amino 
acids called an ER retention signal (see Table 15-3, p. 502). This retention 
signal is recognized by a membrane-bound receptor protein in the ER 
and Golgi apparatus. Most proteins that enter the ER, however, are des- 
tined for other locations; they are packaged into transport vesicles that 
bud from the ER and fuse with the Golgi apparatus. 


Exit from the ER is highly selective. Proteins that fail to fold correctly, and 
dimeric or multimeric proteins that do not assemble properly, are actively 
retained in the ER by binding to chaperone proteins that reside there. The 
chaperones hold these proteins in the ER until proper folding or assembly 
occurs. Chaperones prevent misfolded proteins from aggregating, which 
helps steer proteins along a path toward proper folding (see Figures 4-8 
and 4-9); if proper folding and assembly still fail, the proteins are exported 
to the cytosol, where they are degraded by the proteasome (see Figure 
7-43). Antibody molecules, for example, are composed of four polypep- 
tide chains (see Figure 4-33) that assemble into the complete antibody 
molecule in the ER. Partially assembled antibodies are retained in the ER 
until all four polypeptide chains have assembled; any antibody molecule 
that fails to assemble properly is degraded. In this way, the ER controls 
the quality of the proteins that it exports to the Golgi apparatus. 


Sometimes, however, this quality control mechanism can be detrimen- 
tal to the organism. For example, the predominant mutation that causes 
the common genetic disease cystic fibrosis, which leads to severe lung 
damage, produces a plasma membrane transport protein that is slightly 
misfolded; even though the mutant protein could function normally as a 
chloride channel if it reached the plasma membrane, it is retained in the 
ER and then degraded, with dire consequences. This devastating disease 
comes about not because the mutation inactivates an important protein 
but because the active protein is discarded by the cells before it is given 
an opportunity to function. 


Secretory Pathways 


Figure 15-24 Many proteins are 
glycosylated on asparagines in the ER. 
When an appropriate asparagine in a 
growing polypeptide chain enters the ER 
lumen, it is glycosylated by addition of a 
branched oligosaccharide side chain. Each 
oligosaccharide chain is transferred as an 
intact unit to the asparagine from a lipid 
called dolichol, catalyzed by the enzyme 
oligosaccharyl transferase. Asparagines that 
are glycosylated are always present in the 
tripeptide sequences asparagine-X-serine 
or asparagine-X-threonine, where X can be 
almost any amino acid. 
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Figure 15-25 Accumulation of misfolded misfolded proteins bound to sensors 
proteins in the ER lumen triggers an fo 

unfolded protein response (UPR). The 3 
misfolded proteins are recognized by & B ER membrane 
several types of transmembrane sensor ER LUMEN 

proteins in the ER membrane, each of which CYTOSOL 

activates a different component of the UPR. 


activated 
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transcription regulators that activate genes AWS a. protein 

encoding chaperones or other proteins { synthesis j 
involved in ER quality control. Another @ 

sensor can also inhibit protein synthesis, | | 


reducing the flow of proteins through the ER 
(Movie 15.6 and Movie 15.7). 
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The Size of the ER Is Controlled by the Demand for 
Protein Folding 


Although chaperones help proteins in the ER fold properly and retain those 
that do not, this quality control system can become overwhelmed. When 
this happens, misfolded proteins accumulate in the ER. If the buildup is 
large enough, it triggers a complex program called the unfolded pro- 
tein response (UPR). This program prompts the cell to produce more ER, 
including more chaperones and other proteins concerned with quality 
control (Figure 15-25). 


The UPR allows a cell to adjust the size of its ER to properly handle the 
volume of proteins entering the secretory pathway. In some cases, how- 
ever, even an expanded ER cannot keep up with the demand, and the 
UPR directs the cell to self-destruct by undergoing apoptosis. Such a situ- 
ation may occur in adult-onset diabetes, where tissues gradually become 
resistant to the effects of insulin. To compensate for this resistance, the 
insulin-secreting cells in the pancreas produce more and more insulin. 
Eventually, their ER reaches a maximum capacity, at which point the UPR 
can trigger cell death. As more insulin-secreting cells are eliminated, the 
demand on the surviving cells increases, making it more likely that they 
will die as well, further exacerbating the disease. 


Proteins Are Further Modified and Sorted in the Golgi 
Apparatus 


The Golgi apparatus is usually located near the cell nucleus, and in ani- 
mal cells it is often close to the centrosome, a small cytoskeletal structure 
near the cell center (see Figure 17-13). The Golgi apparatus consists of a 
collection of flattened, membrane-enclosed sacs called cisternae, which 
are piled like stacks of pita bread (Figure 15-26). Each stack contains 
3-20 cisternae, and the number of Golgi stacks per cell varies greatly 
depending on the cell type: some cells contain one large stack, while oth- 
ers contain hundreds of very small ones. 


Each Golgi stack has two distinct faces: an entry, or cis, face and an exit, 
or trans, face. The cis face is adjacent to the ER, while the trans face 
points toward the plasma membrane. The outermost cisterna at each 
face is connected to a network of interconnected membranous tubes and 
vesicles (see Figure 15-26A). Soluble proteins and pieces of membrane 
enter the cis Golgi network via transport vesicles derived from the ER. 
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Figure 15-26 The Golgi apparatus consists of a stack of flattened, 
membrane-enclosed sacs. (A) A three-dimensional model of a Golgi stack 
reconstructed from a sequential series of electron micrographs of the Golgi 
apparatus in a secretory animal cell. To see how such models are assembled, 
watch Movie 15.8. (B) Electron micrograph of a Golgi stack from a plant cell, 
where the Golgi apparatus is especially distinct; the stack is oriented as in (A). 
(C) A pita-bread model of the Golgi apparatus. Vesicles are shown as olives. 
(B, courtesy of George Palade.) 


(©) 


The proteins travel through the cisternae in sequence in two ways: (1) by 
means of transport vesicles that bud from one cisterna and fuse with the 
next; and (2) by a maturation process in which the Golgi cisternae them- 
selves migrate through the Golgi stack. Proteins finally exit from the trans 
Golgi network in transport vesicles destined for either the cell surface or 
another organelle of the endomembrane system (see Figure 15-19). 


Both the cis and trans Golgi networks are thought to be important for 
protein sorting: proteins entering the cis Golgi network can either move 
onward through the Golgi stack or, if they contain an ER retention sig- 
nal, be returned to the ER; proteins exiting from the trans Golgi network 
are sorted according to whether they are destined for lysosomes (via 
endosomes) or for the cell surface. We discuss some examples of sorting 
by the trans Golgi network later, and we present some of the methods for 
tracking proteins through the secretory pathways of the cell in How We 
Know, pp. 520-521. 


Many of the oligosaccharide chains that are added to proteins in the ER 
(see Figure 15-24) undergo further modifications in the Golgi apparatus. 
On some proteins, for example, more complex oligosaccharide chains 
are created by a highly ordered process in which sugars are added and 
removed by a series of enzymes that act in a rigidly determined sequence 
as the protein passes through the Golgi stack. As would be expected, the 
enzymes that act early in the chain of processing events are located in 
cisternae close to the cis face, while enzymes that act late are located 
in cisternae near the trans face. 


Secretory Proteins Are Released from the Cell by 
Exocytosis 
In all eukaryotic cells, a steady stream of vesicles buds from the trans 


Golgi network and fuses with the plasma membrane in the process of 
exocytosis. This constitutive exocytosis pathway supplies the plasma 
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TRACKING PROTEIN AND VESICLE TRANSPORT 


Over the years, biologists have taken advantage of 
a variety of techniques to untangle the pathways 
and mechanisms by which proteins are sorted and 
transported into and out of the cell and its resident 
organelles. Biochemical, genetic, molecular biological, 
and microscopic techniques all provide ways to moni- 
tor how proteins shuttle from one cell compartment to 
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Figure 15-27 Several methods can be used to determine 
whether a labeled protein bearing a particular signal 
sequence is transported into a preparation of isolated 
organelles. (A) The labeled protein with or without a signal 
sequence is incubated with the organelles, and the preparation is 
centrifuged. Only those labeled proteins that contained a signal 
sequence will be transported and therefore will co-fractionate 
with the organelle. (B) The labeled proteins are incubated with 
the organelle, and a protease is added to the preparation. A 
transported protein will be selectively protected from digestion 
by the organelle membrane; adding a detergent that disrupts 
the organelle membrane will eliminate that protection, and the 
transported protein will also be degraded. 


another. Some can even track the migration of proteins 
and transport vesicles in real time in living cells. 


In a tube 


A protein bearing a signal sequence can be introduced 
to a preparation of isolated organelles in a test tube. 
This mixture can then be tested to see whether the pro- 
tein is taken up by the organelle. The protein is usually 
produced in vitro by cell-free translation of a purified 
mRNA encoding the polypeptide; in the process, radio- 
active amino acids can be used to label the protein so 
that it will be easy to isolate and to follow. The labeled 
protein is incubated with a selected organelle and its 
translocation is monitored by one of several methods 
(Figure 15-27). 


Ask a yeast 


Movement of proteins between different cell com- 
partments via transport vesicles has been studied 
extensively using genetic techniques. Studies of mutant 
yeast cells that are defective for secretion at high tem- 
peratures have identified numerous genes involved in 
carrying proteins from the ER to the cell surface. Many 
of these mutant genes encode temperature-sensitive 
proteins (discussed in Chapter 19). These mutant pro- 
teins may function normally at 25°C, but, when the yeast 
cells are shifted to 35°C, the proteins are inactivated. As 
a result, when researchers raise the temperature, the 
various proteins destined for secretion instead accu- 
mulate inappropriately in the ER, Golgi apparatus, or 
transport vesicles—depending on the particular muta- 
tion (Figure 15-28). 


At the movies 


The most commonly used method for tracking a pro- 
tein as it moves throughout the cell involves tagging the 
polypeptide with a fluorescent protein, such as green 
fluorescent protein (GFP). Using the genetic engineering 
techniques discussed in Chapter 10, this small protein 
can be fused to other cell proteins. Fortunately, for many 
proteins studied, the addition of GFP to one or other 
end does not perturb the protein’s normal function or 
transport. The movement of a GFP-tagged protein can 
then be monitored in a living cell with a fluorescence 
microscope. In 2008, the Nobel Prize in Chemistry was 
awarded to Osamu Shimomura, Martin Chalfie, and 
Roger Tsien for the development and refinement of this 
technology. 
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Figure 15-28 Temperature-sensitive mutants have been used to dissect the protein secretory pathway in yeast. Mutations 
in genes involved at different stages of the transport process, as indicated by the red X, result in the accumulation of proteins in the 
ER, the Golgi apparatus, or transport vesicles. 


Such GFP fusion proteins are widely used to study the protein can be used to measure the kinetics of its move- 
location and movement of proteins in cells (Figure ment through the secretory pathway. Movies 15.1, 15.9, 
15-29). GFP fused to a protein that shuttles in and outof 15.10, and 15.13 demonstrate the power and beauty of 
the nucleus, for example, can be used to study nuclear this technique. 

transport events. GFP fused to a plasma membrane 
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Figure 15-29 Tagging a protein with GFP allows the resulting fusion protein to be tracked throughout the cell. In this 
experiment, GFP is fused to a viral coat protein and expressed in cultured animal cells. In an infected cell, the viral protein moves 
through the secretory pathway from the ER to the cell surface, where the virus particles are assembled. Red arrows indicate the 
direction of protein movement. The viral coat protein used in this experiment contains a mutation that allows export from the ER only at 
a low temperature. (A) At high temperatures, the fusion protein labels the ER. (B) As the temperature is lowered, the GFP fusion protein 
rapidly accumulates at ER exit sites. (C) The fusion protein then moves to the Golgi apparatus. (D) Finally, the fusion protein is delivered 
to the plasma membrane, shown here in a more close-up view. The halo between the two white arrowheads marks the spot where a 
single vesicle has fused, allowing the fusion protein to incorporate into the plasma membrane. These images are stills taken from 
Movie 15.9. (A-D, courtesy of Jennifer Lippincott-Schwartz.) 
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QUESTION 15-7 


What would you expect to happen 
in cells that secrete large amounts 
of protein through the regulated 
secretory pathway if the ionic 
conditions in the ER lumen could be 
changed to resemble those in the 
lumen of the trans Golgi network? 


Figure 15-30 In secretory cells, the 
regulated and constitutive pathways of 
exocytosis diverge in the trans Golgi 
network. Many soluble proteins are 
continually secreted from the cell by the 
constitutive secretory pathway (blue arrows), 
which operates in all eukaryotic cells 

(Movie 15.10). This pathway also continually 
supplies the plasma membrane with newly 
synthesized lipids and proteins. Specialized 
secretory cells have, in addition, a regulated 
exocytosis pathway (red arrows) by which 
selected proteins in the trans Golgi network 
are diverted into secretory vesicles, where 
the proteins are concentrated and stored 
until an extracellular signal stimulates their 
secretion. It is unclear how these special 
aggregates of secretory proteins (red) 

are segregated into secretory vesicles. 
Secretory vesicles have unique proteins in 
their membranes; perhaps some of these 
proteins act as receptors for secretory 
protein aggregates in the trans Golgi 
network. 
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membrane with newly made lipids and proteins (Movie 15.9), enabling 
the plasma membrane to expand prior to cell division and refreshing old 
lipids and proteins in nonproliferating cells. The constitutive pathway 
also carries soluble proteins to the cell surface to be released to the out- 
side, a process called secretion. Some of these proteins remain attached 
to the cell surface; some are incorporated into the extracellular matrix; 
still others diffuse into the extracellular fluid to nourish or signal other 
cells. Entry into the constitutive pathway does not require a particular 
signal sequence like those that direct proteins to endosomes or back to 
the ER. 


In addition to the constitutive exocytosis pathway, which operates contin- 
ually in all eukaryotic cells, there is a regulated exocytosis pathway, which 
operates only in cells that are specialized for secretion. Each specialized 
secretory cell produces large quantities of a particular product—such as 
a hormone, mucus, or digestive enzymes—which is stored in secretory 
vesicles for later release. These vesicles, which are part of the endomem- 
brane system, bud off from the trans Golgi network and accumulate near 
the plasma membrane. There they wait for an extracellular signal that 
will stimulate them to fuse with the plasma membrane and release their 
contents to the cell exterior by exocytosis (Figure 15-30). An increase in 
blood glucose, for example, signals insulin-producing endocrine cells in 
the pancreas to secrete the hormone (Figure 15-31). 


Proteins destined for regulated secretion are sorted and packaged in the 
trans Golgi network. Proteins that travel by this pathway have special 
surface properties that cause them to aggregate with one another under 
the ionic conditions (acidic pH and high Ca?*) that prevail in the trans 
Golgi network. The aggregated proteins are packaged into secretory vesi- 
cles, which pinch off from the network and await a signal instructing 
them to fuse with the plasma membrane. Proteins secreted by the con- 
stitutive pathway, on the other hand, do not aggregate and are therefore 
carried automatically to the plasma membrane by the transport vesicles 
of the constitutive pathway. Selective aggregation has another function: 
it allows secretory proteins to be packaged into secretory vesicles at 
concentrations much higher than the concentration of the unaggregated 
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protein in the Golgi lumen. This increase in concentration can reach 
200-fold, enabling secretory cells to release large amounts of the protein 
promptly when triggered to do so (see Figure 15-30). 


When a secretory vesicle or transport vesicle fuses with the plasma mem- 
brane and discharges its contents by exocytosis, its membrane becomes 
part of the plasma membrane. Although this should greatly increase the 
surface area of the plasma membrane, it does so only transiently because 
membrane components are removed from other regions of the surface by 
endocytosis almost as fast as they are added by exocytosis. This removal 
returns both the lipids and the proteins of the vesicle membrane to the 
Golgi network, where they can be used again. Similar membrane retrieval 
pathways also operate in the Golgi apparatus to return lipids and selected 
proteins to the endoplasmic reticulum. 


ENDOCYTIC PATHWAYS 


Eukaryotic cells are continually taking up fluid, along with large and 
small molecules, by the process of endocytosis. Certain specialized cells 
are also able to internalize large particles and even other cells. The mat- 
erial to be ingested is progressively enclosed by a small portion of the 
plasma membrane, which first buds inward and then pinches off to form 
an intracellular endocytic vesicle. The ingested materials, including the 
membrane components, are delivered to endosomes, from which they 
can be recycled to the plasma membrane or sent to lysosomes for diges- 
tion. The metabolites generated by digestion are transferred directly out 
of the lysosome into the cytosol, where they can be used by the cell. 


Two main types of endocytosis are distinguished on the basis of the 
size of the endocytic vesicles formed. Pinocytosis (“cellular drinking”) 
involves the ingestion of fluid and molecules via small pinocytic vesicles 
(<150 nm in diameter). Phagocytosis (“cellular eating”) involves the inges- 
tion of large particles, such as microorganisms and cell debris, via large 
vesicles called phagosomes (generally >250 nm in diameter). Whereas all 
eukaryotic cells are continually ingesting fluid and molecules by pinocy- 
tosis, large particles are ingested mainly by specialized phagocytic cells. 


In this final section, we trace the endocytic pathway from the plasma 
membrane to lysosomes. We start by considering the uptake of large par- 
ticles by phagocytosis. 


Specialized Phagocytic Cells Ingest Large Particles 


The most dramatic form of endocytosis, phagocytosis, was first observed 
more than a hundred years ago. In protozoa, phagocytosis is a form of 
feeding: these unicellular eukaryotes ingest large particles such as bacte- 
ria by taking them up into phagosomes (Movie 15.11). The phagosomes 
then fuse with lysosomes, where the food particles are digested. Few cells 
in multicellular organisms are able to ingest large particles efficiently. In 
the animal gut, for example, large particles of food have to be broken 
down to individual molecules by extracellular enzymes before they can 
be taken up by the absorptive cells lining the gut. 


Nevertheless, phagocytosis is important in most animals for purposes 
other than nutrition. Phagocytic cells—including macrophages, which 
are widely distributed in tissues, and other white blood cells, such as 
neutrophils—defend us against infection by ingesting invading microor- 
ganisms. To be taken up by macrophages or neutrophils, particles must 
first bind to the phagocytic cell surface and activate one of a variety of 
surface receptors. Some of these receptors recognize antibodies, the 
proteins that help protect us against infection by binding to the surface 
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Figure 15-31 Secretory vesicles store 
and release concentrated proteins. 

The process, which takes place through 
vesicle docking and fusion (see Figure 
15-23), requires a signal to initiate. The 
cryoelectron micrograph shows the release 
of concentrated neurotransmitter from a 
cultured mouse hippocampal neuron. The 
sample was rapidly frozen just 5 ms after 
the neuron was stimulated to fire. (From 
S. Watanabe, Front. Synaptic. Neurosci. 
8:1-10, 2016.) 
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Figure 15-32 Specialized phagocytic 
cells can ingest other cells. (A) Electron 
micrograph of a phagocytic white blood 
cell (a neutrophil) ingesting a bacterium, 
which is in the process of dividing. 

(B) Scanning electron micrograph showing 
a macrophage engulfing a pair of red 
blood cells. The lines point to the edges of 
the pseudopods that the phagocytic cells 
are extending like collars to envelop their 
targets. (A, courtesy of Dorothy F. Bainton; 
B, courtesy of Jean Paul Revel.) 
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of microorganisms. Binding of antibody-coated bacteria to these recep- 
tors induces the phagocytic cell to extend sheetlike projections of the 
plasma membrane, called pseudopods, that engulf the bacterium (Figure 
15-32A). These pseudopods fuse at their tips to form a phagosome, 
which then fuses with a lysosome, where the microbe is destroyed. Some 
pathogenic bacteria have evolved tricks for subverting the system: for 
example, Mycobacterium tuberculosis, the agent responsible for tubercu- 
losis, can inhibit the membrane fusion that unites the phagosome with 
a lysosome. Instead of being destroyed, the engulfed organism survives 
and multiplies within the macrophage. Although the mechanism is not 
completely understood, identifying the proteins involved will provide 
therapeutic targets for drugs that could restore the macrophages’ ability 
to eliminate the infection. 


Phagocytic cells also play an important part in scavenging dead and 
damaged cells and cell debris. Macrophages, for example, ingest more 
than 10!! worn-out red blood cells in the human body each day (Figure 
15-32B). 


Fluid and Macromolecules Are Taken Up by Pinocytosis 


Eukaryotic cells continually ingest bits of their plasma membrane, along 
with small amounts of extracellular fluid, in the process of pinocytosis. 
The rate at which plasma membrane is internalized in pinocytic vesi- 
cles varies from cell type to cell type, but it is usually surprisingly large. 
A macrophage, for example, swallows 25% of its own volume of fluid 
each hour. This means that it removes 3% of its plasma membrane each 
minute, or 100% in about half an hour. Pinocytosis occurs more slowly in 
fibroblasts, but even more rapidly in some phagocytic amoebae. Because 
a cell’s total surface area and volume remain unchanged during this pro- 
cess, as much membrane is being added to the cell surface by exocytosis 
as is being removed by endocytosis (see Figure 15-19). It is not known 
how eukaryotic cells maintain this remarkable balance. 


Pinocytosis is carried out mainly by the clathrin-coated pits and vesi- 
cles that we discussed earlier (see Figures 15-20 and 15-21). After they 
pinch off from the plasma membrane, clathrin-coated vesicles rapidly 


shed their coat and fuse with an endosome. Extracellular fluid is trapped 
in the coated pit as it invaginates to form a coated vesicle, and so sub- 
stances dissolved in the extracellular fluid are internalized and delivered 
to endosomes. This fluid intake by clathrin-coated and other types of 
pinocytic vesicles is generally balanced by fluid loss during exocytosis. 


Receptor-mediated Endocytosis Provides a Specific 
Route into Animal Cells 


Pinocytosis, as just described, is indiscriminate. The endocytic vesicles 
simply trap any molecules that happen to be present in the extracellular 
fluid and carry them into the cell. However, pinocytosis can sometimes 
be more selective. In most animal cells, specific macromolecules can 
be taken up from the extracellular fluid via clathrin-coated vesicles. The 
macromolecules bind to complementary receptors on the cell surface and 
enter the cell as receptor-macromolecule complexes in clathrin-coated 
vesicles (see Figure 15-21). This process, called receptor-mediated 
endocytosis, provides a selective concentrating mechanism that 
increases the efficiency of internalization of particular macromolecules 
more than 1000-fold compared with ordinary pinocytosis, so that even 
minor components of the extracellular fluid can be taken up in large 
amounts without taking in a correspondingly large volume of extracel- 
lular fluid. Such is the case when animal cells import the cholesterol they 
need to make new membrane. 


Cholesterol is a lipid that is extremely insoluble in water (see Figure 11-7). 
It is transported in the bloodstream bound to proteins in the form of parti- 
cles called low-density lipoproteins, or LDL. Cholesterol-containing LDLs, 
which are secreted by the liver, bind to receptors located on the surface 
of cells. The resulting receptor-LDL complexes can then be ingested by 
receptor-mediated endocytosis and delivered to endosomes. The interior 
of endosomes is more acidic than the surrounding cytosol or the extra- 
cellular fluid, and in this acidic environment the LDL dissociates from its 
receptor: the empty receptors are returned, via transport vesicles, to the 
plasma membrane for reuse, while the LDL is delivered to lysosomes. 
In the lysosomes, the LDL is broken down by hydrolytic enzymes. Freed 
from the bulky LDLs, cholesterol escapes into the cytosol, where it can be 
used to synthesize new membrane (Figure 15-33). 
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Figure 15-33 LDL enters cells via 
receptor-mediated endocytosis. LDL 
binds to LDL receptors on the cell surface 
and is internalized in clathrin-coated 
vesicles. The vesicles lose their coat and 
then fuse with endosomes. In the acidic 
environment of the endosome, LDL 
dissociates from its receptors. The LDL ends 
up in lysosomes, where it is degraded to 
release free cholesterol (red dots), while the 
LDL receptors are returned to the plasma 
membrane via transport vesicles to be used 
again (Movie 15.12). For simplicity, only 
one LDL receptor is shown entering the cell 
and returning to the plasma membrane. 
Whether it is occupied or not, an LDL 
receptor typically makes one round trip 
into the cell and back every 10 minutes, 
making a total of several hundred trips 
over its 20-hour life-span. 
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Iron (Fe) is an essential trace metal 
that is needed by all cells. It is 
required, for example, for synthesis 
of the heme groups and iron-sulfur 
centers that are part of the active 
site of many proteins involved 

in electron-transfer reactions; it 

is also required in hemoglobin, 

the main protein in red blood 

cells. Iron is taken up by cells by 
receptor-mediated endocytosis. 
The iron-uptake system has two 
components: a soluble protein 
called transferrin, which circulates in 
the bloodstream; and a transferrin 
receptor—a transmembrane 
protein that, like the LDL receptor 
in Figure 15—33, is continually 
endocytosed and recycled to the 
plasma membrane. Fe ions bind to 
transferrin at neutral pH but not at 
acidic pH. Transferrin binds to the 
transferrin receptor at neutral pH 
only when it has an Fe ion bound, 
but it binds to the receptor at 
acidic pH even in the absence of 
bound iron. From these properties, 
describe how iron is taken up, 

and discuss the advantages of this 
elaborate scheme. 
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Figure 15-34 Viruses can enter cells via receptor-mediated 
endocytosis. (A) Electron micrograph showing viruses bound to 
receptors on the surface of a T cell; one virus particle is being 
internalized in a clathrin-coated vesicle. (B) A pair of viruses have been 
taken up by receptor-mediated endocytosis. These vesicles will fuse 
with lysosomes, where the low pH will allow the release of the viral 
genome into the cytoplasm—a necessary step in viral replication. 

(A, from E. Fries and A. Helenius, Eur. J. Biochem. 97:213-220, 1979. 

With permission from John Wiley & Sons; B, from K. Simons, H. Garoff, 

A. Helenius, Sci. Am. 246:58-66, 1982. With permission from the authors.) 


This pathway for cholesterol uptake is disrupted in individuals who 
inherit a defective version of the gene encoding the LDL receptor protein. 
In some cases, the receptors are missing; in others, they are present but 
nonfunctional. In either case, because the cells are deficient in taking up 
LDL, cholesterol accumulates in the blood and predisposes the individu- 
als to develop atherosclerosis. Unless they take drugs (statins) to reduce 
their blood cholesterol, they will likely die at an early age of heart attacks, 
which result from cholesterol clogging the coronary arteries that supply 
the heart muscle. 


Receptor-mediated endocytosis is also used to take up many other essen- 
tial metabolites, such as vitamin Bı2 and iron, which cells cannot take 
up by the processes of transmembrane transport discussed in Chapter 
12. Vitamin Bı2 and iron are both required, for example, to make hemo- 
globin, which is the major protein in red blood cells; these substances 
enter immature red blood cells as part of a complex with their respective 
receptor proteins. Many cell-surface receptors that bind extracellular sig- 
nal molecules are also ingested by this pathway: some are recycled to the 
plasma membrane for reuse, whereas others are degraded in lysosomes. 
Unfortunately, receptor-mediated endocytosis can also be exploited by 
viruses (Figure 15-34). The influenza virus, which causes the flu, and 
HIV, which causes AIDS, gain entry into cells in this way. 


Endocytosed Macromolecules Are Sorted in Endosomes 


Because most extracellular material taken up by pinocytosis is rapidly 
delivered to endosomes, it is possible to visualize the endosomal compart- 
ment by incubating living cells in fluid containing a fluorescent marker that 
will show up when viewed in a fluorescence microscope. When examined 
in this way, the endosomal compartment reveals itself to be a complex set 
of connected membrane tubes and larger vesicles. Two sets of endosomes 
can be distinguished in such loading experiments: the marker molecules 
appear first in early endosomes, just beneath the plasma membrane; 5 to 
15 minutes later, they show up in late endosomes, located closer to the 
nucleus (see Figure 15-19). Early endosomes mature gradually into late 
endosomes as they fuse with each other or with a preexisting late endo- 
some (Movie 15.13). The interior of the endosome compartment is kept 
acidic (pH 5-6) by an ATP-driven H+ (proton) pump in the endosomal 
membrane that pumps H* into the endosome lumen from the cytosol. 


Just as the Golgi network acts as the main sorting station in the outward 
secretory pathway, the endosomal compartment serves this function in 
the inward endocytic pathway. The acidic environment of the endosome 
plays a crucial part in the sorting process by causing many (but not all) 
receptors to release their bound cargo. The routes taken by receptors once 
they have entered an endosome differ according to the type of receptor: 
(1) most are returned to the same plasma membrane domain from which 
they came, as is the case for the LDL receptor discussed earlier; (2) some 
travel to lysosomes, where they are degraded; and (3) some proceed to 
a different domain of the plasma membrane, thereby transferring their 


Figure 15-35 The fate of receptor proteins following their 
endocytosis depends on the type of receptor. Three pathways 

from the endosomal compartment in an epithelial cell are shown. 
Receptors that are not specifically retrieved from early endosomes 
follow the pathway from the endosomal compartment to lysosomes, 
where they are degraded. Retrieved receptors are returned either to 
the same plasma membrane domain from which they came (recycling) 
or to a different domain of the plasma membrane (transcytosis). Tight 
junctions separate the apical and basolateral plasma membranes, 
preventing their resident receptor proteins from diffusing from one 
domain to another (see Figure 11-32). If the ligand that is endocytosed 
with its receptor stays bound to the receptor in the acidic environment 
of the endosome, it will follow the same pathway as the receptor; 
otherwise it will be delivered to lysosomes for degradation. 


bound cargo molecules across the cell from one extracellular space to 
another, a process called transcytosis (Figure 15-35). 


Cargo proteins that remain bound to their receptors share the fate of their 
receptors. Those that dissociate from their receptors in the endosome are 
doomed to destruction in lysosomes, along with most of the contents of 
the endosome lumen. Late endosomes contain some lysosomal enzymes, 
so digestion of cargo proteins and other macromolecules begins in the 
endosome and continues as the endosome gradually matures into a lyso- 
some: once it has digested most of its ingested contents, the endosome 
takes on the dense, rounded appearance characteristic of a mature, “clas- 
sical” lysosome. 


Lysosomes Are the Principal Sites of Intracellular 
Digestion 


Many extracellular particles and molecules ingested by cells end up in 
lysosomes, which are membranous sacs of hydrolytic enzymes that 
carry out the controlled intracellular digestion of both extracellular mat- 
erials and worn-out organelles. They contain about 40 types of hydrolytic 
enzymes, including those that degrade proteins, nucleic acids, oligosac- 
charides, and lipids. All of these enzymes are optimally active in the acidic 
conditions (pH ~5) maintained within lysosomes. The membrane of the 
lysosome normally keeps these destructive enzymes out of the cytosol 
(whose pH is about 7.2), but the enzymes’ acid dependence protects the 
contents of the cytosol against damage should some of them escape. 


Like all other intracellular organelles, the lysosome not only contains a 
unique collection of enzymes but also has a unique surrounding mem- 
brane. The lysosomal membrane contains transporters that allow the 
final products of the digestion of macromolecules, such as amino acids, 
sugars, and nucleotides, to be transferred to the cytosol; from there, these 
materials can be either exported or utilized by the cell. The membrane 
also contains an ATP-driven H* pump, which, like the ATPase in the 
endosome membrane, pumps Ht into the lysosome, thereby maintain- 
ing its contents at an acidic pH (Figure 15-36). Most of the lysosomal 
membrane proteins are unusually highly glycosylated; the sugars, which 
cover much of the protein surfaces facing the lysosome lumen, protect 
the proteins from digestion by the lysosomal proteases. 


The specialized digestive enzymes and membrane proteins of the lyso- 
some are synthesized in the ER and transported through the Golgi 
apparatus to the trans Golgi network. While in the ER and the cis Golgi 
network, the enzymes are tagged with a specific phosphorylated sugar 
group (mannose 6-phosphate), so that when they arrive in the trans Golgi 
network they can be recognized by an appropriate receptor, the mannose 
6-phosphate receptor. This tagging permits the lysosomal enzymes to be 
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Figure 15-36 A lysosome contains a 
large variety of hydrolytic enzymes, 
which are only active under acidic 
conditions. The lumen of the lysosome 

is maintained at an acidic pH by an ATP- 
driven H* pump in the membrane that 
hydrolyzes ATP to pump H* into the lumen. 
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Figure 15-37 Materials destined for 
degradation in lysosomes follow different 
pathways to the lysosome. Each pathway 
leads to the intracellular digestion of 
materials derived from a different source. 
Early endosomes, phagosomes, and 
autophagosomes can fuse with either 
lysosomes or late endosomes, both of which 
contain acid-dependent hydrolytic enzymes. 
Where the membrane fragments that form 
the autophagosome originate is still actively 
investigated. 
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sorted and packaged into transport vesicles, which bud off and deliver 
their contents to lysosomes via endosomes (see Figure 15-19). 


Depending on their source, materials follow different paths to lysosomes. 
We have seen that extracellular particles are taken up into phagosomes, 
which fuse with lysosomes, and that extracellular fluid and macromol- 
ecules are taken up into smaller endocytic vesicles, which deliver their 
contents to lysosomes via endosomes. 


Cells have an additional pathway that supplies materials to lysosomes; 
this pathway, called autophagy, is used to degrade obsolete parts of the 
cell: as the term suggests, the cell literally eats itself. In electron micro- 
graphs of liver cells, for example, one often sees lysosomes digesting 
mitochondria, as well as other organelles. The process involves the 
enclosure of the organelle by a double membrane, creating an autophago- 
some, which then fuses with a lysosome (Figure 15-37). Autophagy 
of organelles and cytosolic proteins—some of which are marked for 
destruction by the attachment of ubiquitin tags (as discussed in Chapter 
4)—increases when eukaryotic cells are starved or when they remodel 
themselves extensively during development. The amino acids generated 
by this cannibalistic form of digestion can then be recycled to allow con- 
tinued protein synthesis. 


ESSENTIAL CONCEPTS 


e Eukaryotic cells contain many membrane-enclosed organelles, 
including a nucleus, an endoplasmic reticulum (ER), a Golgi appa- 
ratus, lysosomes, endosomes, mitochondria, chloroplasts (in plant 
cells), and peroxisomes. The ER, Golgi apparatus, peroxisomes, 
endosomes, and lysosomes are all part of the endomembrane system. 


e Most organelle proteins are made in the cytosol and transported into 
the organelle where they function. Sorting signals in the amino acid 
sequence guide the proteins to the correct organelle; proteins that 
function in the cytosol have no such signals and remain where they 
are made. 


Nuclear proteins contain nuclear localization signals that help 
direct their active transport from the cytosol into the nucleus 
through nuclear pores, which penetrate the double membrane of the 
nuclear envelope. The proteins are transported in their fully folded 
conformation. 


Most mitochondrial and chloroplast proteins are made in the cytosol 
and are then transported into the organelles by protein translocators 
in their membranes. The proteins are unfolded during the transport 
process. 


The ER makes most of the cell's lipids and many of its proteins. The 
proteins are made by ribosomes that are directed to the ER by a 
signal-recognition particle (SRP) in the cytosol that recognizes an ER 
signal sequence on the growing polypeptide chain. The ribosome- 
SRP complex binds to a receptor on the ER membrane, which passes 
the ribosome to a protein translocator that threads the growing poly- 
peptide across the ER membrane. 


Water-soluble proteins destined for secretion or for the lumen of an 
organelle of the endomembrane system pass completely into the 
ER lumen, while transmembrane proteins destined for either the 
membrane of these organelles or for the plasma membrane remain 
anchored in the lipid bilayer by one or more membrane-spanning 
a helices. 


In the ER lumen, proteins fold up, assemble with their protein 
partners, form disulfide bonds, and become decorated with oligosac- 
charide chains. 


Exit from the ER is an important quality-control step; proteins that 
either fail to fold properly or fail to assemble with their normal part- 
ners are retained in the ER by chaperone proteins, which prevent 
their aggregation and help them fold; proteins that still fail to fold or 
assemble are transported to the cytosol, where they are degraded. 


Excessive accumulation of misfolded proteins triggers an unfolded 
protein response that expands the ER, increases its capacity to fold 
new proteins properly, and reduces protein synthesis. 


Protein transport from the ER to the Golgi apparatus and from the 
Golgi apparatus to other destinations is mediated by transport vesi- 
cles that continually bud off from one membrane and fuse with 
another, a process called vesicular transport. 


Budding transport vesicles have distinctive coat proteins on their 
cytosolic surface; the assembly of the coat helps drive both the bud- 
ding process and the incorporation of cargo receptors, with their 
bound cargo molecules, into the forming vesicle. 


Coated vesicles rapidly lose their protein coat, enabling them to dock 
and then fuse with a particular target membrane; docking and fusion 
are mediated by proteins on the surface of the vesicle and target 
membrane, including Rab, tethering, and SNARE proteins. 


The Golgi apparatus receives newly made proteins from the ER; it 
modifies their oligosaccharides, sorts the proteins, and dispatches 
them from the trans Golgi network to the plasma membrane, 
lysosomes (via endosomes), or secretory vesicles. 


In all eukaryotic cells, transport vesicles continually bud from the 
trans Golgi network and fuse with the plasma membrane; this pro- 
cess of constitutive exocytosis delivers proteins to the cell surface 
for secretion and incorporates lipids and proteins into the plasma 
membrane. 


Specialized secretory cells also have a regulated exocytosis pathway, 
in which molecules concentrated and stored in secretory vesicles 
are released from the cell by exocytosis when the cell is signaled to 
secrete. 


Essential Concepts 
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e Cells ingest fluid, molecules, and sometimes even particles by endo- 
cytosis, in which regions of plasma membrane invaginate and pinch 
off to form endocytic vesicles. 


e Much of the material that is endocytosed is delivered to endosomes, 
which mature into lysosomes, in which the material is degraded by 
hydrolytic enzymes; most of the components of the endocytic vesicle 
membrane, however, are recycled in transport vesicles back to the 
plasma membrane for reuse. 
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coated vesicle pinocytosis 
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QUESTIONS 


QUESTION 15-9 


Which of the following statements are correct? Explain your 
answers. 


A. Ribosomes are cytoplasmic structures that, during 
protein synthesis, become linked by an mRNA molecule to 
form polyribosomes. 


B. The amino acid sequence Leu-His-Arg-Leu-Asp-Ala-Gln- 
Ser-Lys-Leu-Ser-Ser is a signal sequence that directs proteins 
to the ER. 


C. All transport vesicles in the cell must have a v-SNARE 
protein in their membrane. 


D. Transport vesicles deliver proteins and lipids to the cell 
surface. 


E. If the delivery of prospective lysosomal proteins from the 
trans Golgi network to the late endosomes were blocked, 
lysosomal proteins would be secreted by the constitutive 
secretion pathways shown in Figure 15-30. 


F. Lysosomes digest only substances that have been taken 
up by cells by endocytosis. 


G. N-linked sugar chains are found on glycoproteins that 
face the cell surface, as well as on glycoproteins that face 
the lumen of the ER, trans Golgi network, and mitochondria. 


H. Ribosomes bound to the outer nuclear membrane make 
proteins that are translocated co-translationally into the 
membrane. 


QUESTION 15-10 


Some proteins shuttle back and forth between the nucleus 
and the cytosol. They need a nuclear export signal to get 
out of the nucleus. How do you suppose they get into the 
nucleus? 


QUESTION 15-11 


Influenza viruses enter the cell by receptor-mediated 
endocytosis. The viruses are surrounded by a membrane 
that contains a fusion protein, which is activated by the 
acidic pH in the endosome. Upon activation, the protein 
causes the viral membrane to fuse with cell membranes. An 
old folk remedy against flu recommends that one should 
spend a night in a horse’s stable. Odd as it may sound, 
there is a rational explanation for this advice. Air in stables 
contains ammonia (NH3) generated by bacteria in the 
horse's urine. Sketch a diagram showing the pathway (in 
detail) by which flu virus enters cells, and speculate how 
NH3 may protect cells from virus infection. 

(Hint: NH3 can neutralize acidic solutions by the reaction 
NH3 + H+ > NH4gt.) 


QUESTION 15-12 


Consider the v-SNAREs that direct transport vesicles 
from the trans Golgi network to the plasma membrane. 
They, like all other v-SNAREs, are membrane proteins that 
are integrated into the membrane of the ER during their 
biosynthesis and are then carried by transport vesicles to 


their destination. Thus, transport vesicles budding from 

the ER contain at least two kinds of v-SNAREs—those that 
target the vesicles to the cis Golgi cisternae, and those that 
are in transit to the trans Golgi network to be packaged 

in different transport vesicles destined for the plasma 
membrane. (A) Why might this be a problem? (B) Suggest 
possible ways in which the cell might solve it. 


QUESTION 15-13 


A particular type of Drosophila mutant becomes paralyzed 
when the temperature is raised. The mutation affects the 
structure of dynamin, causing it to be inactivated at the 
higher temperature. Indeed, the function of dynamin was 
discovered by analyzing the defect in these mutant fruit 
flies. The complete paralysis at the elevated temperature 
suggests that synaptic transmission between nerve and 
muscle cells (discussed in Chapter 12) is blocked. Suggest 
why signal transmission at a synapse might require dynamin. 


QUESTION 15-14 


Edit each of the following statements, if required, to make 
them true: “Because nuclear localization sequences are not 
cleaved off by proteases following protein import into the 
nucleus, they can be reused to import nuclear proteins after 
mitosis, when cytosolic and nuclear proteins have become 
intermixed. This is in contrast to ER signal sequences, which 
are cleaved off by a signal peptidase once they reach the 
lumen of the ER. ER signal sequences cannot therefore be 
reused to import ER proteins after mitosis, when cytosolic 
and ER proteins have become intermixed; these ER proteins 
must therefore be degraded and resynthesized.” 


QUESTION 15-15 


Consider a protein that contains an ER signal sequence at its 
N-terminus and a nuclear localization sequence in its middle. 
What do you think the fate of this protein would be? Explain 
your answer. 


QUESTION 15-16 


Compare and contrast protein import into the ER and 
into the nucleus. List at least two major differences in the 
mechanisms, and speculate why the ER mechanism might 
not work for nuclear import and vice versa. 


QUESTION 15-17 


During mitosis, the nuclear envelope breaks down and 
intranuclear proteins completely intermix with cytosolic 
proteins. Is this consistent with the evolutionary scheme 
proposed in Figure 15-3? Explain your answer. 


QUESTION 15-18 


A protein that inhibits certain proteolytic enzymes 
(proteases) is normally secreted into the bloodstream 

by liver cells. This inhibitor protein, antitrypsin, is absent 
from the bloodstream of patients who carry a mutation 
that results in a single amino acid change in the protein. 
Antitrypsin deficiency causes a variety of severe problems, 
particularly in lung tissue, because of the uncontrolled 
activity of proteases. Surprisingly, when the mutant 
antitrypsin is synthesized in the laboratory, it is as active 
as the normal antitrypsin at inhibiting proteases. 


Questions 


Why, then, does the mutation cause the disease? Think of 
more than one possibility and suggest ways in which you 
could distinguish between them. 


QUESTION 15-19 


Dr. Outonalimb’s claim to fame is her discovery of 
forgettin, a protein predominantly made by the pineal 
gland in human teenagers. The protein causes selective, 
short-term unresponsiveness and memory loss when the 
auditory system receives statements like “Please take 
out the garbage!” Her hypothesis is that forgettin has a 
hydrophobic ER signal sequence at its C-terminus that is 
recognized by an SRP and causes it to be translocated 
across the ER membrane by the mechanism shown in 
Figure 15-14. She predicts that the protein is secreted 
from pineal cells into the bloodstream, from where it exerts 
its devastating systemic effects. You are a member of the 
committee deciding whether she should receive a grant 
for further work on her hypothesis. Critique her proposal, 
and remember that grant reviews should be polite and 
constructive. 


QUESTION 15-20 


Taking the evolutionary scheme in Figure 15-3 one step 
further, suggest how the Golgi apparatus could have 
evolved. Sketch a simple diagram to illustrate your ideas. 
For the Golgi apparatus to be functional, what else would 
be needed? 


QUESTION 15-21 


If membrane proteins are integrated into the ER membrane 
by means of the ER protein translocator (which is itself 
composed of membrane proteins), how do the first protein 
translocation channels become incorporated into the ER 
membrane? 


QUESTION 15-22 


The sketch in Figure Q15-22 is a schematic drawing of 
the electron micrograph shown in the third panel of Figure 
15-20A. Name the structures that are labeled in the sketch. 


QUESTION 15-23 


What would happen to proteins bound for the nucleus if 
there were insufficient energy to transport them? 


Figure Q15-22 
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Cell Signaling 


Individual cells, like multicellular organisms, need to sense and respond 
to their environment. A free-living cell—even a humble bacterium—must 
be able to track down nutrients, tell the difference between light and 
dark, and avoid poisons and predators. And if such a cell is to have any 
kind of “social life,” it must be able to communicate with other cells. 
When a yeast cell is ready to mate, for example, it secretes a small pro- 
tein called a mating factor. Yeast cells of the opposite “sex” detect this 
chemical mating call and respond by halting their progress through the 
cell-division cycle and reaching out toward the cell that emitted the sig- 
nal ( J: 


In a multicellular organism, things are much more complicated. Cells 
must interpret the multitude of signals they receive from other cells to 
help coordinate their behaviors. During animal development, for exam- 
ple, cells in the embryo exchange signals to determine which specialized 
role each cell will adopt, what position it will occupy in the animal, and 
whether it will survive, divide, or die. Later in life, a large variety of sig- 
nals coordinates the animal’s growth and its day-to-day physiology and 
behavior. In plants as well, cells are in constant communication with one 
another. These cell-cell interactions allow the plant to coordinate what 
happens in its roots, stems, and leaves. 


In this chapter, we examine some of the most important mechanisms 
by which cells send signals and interpret the signals they receive. First, 
we present an overview of the general principles of cell signaling. We 
then consider two of the main systems animal cells use to receive and 
interpret signals. Finally, we describe a few signaling mechanisms that 
work in a different way—including one that operates in plants—before 
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Figure 16-1 Yeast cells respond 

to mating factor. Budding yeast 
(Saccharomyces cerevisiae) cells are 

(A) normally spherical, but (B) when they 
are exposed to an appropriate mating 
factor produced by neighboring yeast 
cells, they extend a protrusion toward 
the source of the factor. (Courtesy of 
Michael Snyder.) 


Figure 16-2 Signal transduction is the 
process whereby one type of signal is 


converted into another. (A) When a mobile 
telephone receives a radio signal, it converts 


it into a sound signal; when transmitting 

a signal, it does the reverse. (B) A target 
cell converts an extracellular signal into an 
intracellular signal. 


considering how these intricate signaling networks ultimately interact to 
control complex behaviors. 


GENERAL PRINCIPLES OF CELL SIGNALING 


Information can come in a variety of forms, and communication fre- 
quently involves converting the signals that carry that information from 
one form to another. When you receive a call from a friend on your 
mobile phone, for instance, the phone converts radio signals, which 
travel through the air, into sound waves, which you hear. This process of 
conversion is called signal transduction (Figure 16-2). 


The signals that pass between cells are simpler than the sorts of mes- 
sages that humans ordinarily exchange. In a typical communication 
between cells, the signaling cell produces a particular type of extracellular 
signal molecule that is detected by the target cell. As in human conver- 
sation, most animal cells both send and receive signals, and they can 
therefore act as both signaling cells and target cells. 


Target cells possess proteins called receptors that recognize and respond 
specifically to the signal molecule. Signal transduction begins when the 
receptor on a target cell receives an incoming extracellular signal and 
then produces intracellular signaling molecules that alter cell behavior. 
Most of this chapter is concerned with signal reception and transduc- 
tion—the events that cell biologists have in mind when they refer to 
cell signaling. First, however, we look briefly at a few of the different 
types of extracellular signals that cells send to one another—and what 
happens when target cells receive those signals. 


Signals Can Act over a Long or Short Range 


Cells in multicellular organisms use hundreds of kinds of extracellular 
signal molecules to communicate with one another. The signal mol- 
ecules can be proteins, peptides, amino acids, nucleotides, steroids, fatty 
acid derivatives, or even dissolved gases—but they all rely on just a hand- 
ful of basic styles of communication for getting the message across. 


In multicellular organisms, the most “public” style of cell-cell commu- 
nication involves broadcasting the signal throughout the whole body by 
secreting it into an animal's bloodstream or a plant's sap. Extracellular 
signal molecules used in this way are called hormones, and, in animals, 
the cells that produce hormones are called endocrine cells (Figure 16-3A). 
Part of the pancreas, for example, is an endocrine gland that produces 
several hormones—including insulin, which regulates glucose uptake in 
cells all over the body. 


Somewhat less public is the process known as paracrine signaling. In this 
case, rather than entering the bloodstream, the signal molecules diffuse 
locally through the extracellular fluid, remaining in the neighborhood of 


extracellular signal 
molecule 
IN 


target cell 


intracellular 
signaling 
molecule 

OUT 
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the cell that secretes them. Thus, they act as local mediators on nearby 
cells (Figure 16-3B). Many of the signal molecules that regulate inflam- 
mation at the site of an infection or that control cell proliferation in a 
healing wound function in this way. In some cases, cells can respond to 
the local mediators that they themselves produce, a form of paracrine 
communication called autocrine signaling; cancer cells sometimes pro- 
mote their own survival and proliferation in this way. 


Neuronal signaling is a third form of cell communication. Like endocrine 
cells, nerve cells (neurons) can deliver messages over long distances. 
In the case of neuronal signaling, however, a message is not broadcast 
widely but is instead delivered quickly and specifically to individual tar- 
get cells through private lines. As described in Chapter 12, the axon of 
a neuron terminates at specialized junctions (synapses) on target cells 
that can lie far from the neuronal cell body (Figure 16-3C). The axons 
that extend from the spinal cord to the big toe in an adult human, for 
example, can be more than a meter in length. When activated by signals 
from the environment or from other nerve cells, a neuron sends electrical 
impulses racing along its axon at speeds of up to 100 m/sec. On reaching 
the axon terminal, these electrical signals are converted into a chemi- 
cal form: each electrical impulse stimulates the nerve terminal to release 
a pulse of an extracellular signal molecule called a neurotransmitter. 
The neurotransmitter then diffuses across the narrow (<100 nm) gap that 
separates the membrane of the axon terminal from that of the target cell, 
reaching its destination in less than | msec. 


A fourth style of signal-mediated cell-cell communication—the most inti- 
mate and short-range of all—does not require the release of a secreted 
molecule. Instead, the cells make direct physical contact through sig- 
nal molecules lodged in the plasma membrane of the signaling cell 
and receptor proteins embedded in the plasma membrane of the target 
cell (Figure 16-3D). During embryonic development, for example, such 
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Figure 16-3 Animal cells use 
extracellular signal molecules to 
communicate with one another 

in various ways. (A) Hormones 
produced in endocrine glands are 
secreted into the bloodstream and 
are distributed widely throughout the 


by cells into the extracellular fluid in 
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their neighborhood and act locally. 
(C) Neuronal signals are transmitted 
electrically along a nerve cell axon. 
When this electrical signal reaches the 
nerve terminal, it causes the release 
of neurotransmitters onto adjacent 
target cells. (D) In contact-dependent 
signaling, a cell-surface-bound signal 
molecule binds to a receptor protein 
on an adjacent cell. Many of the same 
types of signal molecules are used for 
endocrine, paracrine, and neuronal 
signaling. The crucial differences lie in 


signals are delivered to their targets. 
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contact-dependent signaling allows adjacent cells that are initially similar 
to become specialized to form different cell types, as we discuss later in 
the chapter. 


To get a better feel for these different signaling styles, imagine trying to 
publicize a potentially stimulating lecture—or a concert or sporting event. 
An endocrine signal would be akin to broadcasting the information over 
the radio. A more localized paracrine signal would be the equivalent of 
posting a flyer on selected notice boards near the arena—with an auto- 
crine signal being a reminder you add to your own personal calendar. 


Neuronal signals—long-distance but personal—would be similar to a 
phone call, text message, or e-mail, and contact-dependent signaling 
would be like a good old-fashioned, face-to-face conversation. 


Table 16-1 lists some examples of hormones, local mediators, neuro- 
transmitters, and contact-dependent signal molecules. The actions of 


several of these are discussed in greater detail throughout the chapter. 


TABLE 16-1 SOME EXAMPLES OF SIGNAL MOLECULES 


Signal Molecule Site of Origin Chemical Nature 


Hormones 


Epinephrine (adrenaline) adrenal gland 


Cortisol adrenal gland 
Estradiol ovary 

Insulin B cells of pancreas 
Testosterone testis 


Thyroid hormone (thyroxine) thyroid gland 


Local Mediators 


Epidermal growth factor various cells 


(EGF) 


Platelet-derived growth various cells, including 


factor (PDGF) blood platelets 

Nerve growth factor (NGF) various innervated 
tissues 

Histamine mast cells 


Nitric oxide (NO) nerve cells; endothelial 


cells lining blood vessels 
Neurotransmitters 


Acetylcholine nerve terminals 


y-Aminobutyric acid (GABA) nerve terminals 


Contact-dependent Signal Molecules 


Delta prospective neurons; 
various other 
developing cell types 


derivative of the amino 
acid tyrosine 


steroid (derivative of 
cholesterol) 


steroid (derivative of 
cholesterol) 


protein 
steroid (derivative of 
cholesterol) 


derivative of the amino 
acid tyrosine 


protein 
protein 
protein 
derivative of the amino 


acid histidine 


dissolved gas 


derivative of choline 


derivative of the amino 
acid glutamic acid 


transmembrane 
protein 


increases blood pressure, heart rate, 
and metabolism 


affects metabolism of proteins, carbohydrates, 
and lipids in most tissues 


induces and maintains secondary female 
sexual characteristics 


stimulates glucose uptake, protein synthesis, 
and lipid synthesis in various cell types 


induces and maintains secondary male sexual 
characteristics 


stimulates metabolism in many cell types 


stimulates epidermal and many other cell 
types to proliferate 


stimulates many cell types to proliferate 
promotes survival and axonal growth of certain 
classes of neurons 


causes blood vessels to dilate and become 
leaky, helping to cause inflammation 


causes smooth muscle cells to relax; regulates 
nerve-cell activity 


excitatory neurotransmitter at many nerve- 
muscle synapses and in central nervous system 


inhibitory neurotransmitter in central nervous 
system 


inhibits neighboring cells from becoming 
specialized in same way as the signaling cell 


A Limited Set of Extracellular Signals Can Produce a 
Huge Variety of Cell Behaviors 


A typical cell in a multicellular organism is exposed to hundreds of dif- 
ferent signal molecules in its environment. These may be free in the 
extracellular fluid, embedded in the extracellular matrix in which many 
cells reside, or bound to the surface of neighboring cells. Each cell must 
respond very selectively to this mixture of signals, disregarding some and 
reacting to others, according to the cell's specialized function. 


Whether a cell responds to a signal molecule depends, first of all, on 
whether it possesses a receptor for that signal. Each receptor is usually 
activated by only one type of signal. Without the appropriate receptor, a 
cell will be deaf to the signal and will not respond to it. 


Extracellular signal molecules can be divided into two major classes, 
depending on the type of receptor with which they interact. The first and 
largest class of signals consists of molecules that are too large or too 
hydrophilic to cross the plasma membrane of the target cell. These signal 
molecules rely on receptors on the surface of the target cell to relay their 
message across the plasma membrane (Figure 16-4A). The second class 
of signals consists of molecules that are small enough or hydrophobic 
enough to pass through the plasma membrane and into the cytosol of 
the target cell, where they bind to intracellular receptor proteins (Figure 
16-4B). Here, we focus primarily on signaling through cell-surface recep- 
tors, but we will briefly describe signaling through intracellular receptors 
later in the chapter. 


By producing only a limited set of receptors out of the thousands that are 
possible, a cell restricts the types of signals that can affect it. Of course, 
even this restricted set of extracellular signal molecules can change the 
behavior of a target cell in a large variety of ways, altering its shape, move- 
ment, metabolism, gene expression, or some combination of these. How 
a cell reacts to a signal depends on the set of intracellular signaling mol- 
ecules each cell-surface receptor produces and how these molecules alter 
the activity of effector proteins, which have a direct effect on the behav- 
ior of the target cell. This intracellular relay system and the intracellular 
effector proteins on which it acts vary from one type of specialized cell to 
another, so that different types of cells respond to the same signal in dif- 
ferent ways. For example, when a heart pacemaker cell is exposed to the 
neurotransmitter acetylcholine, its rate of firing decreases. When a salivary 
gland cell is exposed to the same signal, it secretes components of saliva, 
even though the receptors on both cell types are the same. In skeletal 
muscle, acetylcholine binds to a different receptor protein, causing the 
muscle cell to contract (Figure 16-5). Thus, the extracellular signal mol- 
ecule alone is not the message: the information conveyed by the signal 
depends on how the target cell receives and interprets the signal. 


A typical cell possesses many sorts of receptors—each present in tens 
to hundreds of thousands of copies. Such variety makes the cell simul- 
taneously sensitive to many different extracellular signals and allows 


Figure 16-4 Extracellular signal molecules bind either to cell- 
surface receptors or to intracellular receptors. (A) Most extracellular 
signal molecules are large and hydrophilic and are therefore unable to 
cross the plasma membrane directly; instead, they bind to cell-surface 
receptors, which in turn generate one or more intracellular signaling 
molecules in the target cell. (B) Some small, hydrophobic, extracellular 
signal molecules pass through the target cell's plasma membrane and 
bind to intracellular receptors—in the cytosol or in the nucleus (as 
shown here)—that then regulate gene transcription or other functions. 
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QUESTION 16-1 


To keep their action local, paracrine 
signal molecules must be prevented 
from straying too far from their 
points of origin. Suggest different 
ways by which this could be 
accomplished. Explain your answers. 


(A) CELL-SURFACE RECEPTORS 
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(A) heart pacemaker cell 
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Figure 16-5 The same signal molecule 
can induce different responses in 
different target cells. Different cell 

types are configured to respond to the 
neurotransmitter acetylcholine in different 
ways. Acetylcholine binds to similar receptor 
proteins on (A) heart pacemaker cells 

and (B) salivary gland cells, but it evokes 
different responses in each cell type. 

(C) Skeletal muscle cells produce a different 
type of receptor protein for the same signal 
molecule. (D) For such a versatile molecule, 
acetylcholine has a fairly simple chemical 
structure. 


Figure 16-6 An animal cell depends on 
multiple extracellular signals. Every cell 
type displays a set of receptor proteins that 
enables it to respond to a specific set of 
extracellular signal molecules produced by 
other cells. These signal molecules work in 
combinations to regulate the behavior of 
the cell. As shown here, cells may require 
multiple signals (blue arrows) to survive, 
additional signals (red arrows) to grow and 
divide, and still other signals (green arrows) 
to differentiate. If deprived of the necessary 
survival signals, most cells undergo a 

form of cell suicide known as apoptosis 
(discussed in Chapter 18). 
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a relatively small number of signal molecules, used in different com- 
binations, to exert subtle and complex control over cell behavior. A 
combination of signals can evoke a response that is different from the 
sum of the effects that each signal would trigger on its own. As we dis- 
cuss later, this “tailoring” of a cell's response occurs, in part, because 
the intracellular relay systems activated by the different signals interact. 
Thus the presence of one signal will often modify the effects of another. 
One combination of signals might enable a cell to survive; another might 
drive it to differentiate in some specialized way; and another might cause 
it to divide. In the absence of the proper signals, most animal cells are 
programmed to kill themselves (Figure 16-6). 


A Cell’s Response to a Signal Can Be Fast or Slow 


The length of time a cell takes to respond to an extracellular signal can 
vary greatly, depending on what needs to happen once the message has 
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Figure 16-7 Extracellular signals can act slowly or rapidly. Certain types of cell 
responses—such as cell differentiation or increased cell growth and division (see 
Figure 16-6)—involve changes in gene expression and the synthesis of new proteins; 
they therefore occur relatively slowly. Other responses—such as changes in cell 
movement, secretion, or metabolism—need not involve changes in gene expression 
and therefore occur more quickly (see Figure 16-5). 


been received. Some extracellular signals act swiftly: acetylcholine can 
stimulate a skeletal muscle cell to contract within milliseconds and a sali- 
vary gland cell to secrete within a minute or so. Such rapid responses are 
possible because, in each case, the signal affects the activity of proteins 
that are already present inside the target cell, awaiting their marching 
orders. 


Other responses take more time. Cell growth and cell division, when 
triggered by the appropriate signal molecules, can take many hours 
to execute. This is because the response to these extracellular signals 
requires changes in gene expression and the production of new proteins 
(Figure 16-7). We will encounter additional examples of both fast and 
slow responses—and the signal molecules that stimulate them—later in 
the chapter. 


Cell-Surface Receptors Relay Extracellular Signals via 
Intracellular Signaling Pathways 


The majority of extracellular signal molecules are proteins, peptides, or 
small, hydrophilic molecules that bind to cell-surface receptors that span 
the plasma membrane (see Figure 16-4A). Transmembrane receptors 
detect a signal on the outside and relay the message, in a new form, 
across the membrane into the interior of the cell. 


The receptor protein performs the primary step in signal transduction: it 
recognizes the extracellular signal and generates new intracellular sig- 
nals in response (see Figure 16-2B). The resulting intracellular signaling 
process usually works like a molecular relay race, in which the message 
is passed “downstream” from one intracellular signaling molecule to 
another, each activating or generating the next signaling molecule in the 
pathway, until a metabolic enzyme is kicked into action, the cytoskeleton 
is tweaked into a new configuration, or a gene is switched on or off. This 
final outcome is called the response of the cell (Figure 16-8). 
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Figure 16-8 Many extracellular signals 
activate intracellular signaling pathways 
to change the behavior of the target cell. 
A cell-surface receptor protein activates one 
or more intracellular signaling pathways, 
each mediated by a series of intracellular 
signaling molecules, which can be proteins 
or small messenger molecules; only one 
pathway is shown. Signaling molecules 
eventually interact with specific effector 
proteins, altering them to change the 
behavior of the cell in various ways. 


QUESTION 16-2 


In principle, how might an 
intracellular signaling protein amplify 
a signal as it relays it onward? 
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The components of these intracellular signaling pathways perform one 
or more crucial functions (Figure 16-9): 


1. They can relay the signal onward and thereby help spread it through 
the cell. 

2. They can amplify the signal received, making it stronger, so that a few 
extracellular signal molecules are enough to evoke a large intracellular 
response. 

3. They can detect signals from more than one intracellular signaling 
pathway and integrate them before relaying a signal onward. 

4. They can distribute the signal to more than one effector protein, 
creating branches in the information flow diagram and evoking a 
complex response. 

5. They can modulate the response to the signal by regulating the activity 
of components upstream in the signaling pathway, a process known 
as feedback. 

Feedback regulation, although it is last on our list, is actually a very 
important feature of cell signaling. It can occur anywhere in the signaling 
pathway and can either boost or weaken the response to the signal. In 
positive feedback, a component that lies downstream in the pathway acts 
on an earlier component in the same pathway to enhance the response to 
the initial signal; in negative feedback, a downstream component acts to 
inhibit an earlier component in the pathway to diminish the response to 
the initial signal (Figure 16-10). Such feedback regulation is very common 
in biological systems and can lead to sophisticated responses: positive 
feedback can generate all-or-none, switchlike responses, for example, 
whereas negative feedback can generate responses that oscillate on and 
off as the activities or concentrations of the inhibitory components rise 
and fall. 
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Figure 16-9 Intracellular signaling 
proteins can relay, amplify, integrate, 
distribute, and modulate via feedback 
an incoming signal. In this example, a 
receptor protein located on the cell surface 
transduces an extracellular signal into an 
intracellular signal, which initiates one or 
more intracellular signaling pathways that 
relay the signal into the cell interior. Each 
pathway includes intracellular signaling 
proteins that can function in one of the 
various ways shown; some, for example, 
integrate signals from other intracellular 
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Some Intracellular Signaling Proteins Act as Molecular 
Switches 


Many intracellular signaling proteins behave as molecular switches: 
receipt of a signal causes them to toggle from an inactive to an active 
state. Once activated, these proteins can stimulate—or in some cases 
suppress—other proteins in the signaling pathway. They then persist in 
an active state until some other process switches them off again. 


The importance of the switching-off process is often underappreciated: 
imagine the consequences if a signaling pathway that boosts your heart 
rate were to remain active indefinitely. If a signaling pathway is to recover 
after transmitting a signal and make itself ready to transmit another, 
every activated protein in the pathway must be reset to its original, 
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Figure 16-10 Feedback regulation within an intracellular signaling pathway can adjust the response to an extracellular signal. 
(A) In this simple example, a downstream protein in a signaling pathway, protein Y, acts to increase the activity of the protein that 
activated it—a form of positive feedback. Positive feedback loops can ignite an explosive response, such as the activation of the 
proteins that trigger cell division (discussed in Chapter 18). (B) In a simple example of negative feedback, protein Y inhibits the 
protein that activated it. Negative feedback loops can generate oscillations, similar to the way that populations of predators and prey 
can seesaw: an increase in prey (here, protein T) would promote the expansion of predators (protein Y); as the number of predators 
increases, the availability of prey will fall (via negative feedback), which will ultimately cause the predator population to decline. As the 
predators disappear, the prey populations will recover and multiply, providing food for more predators, and so on. 
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Figure 16-11 Many intracellular signaling 
proteins act as molecular switches. These 
proteins can be activated—or in some cases 
inhibited—by the addition or removal of a 
phosphate group. (A) In one class of switch 
protein, the phosphate is added covalently 
by a protein kinase, which transfers the 
terminal phosphate group from ATP to 

the signaling protein; the phosphate is 

then removed by a protein phosphatase. 

(B) In the other class of switch protein, a 
GTP-binding protein is activated when it 
exchanges its bound GDP for GTP (which, in 
a sense, adds a phosphate to the protein); 
the protein then switches itself off by 
hydrolyzing its bound GTP to GDP. 


INACTIVE 
MONOMERIC GTPase 


Figure 16-12 The activity of monomeric 
GTPases is controlled by two types of 
regulatory proteins. Guanine nucleotide 
exchange factors (GEFs) promote the 
exchange of GDP for GTP, thereby switching 
the protein on. GTPase-activating proteins 
(GAPs) stimulate the hydrolysis of GTP to 
GDP, thereby switching the protein off. 
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unstimulated state. Thus, for every activation step along the pathway, 
there exists an inactivation mechanism. The two are equally important 
for a signaling pathway to be useful. 


Proteins that act as molecular switches fall mostly into one of two 
classes. The first—and by far the largest—class consists of proteins that 
are activated or inactivated by phosphorylation, a chemical modification 
discussed in Chapter 4 (see Figure 4—46). For these molecules, the switch 
is thrown in one direction by a protein kinase, which covalently attaches 
a phosphate group onto the switch protein, and in the opposite direction 
by a protein phosphatase, which takes the phosphate off again (Figure 
16-11A). The activity of any protein that is regulated by phosphorylation 
depends—moment by moment—on the balance between the activities of 
the protein kinases that phosphorylate it and the protein phosphatases 
that dephosphorylate it. 


Many of the switch proteins controlled by phosphorylation are themselves 
protein kinases, and these are often organized into phosphorylation cas- 
cades: one protein kinase, activated by phosphorylation, phosphorylates 
the next protein kinase in the sequence, and so on, transmitting the sig- 
nal onward and, in the process, amplifying, distributing, and regulating 
it. Two main types of protein kinases operate in intracellular signaling 
pathways: the most common are serine/threonine kinases, which—as 
the name implies—phosphorylate proteins on serines or threonines; oth- 
ers are tyrosine kinases, which phosphorylate proteins on tyrosines. 


The other class of switch proteins involved in intracellular signaling path- 
ways are GTP-binding proteins. These toggle between an active and 
an inactive state depending on whether they have GTP or GDP bound 
to them, respectively (Figure 16-11B). Once activated by GTP binding, 
many of these proteins have intrinsic GTP-hydrolyzing (GTPase) activity, 
and they shut themselves off by hydrolyzing their bound GTP to GDP. 


Two main types of GTP-binding proteins participate in intracellular sig- 
naling. The first type—the large, trimeric GTP-binding proteins (also called 
G proteins)—relay messages from G-protein-coupled receptors. We discuss 
this major class of GTP-binding proteins in detail shortly. 


Other cell-surface receptors rely on a second type of GTP-binding pro- 
tein—the small, monomeric GTPases—to help relay their signals. These 
switch proteins are generally aided by two sets of regulatory proteins that 
help them bind and hydrolyze GTP: guanine nucleotide exchange factors 
(GEFs) activate the switches by promoting the exchange of GDP for GTP, 
and GTPase-activating proteins (GAPs) turn them off by promoting GTP 
hydrolysis (Figure 16-12). 


Cell-Surface Receptors Fall into Three Main Classes 


All cell-surface receptor proteins bind to an extracellular signal molecule 
and transduce its message into one or more intracellular signaling mol- 
ecules that alter the cell's behavior. Most of these receptors belong to 
one of three large classes, which differ in the transduction mechanism 
they use. 


1. Jon-channel-coupled receptors change the permeability of the plasma 
membrane to selected ions, thereby altering the membrane potential 
and, if the conditions are right, producing an electrical current (Figure 
16-13A). 

2. G-protein-coupled receptors activate membrane-bound, trimeric 
GTP-binding proteins (G proteins), which then activate (or inhibit) 
an enzyme or an ion channel in the plasma membrane, initiating an 
intracellular signaling cascade (Figure 16-13B). 

3. Enzyme-coupled receptors either act as enzymes or associate with 
enzymes inside the cell (Figure 16-13C); when stimulated, the enzymes 
can activate a wide variety of intracellular signaling pathways. 


The number of different types of receptors in each of these three classes 
is even greater than the number of extracellular signals that act on them. 
This is because for many extracellular signal molecules there is more 
than one type of receptor, and these may belong to different receptor 
classes. The neurotransmitter acetylcholine, for example, acts on skeletal 
muscle cells via an ion-channel-coupled receptor, whereas in heart cells 
it acts through a G-protein-coupled receptor. These two types of recep- 
tors generate different intracellular signals and thus enable the two types 
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Figure 16-13 Cell-surface receptors fall 
into one of three main classes. (A) An 
ion-channel-coupled receptor opens in 
response to binding an extracellular signal 
molecule. These channels are also called 
transmitter-gated ion channels. (B) When 

a G-protein-coupled receptor binds its 
extracellular signal molecule, the activated 
receptor signals to a trimeric G protein on 
the cytosolic side of the plasma membrane, 
which then turns on (or off) an enzyme (or 
an ion channel; not shown) in the same 
membrane. (C) When an enzyme-coupled 
receptor binds its extracellular signal 
molecule, an enzyme activity is switched on 
at the other end of the receptor, inside the 
cell. Many enzyme-coupled receptors have 
their own enzyme activity (left), while others 
rely on an enzyme that becomes associated 
with the activated receptor (right). 
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TABLE 16-2 SOME FOREIGN SUBSTANCES THAT ACT ON CELL-SURFACE RECEPTORS 


Barbiturates and y-aminobutyric acid stimulate GABA-activated ion-channel- relief of anxiety; sedation 
benzodiazepines (GABA) coupled receptors 
(Valium and Ambien) 
Nicotine acetylcholine stimulates acetylcholine-activated ion- constriction of blood vessels; elevation 
channel-coupled receptors of blood pressure 
Morphine and heroin endorphins and stimulate G-protein-coupled opiate analgesia (relief of pain); euphoria 
enkephalins receptors 
Curare acetylcholine blocks acetylcholine-activated ion- blockage of neuromuscular transmission, 
channel-coupled receptors resulting in paralysis 
Strychnine glycine blocks glycine-activated ion-channel- blockage of inhibitory synapses in spinal 
coupled receptors cord and brain, resulting in seizures and 
muscle spasm 
Capsaicin heat stimulates temperature-sensitive ion- induces painful, burning sensation; 
channel-coupled receptors prolonged exposure paradoxically leads 
to pain relief 
Menthol cold stimulates temperature-sensitive ion- in moderate amounts, induces a cool 
channel-coupled receptors sensation; in higher doses, can cause 


burning pain 


of cells to react to acetylcholine in different ways, increasing contraction 
in skeletal muscle and decreasing the rate of contractions in the heart 
(see Figure 16-5A and C). 


This plethora of cell-surface receptors also provides targets for many 
foreign substances that interfere with our physiology, from heroin and 
nicotine to tranquilizers and chili peppers. These substances either block 
or overstimulate the receptor’s natural activity. Many drugs and poisons 
act in this way (Table 16-2), and a large part of the pharmaceutical indus- 
try is devoted to producing drugs that will exert a precisely defined effect 
by binding to a specific type of cell-surface receptor. 


lon-Channel-Coupled Receptors Convert Chemical 
Signals into Electrical Ones 


Of all the types of cell-surface receptors, ion-channel-coupled receptors 
(also known as transmitter-gated ion channels) function in the simplest 
and most direct way. As we discuss in detail in Chapter 12, these recep- 
tors are responsible for the rapid transmission of signals across synapses 
in the nervous system. They transduce a chemical signal, in the form 
of a pulse of secreted neurotransmitter molecules delivered to a target 
cell, directly into an electrical signal, in the form of a change in voltage 
across the target cell’s plasma membrane (see Figure 12-41). When the 
neurotransmitter binds to ion-channel-coupled receptors on the surface 
of a target cell, the receptor alters its conformation so as to open a chan- 
nel in the target cell membrane, rendering it permeable to specific types 
of ions, such as Na+, K+, or Ca? (see Figure 16-13A and Movie 16.1). 
Driven by their electrochemical gradients, the ions rush into or out of the 
cell, creating a change in the membrane potential within milliseconds. 
This change in potential may trigger a nerve impulse or make it easier 
(or harder) for other neurotransmitters to do so. As we discuss later, the 
opening of Ca?+ channels has additional important effects, as changes in 
the Ca?* concentration in the target-cell cytosol can profoundly alter the 
activities of many Ca?+-responsive proteins. 


Whereas ion-channel-coupled receptors are especially important in 
nerve cells and other electrically excitable cells such as muscle cells, 
G-protein-coupled receptors and enzyme-coupled receptors are impor- 
tant for practically every cell type in the body. Most of the remainder of 
this chapter deals with these two receptor families and with the signal 
transduction processes that they use. 


G-PROTEIN-COUPLED RECEPTORS 


G-protein-coupled receptors (GPCRs) form the largest family of cell- 
surface receptors. There are more than 700 GPCRs in humans, and mice 
have about 1000 involved in the sense of smell alone. These receptors 
mediate responses to an enormous diversity of extracellular signal mol- 
ecules, including hormones, local mediators, and neurotransmitters. The 
signal molecules that bind GPCRs are as varied in structure as they are 
in function: they can be proteins, small peptides, or derivatives of amino 
acids or fatty acids, and for each one of them there is a different receptor 
or set of receptors. Because GPCRs are involved in such a large variety of 
cell processes, they are an attractive target for the development of drugs 
to treat many disorders. About one-third of all drugs used today work 
through GPCRs. 


Despite the diversity of the signal molecules that bind to them, all GPCRs 
have a similar structure: each is made of a single polypeptide chain that 
threads back and forth across the lipid bilayer seven times (Figure 16-14). 
The GPCR superfamily includes rhodopsin (the light-activated photore- 
ceptor protein in the vertebrate eye), the olfactory (smell) receptors in the 
vertebrate nose, and the receptors that participate in the mating rituals 
of single-celled yeasts (see Figure 16-1). Evolutionarily speaking, GPCRs 
are ancient: even prokaryotes possess structurally similar membrane 
proteins—such as the bacteriorhodopsin that functions as a light-driven 
H+ pump (see Figure 11-28). Although they resemble eukaryotic GPCRs, 
these prokaryotic proteins do not act through G proteins, but are coupled 
to other signal transduction systems. 


We begin this section with a discussion of how G proteins are activated 
by GPCRs. We then consider how activated G proteins stimulate ion 
channels and how they regulate membrane-bound enzymes that control 
the concentrations of small intracellular messenger molecules, includ- 
ing cyclic AMP and Ca?*, which in turn control the activity of important 
intracellular signaling proteins. We end with a discussion of how light- 
activated GPCRs in photoreceptors in our eyes enable us to see. 


Stimulation of GPCRs Activates G-Protein Subunits 


When an extracellular signal molecule binds to a GPCR, the receptor 
protein undergoes a conformational change that enables it to activate a 
G protein located on the other side of the plasma membrane. To explain 
how this activation leads to the transmission of a signal, we must first 
consider how G proteins are constructed and how they operate. 


There are several varieties of G proteins. Each is specific for a particular 
set of receptors and for a particular set of target enzymes or ion channels 
in the plasma membrane. All of these G proteins, however, have a simi- 
lar general structure and operate in a similar way. They are composed 
of three protein subunits—a, B, and y—two of which are tethered to the 
plasma membrane by short lipid tails. In the unstimulated state, the a 
subunit has GDP bound to it, and the G protein is idle (Figure 16-15A). 
When an extracellular signal molecule binds to its receptor, the altered 
receptor activates a G protein by causing the a subunit to decrease its 
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Figure 16-14 All GPCRs possess a similar 
structure. The polypeptide chain traverses 
the membrane as seven a helices. The 
cytoplasmic portions of the receptor bind to 
a G protein inside the cell. (A) For receptors 
that recognize small signal molecules, such 
as acetylcholine or epinephrine, the ligand 
(red) usually binds deep within the plane of 
the membrane to a pocket that is formed 
by amino acids from several transmembrane 
segments. Receptors that recognize signal 
molecules that are proteins usually have a 
large, extracellular domain that, together 
with some of the transmembrane segments, 
binds the protein ligand (not shown). 

(B) Shown here is the structure of a GPCR 
that binds to epinephrine (red). Stimulation 
of this receptor by epinephrine makes the 
heart beat faster. 
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Figure 16-15 An activated GPCR 
activates G proteins by encouraging 

the a subunit to expel its GDP and pick 
up GTP. (A) In the unstimulated state, 

the receptor and the G protein are both 
inactive. Although they are shown here as 
separate entities in the plasma membrane, 
in some cases they are associated in a 
preformed complex. (B) Binding of an 
extracellular signal molecule to the receptor 
changes the conformation of the receptor, 
which in turn alters the conformation of 

the bound G protein. The alteration of 

the a subunit of the G protein allows it to 
exchange its GDP for GTP. This exchange 
triggers an additional conformational 
change that activates both the a subunit 
and a By complex, which dissociate to 
interact with their preferred target proteins 
in the plasma membrane (Movie 16.2). The 
receptor stays active as long as the external 
signal molecule is bound to it, and it can 
therefore activate many molecules of G 
protein. Note that both the a and y subunits 
of the G protein have covalently attached 
lipid molecules (red) that help anchor the 
subunits to the plasma membrane. 
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affinity for GDP, which is then exchanged for a molecule of GTP. In some 
cases, this activation breaks up the G-protein subunits, so that the acti- 
vated a subunit, clutching its GTP, detaches from the By complex, which 
is also activated (Figure 16-15B). The two activated parts of the G pro- 
tein—the a subunit and the By complex—can then each interact directly 
with target proteins in the plasma membrane, which in turn may relay the 
signal to other destinations in the cell. The longer these target proteins 
remain bound to an a subunit or a By complex, the more prolonged the 
relayed signal will be. 


The amount of time that the a subunit and By complex remain “switched 
on”—and hence available to relay signals—also determines how long a 
response lasts. This timing is controlled by the behavior of the a subunit. 
The a subunit has an intrinsic GTPase activity, and it eventually hydro- 
lyzes its bound GTP to GDP, returning the whole G protein to its original, 
inactive conformation (Figure 16-16). GTP hydrolysis and inactivation 
usually occur within seconds after the G protein has been activated. The 
inactive G protein is then ready to be reactivated by another activated 
receptor. 
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Some Bacterial Toxins Cause Disease by Altering the 
Activity of G Proteins 


G proteins offer a striking example of the importance of being able to shut 
down a signal, as well as turn it on. Disrupting the activation—and deacti- 
vation—of G proteins can have dire consequences for a cell or organism. 
Consider cholera, for example. The disease is caused by a bacterium that 
multiplies in the human intestine, where it produces a protein called chol- 
era toxin. This protein enters the cells that line the intestine and modifies 
the a subunit of a G protein called G;—so named because it stimulates 
the enzyme adenylyl cyclase, which we discuss shortly. The modification 
prevents Gs from hydrolyzing its bound GTP, thus locking the G protein 
in an active state, in which it continuously stimulates adenylyl cyclase. 
In intestinal cells, this stimulation causes a prolonged and excessive out- 
flow of Cl and water into the gut, resulting in catastrophic diarrhea and 
dehydration. The condition often leads to death unless urgent steps are 
taken to replace the lost water and ions. 


A similar situation occurs in whooping cough (pertussis), a common res- 
piratory infection against which infants are now routinely vaccinated. In 
this case, the disease-causing bacterium colonizes the lung, where it pro- 
duces a protein called pertussis toxin. This protein alters the a subunit of 
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Figure 16-16 The G protein a subunit 
switches itself off by hydrolyzing its 
bound GTP to GDP. When an activated 

a subunit interacts with its target protein, 

it activates that target protein for as long 

as the two remain in contact. (In some 
cases, the a subunit instead inactivates 

its target; not shown.) The a subunit then 
hydrolyzes its bound GTP to GDP—an event 
that takes place usually within seconds of 
G-protein activation. The hydrolysis of GTP 
inactivates the a subunit, which dissociates 
from its target protein and—if the a subunit 
had separated from the By complex (as 
shown)—reassociates with a By complex to 
re-form an inactive G protein. The G protein 
is now ready to couple to another activated 
receptor, as in Figure 16—-15B. Both the 
activated a subunit and the activated By 
complex can interact with target proteins 

in the plasma membrane. See also 

Movie 16.2. 


QUESTION 16-3 


GPCRs activate G proteins by 
reducing the strength of GDP 
binding to the G protein. This 
results in rapid dissociation of 
bound GDP, which is then replaced 
by GTP, because GTP is present 

in the cytosol in much higher 
concentrations than GDP. What 
consequences would result from 

a mutation in the a subunit of a 

G protein that caused its affinity 
for GDP to be reduced without 
significantly changing its affinity for 
GTP? Compare the effects of this 
mutation with the effects of cholera 
toxin. 
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Figure 16-17 A G; protein directly 


a different type of G protein, called Gj because it inhibits adenylyl cyclase. 
In this case, however, modification by the toxin disables the G protein 
by locking it into its inactive GDP-bound state. Inhibiting Gj, like activat- 
ing Gs, results in the prolonged and inappropriate activation of adenylyl 
cyclase, which, in this case, stimulates coughing. Both the diarrhea- 
producing effects of cholera toxin and the cough-provoking effects of 
pertussis toxin help the disease-causing bacteria move from host to host. 


Some G Proteins Directly Regulate lon Channels 


There are about 20 different types of mammalian G proteins, each 
activated by a particular set of cell-surface receptors and dedicated to 
activating a particular set of target proteins. These target proteins are 
either enzymes or ion channels in the plasma membrane. Thus, the bind- 
ing of an extracellular signal molecule to a GPCR leads to changes in the 
activities of a specific subset of the possible target proteins in the plasma 
membrane, producing a response that is appropriate for that signal and 
that type of cell. 


We look first at an example of direct G-protein regulation of ion chan- 
nels. The heartbeat in animals is controlled by two sets of nerves: one 
speeds the heart up, the other slows it down. The nerves that signal 
a slowdown in heartbeat do so by releasing acetylcholine (see Figure 
16-5A), which binds to a GPCR on the surface of the heart pacemaker 
cells. This GPCR activates the G protein, Gj. In this case, the By complex 
binds to the intracellular face of a K* channel in the plasma membrane 
of the pacemaker cell, forcing the ion channel into an open conformation 
(Figure 16-17A and B). This channel opening slows the heart rate by 
increasing the plasma membrane’s permeability to K+, which makes it 
more difficult to electrically activate, as explained in Chapter 12. The 
original signal is terminated—and the K* channel recloses—when the 
a subunit inactivates itself by hydrolyzing its bound GTP, returning the 
G protein to its inactive state (Figure 16-17C). 
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(A) Binding of the neurotransmitter 
acetylcholine to its GPCR on the heart cells 
results in the activation of the G protein, Gi. 
(B) The activated By complex directly opens 
a K* channel in the plasma membrane, 
increasing its permeability to K* and thereby 
making the membrane harder to activate 
and slowing the heart rate. (C) Inactivation 
of the a subunit by hydrolysis of its bound 
GTP returns the G protein to its inactive 
state, allowing the K* channel to close. 
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Figure 16-18 Enzymes activated by G proteins increase the 
concentrations of small intracellular signaling molecules. 
Because each activated enzyme generates many molecules of 
these second messengers, the signal is greatly amplified at this 
step in the pathway (see Figure 16-28). The signal is relayed 
onward by the second messenger molecules, which bind to 
specific signaling proteins in the cell and influence their activity. 


Many G Proteins Activate Membrane-bound Enzymes 
That Produce Small Messenger Molecules 


When G proteins interact with ion channels they cause an immediate 
change in the state and behavior of the cell. The interaction of activated 
G proteins with enzymes, in contrast, has consequences that are less 
rapid and more complex, as they lead to the production of additional 
intracellular signaling molecules. The two most frequent target enzymes 
for G proteins are adenylyl cyclase, which produces a small molecule called 
cyclic AMP, and phospholipase C, which generates small molecules called 
inositol trisphosphate and diacylglycerol. Inositol trisphosphate, in turn, 
promotes the accumulation of cytosolic Ca?t—yet another intracellular 
signaling molecule. 


Adenylyl cyclase and phospholipase C are activated by different types of 
G proteins, allowing cells to couple the production of the small molecules 
to different extracellular signals. Although the coupling may be either 
stimulatory or inhibitory—as we saw in our discussion of the actions of 
cholera toxin and pertussis toxin—we concentrate here on G proteins 
that stimulate enzyme activity. 


The small molecules generated by these enzymes are often called second 
messengers—the “first messengers” being the extracellular signals that 
activated the enzymes in the first place. Once activated, the enzymes 
generate large quantities of second messengers, which rapidly diffuse 
away from their source, thereby amplifying and spreading the intracel- 
lular signal (Figure 16-18). 


Different second messenger molecules produce different responses. We 
first examine the consequences of an increase in the cytosolic concen- 
tration of cyclic AMP. This will take us along one of the main types of 
signaling pathways that lead from the activation of GPCRs. We then dis- 
cuss the actions of three other second messenger molecules—inositol 
trisphosphate, diacylglycerol, and Ca?*—which will lead us along a dif- 
ferent signaling route. 


The Cyclic AMP Signaling Pathway Can Activate 
Enzymes and Turn On Genes 


Many extracellular signals acting via GPCRs affect the activity of the 
enzyme adenylyl cyclase and thus alter the intracellular concentration 
of the second messenger molecule cyclic AMP. Most commonly, the acti- 
vated G protein a subunit switches on the adenylyl cyclase, causing a 
dramatic and sudden increase in the synthesis of cyclic AMP from ATP 
(which is always present in the cell). To help terminate the signal, a sec- 
ond enzyme, called cyclic AMP phosphodiesterase, rapidly converts cyclic 
AMP to ordinary AMP (Figure 16-19). One way that caffeine acts as a 
stimulant is by inhibiting this phosphodiesterase in the nervous system, 
blocking cyclic AMP degradation and thereby keeping the concentration 
of this second messenger high. 
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Figure 16-19 Cyclic AMP is synthesized 
by adenylyl cyclase and degraded by 
cyclic AMP phosphodiesterase. Cyclic 
AMP (abbreviated cAMP) is formed from 
ATP by a cyclization reaction that removes 
two phosphate groups from ATP and 
joins the “free” end of the remaining 
phosphate group to the sugar part of 

the AMP molecule (red bond). The 
degradation reaction breaks this new 
bond, forming AMP. 
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Figure 16-20 The concentration of 

cyclic AMP rises rapidly in response to 
an extracellular signal. A nerve cell in 
culture responds to the binding of the 
neurotransmitter serotonin to a GPCR by 
synthesizing cyclic AMP. The concentration 
of intracellular cyclic AMP was monitored 
by injecting into the cell a fluorescent 
protein whose fluorescence changes when 
it binds cyclic AMP. Blue indicates a low 
level of cyclic AMP, yellow an intermediate 
level, and red a high level. (A) In the resting 
cell, the cyclic AMP concentration is about 
5 x 10-8 M. (B) Fifty seconds after adding 
serotonin to the culture medium, the 
intracellular concentration of cyclic AMP 
has risen more than twentyfold (to >10% M) 
in the parts of the cell where the serotonin 
receptors are concentrated. (From B.J. 
Bacskai et al. Science 260:222-226, 1993.) 
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Cyclic AMP phosphodiesterase is continuously active inside the cell. 
Because it eliminates cyclic AMP so quickly, the cytosolic concentration 
of this second messenger can change rapidly in response to extracellular 
signals, rising or falling tenfold in a matter of seconds (Figure 16-20). 
Cyclic AMP is water-soluble, so it can, in some cases, carry the signal 
throughout the cell, traveling from the site on the membrane where 
it is synthesized to interact with proteins located in the cytosol, in the 
nucleus, or on other organelles. 


Cyclic AMP exerts most of its effects by activating the enzyme cyclic- 
AMP-dependent protein kinase (PKA). This enzyme is normally held 
inactive in a complex with a regulatory protein. The binding of cyclic AMP 
to the regulatory protein forces a conformational change that releases 
the inhibition and unleashes the active kinase. Activated PKA then cata- 
lyzes the phosphorylation of particular serines or threonines on specific 
intracellular proteins, thus altering the activity of these target proteins. In 
different cell types, different sets of proteins are available to be phospho- 
rylated, which largely explains why the effects of cyclic AMP vary with the 
type of target cell. 


Many kinds of cell responses are mediated by cyclic AMP; a few are listed 
in Table 16-3. As the table shows, different target cells respond very 
differently to extracellular signals that change intracellular cyclic AMP 
concentrations. When we are frightened or excited, for example, the 
adrenal gland releases the hormone epinephrine (also called adrenaline), 
which circulates in the bloodstream and binds to a class of GPCRs called 
adrenergic receptors (see Figure 16-14B), which are present on many 
types of cells. The consequences of epinephrine binding vary from one 
cell type to another, but all the cell responses help prepare the body for 


TABLE 16-3 SOME CELL RESPONSES MEDIATED BY CYCLIC AMP 


Extracellular Signal | Target Tissue Major Response 
Molecule* 


Epinephrine heart increase in heart rate and force of 
contraction 

Epinephrine skeletal muscle glycogen breakdown 

Epinephrine, fat fat breakdown 

glucagon 


Adrenocorticotropic cortisol secretion 


hormone (ACTH) 


adrenal gland 


*Although all of the signal molecules listed here are hormones, some responses 
to local mediators and to neurotransmitters are also mediated by cyclic AMP. 
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sudden action. In skeletal muscle, for instance, epinephrine increases 
intracellular cyclic AMP, causing the breakdown of glycogen—the polym- 
erized storage form of glucose. It does so by activating PKA, which leads 
to both the activation of an enzyme that promotes glycogen breakdown 
(Figure 16-21) and the inhibition of an enzyme that drives glycogen syn- 
thesis. By stimulating glycogen breakdown and inhibiting its synthesis, 
the increase in cyclic AMP maximizes the amount of glucose available as 
fuel for anticipated muscular activity. Epinephrine also acts on fat cells, 
stimulating the breakdown of fat to fatty acids. These fatty acids can then 
be exported to fuel ATP production in other cells. 


In some cases, the effects of increasing cyclic AMP are rapid; in skeletal 
muscle, for example, glycogen breakdown occurs within seconds of epi- 
nephrine binding to its receptor (see Figure 16-21). In other cases, cyclic 
AMP responses involve changes in gene expression that take minutes 
or hours to develop. In these slow responses, PKA typically phospho- 
rylates transcription regulators, proteins that activate the transcription 
of selected genes (as discussed in Chapter 8). For example, an increase 
in cyclic AMP in certain neurons in the brain controls the production of 
proteins involved in some forms of learning. Figure 16-22 illustrates a 
typical cyclic-AMP-mediated pathway from the plasma membrane to the 
nucleus. 


We now turn to the other enzyme-mediated signaling pathway that leads 
from GPCRs—the pathway that begins with the activation of the mem- 
brane-bound enzyme phospholipase C and leads to an increase in the 
second messengers diacylglycerol, inositol trisphosphate, and Ca?*. 
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Figure 16-21 Epinephrine stimulates 
glycogen breakdown in skeletal muscle 
cells. The hormone activates a GPCR, which 
turns on a G protein (G,) that activates 
adenylyl cyclase to boost the production 

of cyclic AMP. The increase in cyclic AMP 
activates PKA, which phosphorylates and 
activates an enzyme called phosphorylase 
kinase. This kinase activates glycogen 
phosphorylase, the enzyme that breaks 
down glycogen (see Figure 13-22). Because 
these reactions do not involve changes in 
gene transcription or new protein synthesis, 
they occur rapidly. 


QUESTION 16-4 


Explain why cyclic AMP must be 
broken down rapidly in a cell to 
allow rapid signaling. 
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Figure 16-22 A rise in intracellular cyclic 
AMP can activate gene transcription. 

PKA, activated by a rise in intracellular 

cyclic AMP, can enter the nucleus and 
phosphorylate specific transcription 
regulators. Once phosphorylated, these 
proteins stimulate the transcription of a 
whole set of target genes (Movie 16.3). 
This type of signaling pathway controls 
many processes in cells, ranging from 
hormone synthesis in endocrine cells to the 
production of proteins involved in long-term 
memory in the brain. Activated PKA can also 
phosphorylate and thereby regulate other 
proteins and enzymes in the cytosol, as 
shown in Figure 16-21. 


cyclic AMP 
ogee 
e e 
inactive PKA — ht, active 
PKA 
nuclear pore nS 
CYTOSOL \ 
NUCLEUS 
active inactive 
PKA transcription 
sor regulator 
Feri 
activated, phosphorylated 
transcription regulator 
May 
E activated target gene 
| 
P| 


| 


The Inositol Phospholipid Pathway Triggers a Rise in 
Intracellular Ca2+ 


Some GPCRs exert their effects through a G protein called Gg, which acti- 
vates the membrane-bound enzyme phospholipase C instead of adenylyl 
cyclase. Examples of signal molecules that act through phospholipase C 
are given in Table 16-4. 


Once activated, phospholipase C propagates the signal by cleaving a lipid 
molecule that is a component of the plasma membrane. The molecule is 
an inositol phospholipid (a phospholipid with the sugar inositol attached 
to its head) that is present in small quantities in the cytosolic leaflet of the 
membrane lipid bilayer (see Figure 11-19). Because of the involvement 
of this phospholipid, the signaling pathway that begins with the activa- 
tion of phospholipase C is often referred to as the inositol phospholipid 
pathway. It operates in almost all eukaryotic cells and regulates a large 
number of different effector proteins. 


The cleavage of a membrane inositol phospholipid by phospholipase C 
generates two second messenger molecules: inositol 1,4,5-trisphos- 
phate (IP3) and diacylglycerol (DAG). Both molecules play a crucial part 
in relaying the signal (Figure 16-23). 


IP3 is a water-soluble sugar phosphate that is released into the cytosol; 
there it binds to and opens Ca% channels that are embedded in the endo- 
plasmic reticulum (ER) membrane. Ca?* stored inside the ER rushes out 


TABLE 16-4 SOME CELL RESPONSES MEDIATED BY PHOSPHOLIPASE C 
ACTIVATION 


Vasopressin (a peptide hormone) liver 


Signal Molecule 
glycogen breakdown 


secretion of amylase 
(a digestive enzyme) 


Acetylcholine pancreas 


Acetylcholine skeletal muscle contraction 


Thrombin (a proteolytic enzyme) blood platelets aggregation 
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into the cytosol through these open channels, causing a sharp rise in the 
cytosolic concentration of free Ca?*, which is normally kept very low. 
This Ca** in turn signals to other proteins, as we discuss shortly. 


Diacylglycerol is a lipid that remains embedded in the plasma membrane 
after it is produced by phospholipase C; there, it helps recruit and acti- 
vate a protein kinase, which translocates from the cytosol to the plasma 
membrane. This enzyme is called protein kinase C (PKC) because it also 
needs to bind Ca% to become active (see Figure 16-23). Once activated, 
PKC phosphorylates a set of intracellular proteins that varies depending 
on the cell type. 


A Ca?+ Signal Triggers Many Biological Processes 


Ca? has such an important and widespread role as an intracellular mes- 
senger that we will digress to consider its functions more generally. A 
surge in the cytosolic concentration of free Ca?* is triggered by many 
kinds of cell stimuli, not only those that act through GPCRs. When a 
sperm fertilizes an egg cell, for example, Ca?* channels open, and the 
resulting rise in cytosolic Ca?* triggers the egg to start development 
(Figure 16-24); for muscle cells, a signal from a nerve triggers a rise in 
cytosolic Ca% that initiates muscle contraction (discussed in Chapter 17; 
see Figure 17-45); and in many secretory cells, including nerve cells, 
Ca?* triggers secretion (discussed in Chapter 12; see Figure 12-40). Ca?+ 
stimulates all these responses by binding to and influencing the activity 
of various Ca?*-responsive proteins. 


The concentration of free Ca?* in the cytosol of an unstimulated cell is 
extremely low (10-7 M) compared with its concentration in the extracel- 
lular fluid (about 10-3 M) and in the ER. These differences are maintained 
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Figure 16-23 Phospholipase C activates 
two signaling pathways. Two messenger 
molecules are produced when a membrane 
inositol phospholipid is hydrolyzed by 
activated phospholipase C. Inositol 
1,4,5-trisphosphate (IP3) diffuses through 
the cytosol and triggers the release of Ca?* 
from the ER by binding to and opening 
special Ca?* channels in the ER membrane. 
The large electrochemical gradient for 
Ca?* across this membrane causes Ca?* 

to rush out of the ER and into the cytosol. 
Diacylglycerol remains in the plasma 
membrane and, together with Ca**, helps 
activate the enzyme protein kinase C (PKC), 
which is recruited from the cytosol to the 
cytosolic face of the plasma membrane 
(Movie 16.4). PKC then phosphorylates 

its own set of intracellular proteins, further 
propagating the signal. At the start of the 
pathway, both the a subunit and the By 
complex of the G protein Gg are involved in 
activating phospholipase C. 


QUESTION 16-5 


Why do you suppose cells have 
evolved intracellular Ca2* stores 
for signaling even though there is 
abundant extracellular Ca2*? 


Figure 16-24 Fertilization of an egg by 

a sperm triggers an increase in cytosolic 
Ca?* in the egg. This starfish egg was 
injected with a Ca?*-sensitive fluorescent dye 
before it was fertilized. When a sperm enters 
the egg, a wave of cytosolic Ca?* (red)— 
released from the ER—sweeps across the egg 
from the site of sperm entry (arrow). This Ca2* 
wave provokes a change in the egg surface, 
preventing entry of other sperm, and it also 
initiates embryonic development. To catch 
this Ca? wave, go to Movie 16.5. (Adapted 
from S. Stricker, Dev. Bio. 166:34—58, 1994.) 
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by membrane-embedded Ca?+ pumps that actively remove Ca?* from the 
cytosol, sending it either into the ER or across the plasma membrane and 
out of the cell. As a result, a steep electrochemical gradient of Ca?* exists 
across both the ER membrane and the plasma membrane (discussed in 
Chapter 12). When a signal transiently opens Ca?* channels in either of 
these membranes, Ca?* rushes down its electrochemical gradient into the 
cytosol, where it triggers changes in Ca?*-responsive proteins. The same 
Ca?+ pumps that normally operate to keep cytosolic Ca?* concentrations 
low also help to terminate the Ca?* signal. 


The effects of Ca% in the cytosol are largely indirect, in that they are medi- 
ated through the interaction of Ca?+ with various kinds of Ca**+-responsive 
proteins. The most widespread and common of these is calmodulin, 
which is present in the cytosol of all eukaryotic cells that have been 
examined, including those of plants, fungi, and protozoa. When Ca?+ 
binds to calmodulin, the protein undergoes a conformational change 
that enables it to interact with a wide range of target proteins in the cell, 
altering their activities (Figure 16-25). One particularly important class 
of targets for calmodulin is the Ca?+/calmodulin-dependent protein 
kinases (CaM-kinases). When these kinases are activated by binding to 
calmodulin complexed with Ca?+, they influence other processes in the 
cell by phosphorylating selected proteins. In the mammalian brain, for 
example, a neuron-specific CaM-kinase is abundant at synapses, where 
it is thought to play an important part in some forms of learning and 
memory. This CaM-kinase is activated by the pulses of Ca?* signals that 
occur during neural activity, and mutant mice that lack the kinase show 
a marked inability to remember where things are. 


A GPCR Signaling Pathway Generates a Dissolved Gas 
That Carries a Signal to Adjacent Cells 


Second messengers like cyclic AMP and calcium are hydrophilic mol- 
ecules that generally act within the cell where they are produced. But 
some molecules produced in response to GPCR activation are small 
enough or hydrophobic enough to pass across the membrane and carry 
a signal directly to nearby cells. An important example is the gas nitric 
oxide (NO), which acts as a signaling molecule in many tissues. NO dif- 
fuses readily from its site of synthesis and slips into neighboring cells. 
The distance the gas diffuses is limited by its reaction with oxygen and 
water in the extracellular environment, which converts NO into nitrates 
and nitrites within seconds. 


Ca2+ 


Figure 16-25 Calcium binding changes 
the shape of the calmodulin protein. 
(A) Calmodulin has a dumbbell shape, 
with two globular ends connected by a 
long a helix. Each of the globular ends 
has two Ca**-binding sites. (B) Simplified 
representation of the structure, showing 
the conformational changes that occur 
when Ca?*-bound calmodulin interacts COOH 
with an isolated segment of a target 
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peptide portion 
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a helix jackknifes to surround the target Sgi caM-kinase 


(Movie 16.6). (B, adapted from W.E. (B) 
Meador, A.R. Means, and F.A. Quiocho, Ca2* 

Science 257:1251-1255, 1992, and M. Ikura 

et al., Science 256:632-638, 1992.) (A) 
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(A) Figure 16-26 Nitric oxide (NO) triggers smooth muscle relaxation in a blood- 
smooth muscle cells basal lamina vessel wall. (A) Simplified drawing showing a cross section of a blood vessel with 
endothelial cells lining its lumen and smooth muscle cells surrounding the outside 
of the vessel. (B) The neurotransmitter acetylcholine causes the blood vessel 

to dilate by binding to a GPCR on the surface of the endothelial cells, thereby 
activating a G protein, Gg, to trigger Ca?* release (as illustrated in Figure 16-23). 
Ca?* activates nitric oxide synthase, stimulating the production of NO. NO then 
diffuses out of the endothelial cells and into adjacent smooth muscle cells, where 
it regulates the activity of specific proteins, causing the muscle cells to relax. One 
key target protein that can be activated by NO in smooth muscle cells is guanylyl 
cyclase, which catalyzes the production of cyclic GMP from GTP. Note that NO gas 
is highly toxic when inhaled and should not be confused with nitrous oxide (N20), 
also known as laughing gas. 
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Endothelial cells—the flattened cells that line every blood vessel—release 
NO in response to acetylcholine secreted by nearby nerve endings. 
Acetylcholine binds to a GPCR on the endothelial cell surface, resulting in 
activation of Gg and the release of Ca? inside the cell (see Figure 16-23). 
Ca% then stimulates nitric oxide synthase, which produces NO from the 
amino acid arginine. This NO diffuses into smooth muscle cells in the 
adjacent vessel wall, causing the cells to relax; this relaxation allows 
the vessel to dilate, so that blood flows through it more freely (Figure 
16-26). The effect of NO on blood vessels accounts for the action of nitro- 
glycerin, which has been used for almost 100 years to treat patients with 
angina—pain caused by inadequate blood flow to the heart muscle. In 
the body, nitroglycerin is converted to NO, which rapidly relaxes blood 
vessels, thereby reducing the workload on the heart and decreasing the 
muscle’s need for oxygen-rich blood. Many nerve cells also use NO to 
signal neighboring cells: NO released by nerve terminals in the penis, 
for instance, acts as a local mediator to trigger the blood-vessel dilation 
responsible for penile erection. 


Inside many target cells, NO binds to and activates the enzyme guanylyl 
cyclase, stimulating the formation of cyclic GMP from the nucleotide GTP 
(see Figure 16-26B). Cyclic GMP, a second messenger similar in structure 
to cyclic AMP, is a key link in the NO signaling chain. The drug Viagra 
enhances penile erection by blocking the enzyme that degrades cyclic 
GMP, prolonging the NO signal. 


GPCR-Triggered Intracellular Signaling Cascades Can 
Achieve Astonishing Speed, Sensitivity, and Adaptability 
The steps in the signaling cascades associated with GPCRs take a long time 


to describe, but they often take only seconds to execute. Consider how 
quickly a thrill can make your heart race (when epinephrine stimulates 
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the GPCRs in your cardiac pacemaker cells), or how fast the smell of food 
can make your mouth water (through the GPCRs for odors in your nose 
and the GPCRs for acetylcholine in salivary cells, which stimulate secre- 
tion). Among the fastest of all responses mediated by a GPCR, however, is 
the response of the eye to light: it takes only 20 msec for the most quickly 
responding photoreceptor cells of the retina (the cone photoreceptors, 
which are responsible for color vision in bright light) to produce their 
electrical response to a sudden flash of light. 


This exceptional speed is achieved in spite of the necessity to relay the 
signal over the multiple steps of an intracellular signaling cascade. But 
photoreceptors also provide a beautiful illustration of the advantages of 
intracellular signaling cascades: in particular, such cascades allow spec- 
tacular amplification of the incoming signal and also allow cells to adapt 
so as to be able to detect signals of widely varying intensity. The quantita- 
tive details have been most thoroughly analyzed for the rod photoreceptor 
cells in the eye, which are responsible for noncolor vision in dim light 
(Figure 16-27). In this photoreceptor cell, light is sensed by rhodopsin, 
a G-protein-coupled light receptor. Rhodopsin, when stimulated by light, 
activates a G protein called transducin. The activated a subunit of trans- 
ducin then activates an intracellular signaling cascade that causes cation 
channels to close in the plasma membrane of the photoreceptor cell. This 
produces a change in the voltage across the cell membrane, which alters 
neurotransmitter release and ultimately leads to a nerve impulse being 
sent to the brain. 


The signal is repeatedly amplified as it is relayed along this intracellular 
signaling pathway (Figure 16-28). When lighting conditions are dim, as 
on a moonless night, the amplification is enormous: as few as a dozen 
photons absorbed across the entire retina will cause a perceptible sig- 
nal to be delivered to the brain. In bright sunlight, when photons flood 
through each photoreceptor cell at a rate of billions per second, the 
signaling cascade undergoes a form of adaptation, stepping down the 
amplification more than 10,000-fold, so that the photoreceptor cells are 
not overwhelmed and can still register increases and decreases in the 
strong light. The adaptation depends on negative feedback: an intense 
response in the photoreceptor cell decreases the cytosolic Ca** concen- 
tration, inhibiting the enzymes responsible for signal amplification. 


Adaptation frequently occurs in intracellular signaling pathways that 
respond to extracellular signal molecules, allowing cells to respond to 
fluctuations in the concentration of such molecules regardless of whether 
they are present in small or large amounts. By taking advantage of posi- 
tive and negative feedback mechanisms (see Figure 16-10), adaptation 
thus allows a cell to respond equally well to the signaling equivalents of 
shouts and whispers. 


Figure 16-27 A rod photoreceptor cell from the retina is exquisitely 
sensitive to light. The light-absorbing rhodopsin proteins are embedded 
in many pancake-shaped vesicles (discs) of membrane inside the outer 
segment of the photoreceptor cell. When the rod cell is stimulated by light, 
a signal is relayed from the rhodopsin molecules in the discs, through the 
cytosol, to ion channels that allow positive ions to flow through the plasma 
membrane of the outer segment. These cation channels close in response 
to the cytosolic signal, producing a change in the membrane potential of 
the rod cell. By mechanisms similar to those that control neurotransmitter 
release in ordinary nerve cells, the change in membrane potential alters 
the rate of neurotransmitter release from the synaptic region of the cell. 
Released neurotransmitters then act on retinal nerve cells that pass the 
signal on to the brain. (From T.L. Lentz, Cell Fine Structure. Philadelphia: 
Saunders, 1971. With permission from Elsevier.) 


Figure 16-28 The light-induced signaling cascade in rod photoreceptor 
cells greatly amplifies the light signal. When rod photoreceptors 

are adapted for dim light, the signal amplification is enormous. The 
intracellular signaling pathway from the G protein transducin uses 
components that differ from the ones in previous figures. The cascade 
functions as follows. In the absence of a light signal, the second messenger 
molecule cyclic GMP is continuously produced by guanylyl cyclase in 

the cytosol of the photoreceptor cell. The cyclic GMP then binds to 

cation channels in the photoreceptor cell plasma membrane, keeping 
them open. Activation of rhodopsin by light results in the activation 

of transducin a subunits. These turn on an enzyme called cyclic GMP 
phosphodiesterase, which breaks down cyclic GMP to GMP (much as 
cyclic AMP phosphodiesterase breaks down cyclic AMP; see Figure 16-19). 
The sharp fall in the cytosolic concentration of cyclic GMP reduces the 
amount of cyclic GMP bound to the cation channels, which therefore close. 
Closing these channels decreases the influx of Na+, thereby altering the 
voltage gradient (membrane potential) across the plasma membrane and, 
ultimately, the rate of neurotransmitter release, as described in Chapter 

12. The red arrows indicate the steps at which amplification occurs, 

with the thickness of the arrow roughly indicating the magnitude of the 
amplification. 


Taste and smell also depend on GPCRs. It seems likely that this mech- 
anism of signal reception, invented early in evolution, has its origins 
in the basic and universal need of cells to sense and respond to their 
environment. Of course, GPCRs are not the only receptors that activate 
intracellular signaling cascades. We now turn to another major class of 
cell-surface receptors—enzyme-coupled receptors—which play a key 
part in controlling cell numbers, cell differentiation, and cell movement 
in multicellular animals, especially during development. 


ENZYME-COUPLED RECEPTORS 


Like GPCRs, enzyme-coupled receptors are transmembrane proteins 
that display their ligand-binding domains on the outer surface of the 
plasma membrane (see Figure 16-13C). Instead of associating with a G 
protein, however, the cytoplasmic domain of the receptor either acts as 
an enzyme itself or forms a complex with another protein that acts as an 
enzyme. Enzyme-coupled receptors were discovered through their role 
in responses to extracellular signal proteins that regulate the growth, 
proliferation, differentiation, and survival of cells in animal tissues (see 
Table 16-1, p. 536, for examples). Most of these signal proteins function 
as local mediators and can act at very low concentrations (about 107? to 
10-!! M). Responses to them are typically slow (on the order of hours), 
and their effects may require many intracellular transduction steps that 
usually lead to a change in gene expression. 


Enzyme-coupled receptors, however, can also mediate direct, rapid 
reconfigurations of the cytoskeleton, changing the cell’s shape and the 
way that it moves. The extracellular signals that induce such changes are 
often not diffusible signal proteins, but proteins attached to the surfaces 
over which a cell is crawling. 


The largest class of enzyme-coupled receptors consists of receptors with 
a cytoplasmic domain that functions as a tyrosine kinase, which phos- 
phorylates particular tyrosines on specific intracellular signaling proteins. 
These receptors, called receptor tyrosine kinases (RTKs), will be the 
main focus of this section. 


We begin with a discussion of how RTKs are activated in response to 
extracellular signals. We then consider how activated RTKs transmit the 
signal along two major intracellular signaling pathways that terminate at 
various effector proteins in the target cell. Finally, we describe how some 
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enzyme-coupled receptors bypass such intracellular signaling cascades 
and use a more direct mechanism to regulate gene transcription. 


Abnormal cell growth, proliferation, differentiation, survival, and migra- 
tion are fundamental features of a cancer cell, and abnormalities in 
signaling via RTKs and other enzyme-coupled receptors have a major 
role in the development of most cancers. 


Activated RTKs Recruit a Complex of Intracellular 
Signaling Proteins 


To do its job as a signal transducer, an enzyme-coupled receptor has to 
switch on the enzyme activity of its intracellular domain (or of an associ- 
ated enzyme) when an external signal molecule binds to its extracellular 
domain. Unlike GPCRs, enzyme-coupled receptor proteins usually have 
only one transmembrane segment, which spans the lipid bilayer as a 
single a helix. Because a single a helix is poorly suited to transmit a con- 
formational change across the bilayer, enzyme-coupled receptors have a 
different strategy for transducing the extracellular signal. In many cases, 
the binding of an extracellular signal molecule causes two receptor mol- 
ecules to come together in the plasma membrane, forming a dimer. This 
pairing brings the two intracellular tails of the receptors together and 
activates their kinase domains, such that each receptor tail phosphoryl- 
ates the other. In the case of RTKs, the phosphorylations occur on specific 
tyrosines. 


This tyrosine phosphorylation then triggers the assembly of a transient 
but elaborate intracellular signaling complex on the cytosolic tails of the 
receptors. The newly phosphorylated tyrosines serve as docking sites for 
a whole zoo of intracellular signaling proteins—perhaps as many as 10 
or 20 different molecules (Figure 16-29). Some of these proteins become 
phosphorylated and activated on binding to the receptors, and they then 
propagate the signal; others function solely as scaffolds, which couple 
the receptors to other signaling proteins, thereby helping to build the 
active signaling complex (see Figure 16-9). All of these docked intracel- 
lular signaling proteins possess a specialized interaction domain, which 
recognizes specific phosphorylated tyrosines on the receptor tails. Other 
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Figure 16-29 Activation of an RTK stimulates the assembly of an intracellular signaling complex. Typically, the binding of a signal 
molecule to the extracellular domain of an RTK causes two receptor molecules to associate into a dimer. The signal molecule shown here 
is itself a dimer and thus can physically cross-link two receptor molecules; other signal molecules induce a conformational change in the 
RTKs, causing the receptors to dimerize (not shown). In either case, dimer formation brings the kinase domain of each cytosolic receptor 
tail into contact with the other; this activates the kinases to phosphorylate the adjacent tail on several tyrosines. Each phosphorylated 
tyrosine serves as a specific docking site for a different intracellular signaling protein, which then helps relay the signal to the cell’s 
interior; these proteins contain a specialized interaction domain—in this case, a module called an SH2 domain—that recognizes and 
binds to specific phosphorylated tyrosines on the cytosolic tail of an activated RTK or on another intracellular signaling protein. 


interaction domains allow intracellular signaling proteins to recognize 
phosphorylated lipids that are produced on the cytosolic side of the 
plasma membrane in response to certain signals, as we discuss later. 


As long as they remain together, the signaling protein complexes assem- 
bled on the cytosolic tails of the RTKs can transmit a signal along several 
routes simultaneously to many destinations in the cell, thus activating 
and coordinating the numerous biochemical changes that are required to 
trigger a complex response such as cell proliferation or differentiation. To 
help terminate the response, the tyrosine phosphorylations are reversed 
by tyrosine phosphatases, which remove the phosphates that were added 
to the tyrosines of both the RTKs and other intracellular signaling pro- 
teins in response to the extracellular signal. In some cases, activated 
RTKs (as well as some GPCRs) are inactivated in a more brutal way: they 
are dragged into the interior of the cell by endocytosis and then destroyed 
by digestion in lysosomes (as discussed in Chapter 15). 


Different RTKs recruit different collections of intracellular signaling pro- 
teins, producing different effects; however, certain components are used 
by most RTKs. These include, for example, a phospholipase C that func- 
tions in the same way as the phospholipase C activated by GPCRs to 
trigger the inositol phospholipid signaling pathway discussed earlier (see 
Figure 16-23). Another intracellular signaling protein that is activated by 
almost all RTKs is a small GTP-binding protein called Ras, as we discuss 
next. 


Most RTKs Activate the Monomeric GTPase Ras 


As we have seen, activated RTKs recruit and activate many kinds of 
intracellular signaling proteins, leading to the formation of large signal- 
ing complexes on the cytosolic tail of the RTK. One of the key members 
of these signaling complexes is Ras—a small GTP-binding protein that 
is bound by a lipid tail to the cytosolic face of the plasma membrane. 
Virtually all RTKs activate Ras, including platelet-derived growth factor 
(PDGF) receptors, which mediate cell proliferation in wound healing, and 
nerve growth factor (NGF) receptors, which play an important part in the 
development of certain vertebrate neurons. 


The Ras protein is a member of a large family of small GTP-binding pro- 
teins, often called monomeric GTPases to distinguish them from the 
trimeric G proteins that we encountered earlier. Ras resembles the a sub- 
unit of a G protein and functions as a molecular switch in much the same 
way. It cycles between two distinct conformational states—active when 
GTP is bound and inactive when GDP is bound. Interaction with an acti- 
vating protein called Ras-GEF encourages Ras to exchange its GDP for 
GTP, thus switching Ras to its activated state (Figure 16-30); after a delay, 
Ras is switched off by a GAP called Ras-GAP (see Figure 16-12), which 
promotes the hydrolysis of its bound GTP to GDP (Movie 16.7). 
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Figure 16-30 RTKs activate Ras. An adaptor 
protein docks on a particular phosphotyrosine 
on the activated receptor (the other signaling 
proteins that would be bound to the receptor, 
as shown in Figure 16-29, have been 

omitted for simplicity). The adaptor recruits 

a Ras guanine nucleotide exchange factor 
(Ras-GEF) that stimulates Ras to exchange 

its bound GDP for GTP. The activated 

Ras protein can now stimulate several 
downstream signaling pathways, one of which 
is shown in Figure 16-31. Note that the Ras 
protein contains a covalently attached lipid 
group (red) that helps anchor the protein to 
the inside of the plasma membrane. 
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Figure 16-31 Ras activates a MAP-kinase 
signaling module. The Ras protein, activated 
by the process shown in Figure 16-30, activates 
a three-kinase signaling module, which relays 
the signal onward. The final kinase in the 
module, MAP kinase, phosphorylates various 
downstream signaling or effector proteins. 
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In its active state, Ras initiates a phosphorylation cascade in which a series 
of serine/threonine kinases phosphorylate and activate one another in 
sequence, like an intracellular game of dominoes. This relay system, which 
carries the signal from the plasma membrane to the nucleus, includes a 
three-kinase module called the MAP-kinase signaling module, in honor 
of the final enzyme in the chain, the mitogen-activated protein kinase, 
or MAP kinase. (As we discuss in Chapter 18, mitogens are extracellular 
signal molecules that stimulate cell proliferation.) In this pathway, out- 
lined in Figure 16-31, MAP kinase is phosphorylated and activated by 
an enzyme called, logically enough, MAP kinase kinase. This protein is 
itself switched on by a MAP kinase kinase kinase (which is activated by 
Ras). At the end of the MAP-kinase cascade, MAP kinase phosphorylates 
various effector proteins, including certain transcription regulators, alter- 
ing their ability to control gene transcription. The resulting change in the 
pattern of gene expression may stimulate cell proliferation, promote cell 
survival, or induce cell differentiation: the precise outcome will depend 
on which other genes are active in the cell and what other signals the 
cell receives. How biologists unravel such complex signaling pathways is 
discussed in How We Know, pp. 563-564. 


Before Ras was discovered in normal cells, a mutant form of the pro- 
tein was found in human cancer cells. The mutation inactivates the 
GTPase activity of Ras, so that the protein cannot shut itself off, promot- 
ing uncontrolled cell proliferation and the development of cancer. About 
30% of human cancers contain such activating mutations in a Ras gene; 
of the cancers that do not, many have mutations in genes that encode 
proteins that function in the same signaling pathway as Ras. Many of the 
genes that encode normal intracellular signaling proteins were initially 
identified in the hunt for cancer-promoting oncogenes (discussed in 
Chapter 20). 


RTKs Activate PI 3-Kinase to Produce Lipid Docking Sites 
in the Plasma Membrane 


Many of the extracellular signal proteins that stimulate animal cells to 
survive and grow, including signal proteins belonging to the insulin-like 
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Figure 16-32 Some RTKs activate the PI-3-kinase-Akt signaling pathway. 

An extracellular survival signal, such as IGF, activates an RTK, which recruits and 
activates PI 3-kinase. PI 3-kinase then phosphorylates an inositol phospholipid 
that is embedded in the cytosolic side of the plasma membrane. The resulting 
phosphorylated inositol phospholipid attracts intracellular signaling proteins that 
have a special domain that recognizes it. One of these signaling proteins, Akt, is 
a protein kinase that is activated at the membrane by phosphorylation mediated 
by two other protein kinases (here called protein kinases 1 and 2); protein kinase 
1 is also recruited by the phosphorylated lipid docking sites. Once activated, Akt 
is released from the plasma membrane and phosphorylates various downstream 
proteins on specific serines and threonines (not shown). 


growth factor (IGF) family, act through RTKs. One crucially important 
signaling pathway that these RTKs activate to promote cell growth and 
survival involves the enzyme phosphoinositide 3-kinase (PI 3-kinase), 
which phosphorylates inositol phospholipids in the plasma membrane. 
These phosphorylated lipids serve as docking sites for specific intracel- 
lular signaling proteins, which relocate from the cytosol to the plasma 
membrane, where they can activate one another. One of the most impor- 
tant of these relocated signaling proteins is the serine/threonine kinase 
Akt (Figure 16-32). 


Akt, also called protein kinase B (PKB), promotes the growth and survival 
of many cell types, often by inactivating the signaling proteins it phos- 
phorylates. For example, Akt phosphorylates and inactivates a cytosolic 
protein called Bad. In its active state, Bad encourages the cell to kill itself 
by indirectly activating a cell-suicide program called apoptosis (discussed 
in Chapter 18). Phosphorylation by Akt thus promotes cell survival by 
inactivating a protein that otherwise promotes cell death (Figure 16-33). 


In addition to promoting cell survival, the Pl-3-kinase-Akt signaling 
pathway stimulates cells to grow in size. It does so by indirectly activating 
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QUESTION 16-7 


Would you expect to activate 
RTKs by exposing the exterior of 
cells to antibodies that bind to the 
respective proteins? Would your 
answer be different for GPCRs? 
(Hint: review Panel 4-2, on 

pp. 140-141, regarding the 
properties of antibody molecules.) 


Figure 16-33 Activated Akt promotes 
cell survival. One way it does so is by 
phosphorylating and inactivating a protein 
called Bad. In its unphosphorylated 

state, Bad promotes apoptosis (a form of 
cell death) by binding to and inhibiting 

a protein, called Bcl2, which otherwise 
suppresses apoptosis. When Bad is 
phosphorylated by Akt, Bad releases Bcl2, 
which now blocks apoptosis, thereby 
promoting cell survival. 
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growth factor Figure 16-34 Akt stimulates cells to grow in size by activating 
s the serine/threonine kinase Tor. The binding of a growth factor to 

an RTK activates the Pl-3-kinase-Akt signaling pathway (as shown in 

plasma Figure 16-32). Akt then indirectly activates Tor by phosphorylating 

membrane and inhibiting a protein that helps to keep Tor shut down (not shown). 
Tor stimulates protein synthesis and inhibits protein degradation by 
phosphorylating key proteins in these processes (not shown). The 
anticancer drug rapamycin slows cell growth by inhibiting Tor. In fact, 
the Tor protein derives its name from the fact that it is a target of 
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t 

activatedAkt a large serine/threonine kinase called Tor. Tor stimulates cells to grow 
t both by enhancing protein synthesis and by inhibiting protein degrada- 
1 tion (Figure 16-34). The anticancer drug rapamycin works by inactivating 
t Tor, indicating the importance of this signaling pathway in regulating 


activated Tor p A ; x 5 
S cell growth and survival—and the consequences of its disregulation in 
cancer. 


inhibition of stimulation 


protein of protein The main intracellular signaling cascades activated by GPCRs and RTKs 
degradation synthesis . ‘ š f 

are summarized in Figure 16-35. As dauntingly complex as such path- 

ways may seem, the complexity of cell signaling is actually much greater 

still. First, we have not discussed all of the intracellular signaling path- 


ways that operate in cells. Second, although we depict these signaling 
pathways as being relatively linear and self-contained, they do not 
operate entirely independently. We will return to this concept of signal 
integration at the chapter's conclusion. But first, we take a brief detour to 
introduce a few important types of signaling systems that we have thus 
far overlooked. 
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differs from the others, yet they PKA CaM-kinase PKC MAP kinase Akt kinase 
use some common components to 
transmit their signals. Because all five 
eventually activate protein kinases 
(gray boxes), it seems that each is 
capable in principle of regulating 
practically any process in the cell. 
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UNTANGLING CELL SIGNALING PATHWAYS 


Intracellular signaling pathways are never mapped 
out in a single experiment. Although insulin was first 
isolated from dog pancreas in the early 1920s, the 
molecular chain of events that links the binding of insu- 
lin to its receptor with the activation of the transporter 
proteins that take up glucose has taken decades to 
untangle—and is still not completely understood. 


Instead, investigators figure out, piece by piece, how all 
the links in the chain fit together—and how each con- 
tributes to the cell’s response to an extracellular signal 
molecule such as the hormone insulin. Here, we discuss 
the kinds of experiments that allow scientists to identify 
individual links and, ultimately, to piece together com- 
plex signaling pathways. 


Close encounters 


Most signaling pathways depend on proteins that phys- 
ically interact with one another. There are several ways 
to detect such direct contact. One involves using a pro- 
tein as “bait.” For example, to isolate the receptor that 
binds to insulin, one could attach insulin to a chroma- 
tography column. Cells that respond to the hormone are 
broken open with detergents that disrupt their mem- 
branes, releasing the transmembrane receptor proteins 
(see Figure 11-27). When this slurry is poured over the 
chromatography column, the proteins that bind to insu- 
lin will stick and can later be eluted and identified (see 
Figure 4-55). 
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Protein-protein interactions in a signaling pathway can 
also be identified by co-immunoprecipitation. For exam- 
ple, cells exposed to an extracellular signal molecule 
can be broken open, and antibodies can be used to 
grab the receptor protein known to recognize the signal 
molecule (see Panel 4-2, pp. 140-141, and Panel 4-3, 
pp. 164-165). If the receptor is strongly associated with 
other proteins, as shown in Figure 16-29, these will be 
captured as well. In this way, researchers can identify 
which proteins interact when an extracellular signal 
molecule stimulates cells. 


Once two proteins are known to bind to each other, 
an investigator can pinpoint which parts of the pro- 
teins are required for the interaction using the DNA 
technology discussed in Chapter 10. For example, to 
determine which phosphorylated tyrosine on a receptor 
tyrosine kinase (RTK) is recognized by a certain intra- 
cellular signaling protein, a series of mutant receptors 
can be constructed, each missing a different tyrosine 
from its cytoplasmic domain (Figure 16-36). In this 
way, the specific tyrosines required for binding can be 
determined. Similarly, one can determine whether this 
phosphotyrosine docking site is required for the recep- 
tor to transmit a signal to the cell. 


Jamming the pathway 


Ultimately, one wants to assess what role a particular 
protein plays in a signaling pathway. A first test may 
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Figure 16-36 Mutant proteins can 

help to determine exactly where an 
intracellular signaling molecule binds. As 
shown in Figure 16-29, on binding their 
extracellular signal molecule, a pair of RTKs 
come together and phosphorylate specific 
tyrosines on each other's cytoplasmic 

tails. These phosphorylated tyrosines bind 
different intracellular signaling proteins, 
which then become activated and pass on 
the signal. To determine which tyrosine 
binds to a specific intracellular signaling 
protein, a series of mutant receptors 

is constructed. In the mutants shown, 
tyrosines Tyr2 or Tyr3 have been replaced, 
one at a time, by phenylalanine (red), 
thereby preventing phosphorylation at 
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The effect on the cell’s response to the 
signal can then be determined. It is 
important that the mutant receptor is tested 
in a cell that does not have its own normal 
receptors for the signal molecule. 
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involve using DNA technology to introduce into cells a 
gene encoding a constantly active form of the protein, 
to see if this mimics the effect of the extracellular signal 
molecule. Consider Ras, for example. The mutant form 
of Ras involved in human cancers is constantly active 
because it has lost its ability to hydrolyze the bound 
GTP that keeps the Ras protein switched on. This con- 
tinuously active form of Ras can stimulate some cells to 
proliferate, even in the absence of a proliferation signal. 


Conversely, the activity of a specific signaling protein 
can be inhibited or eliminated. In the case of Ras, for 
example, one could shut down the expression of the 
Ras gene in cells by RNA interference or CRISPR (see 
Figure 10-31). Such cells do not proliferate in response 
to extracellular mitogens, indicating the importance of 
normal Ras signaling in the proliferative response. 


Making mutants 


Another powerful strategy that scientists use to deter- 
mine which proteins participate in cell signaling 
involves screening tens of thousands of animals—fruit 
flies or nematode worms, for example (discussed in 
Chapter 19)—to search for mutants in which a signaling 


pathway is not functioning properly. By examining 
enough mutant animals, many of the genes that encode 
the proteins involved in a signaling pathway can be 
identified. 


Such classical genetic screens can also help sort out 
the order in which intracellular signaling proteins act 
in a pathway. Suppose that a genetic screen uncovers 
a pair of new proteins, X and Y, involved in the Ras sig- 
naling pathway. To determine whether these proteins 
lie upstream or downstream of Ras, one could create 
cells that express an inactive, mutant form of each pro- 
tein, and then ask whether these mutant cells can be 
“rescued” by the addition of a continuously active 
form of Ras. If the constantly active Ras overcomes 
the blockage created by the mutant protein, the pro- 
tein must operate upstream of Ras in the pathway 
(Figure 16-37A). However, if Ras operates upstream of 
the protein, a constantly active Ras would be unable 
to transmit a signal past the obstruction caused by the 
disabled protein (Figure 16-37B). Through such experi- 
ments, even the most complex intracellular signaling 
pathways can be mapped out, one step at a time (Figure 
16-37C). 


A SIGNALING PATHWAY IS FOUND TO INVOLVE THREE PROTEINS: Ras, PROTEIN X, AND PROTEIN Y 
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Figure 16-37 The use of mutant cell lines and an overactive form of Ras can help dissect an intracellular signaling pathway. 

In this hypothetical pathway, Ras, protein X, and protein Y are required for proper signaling. (A) In cells in which protein X has been 
inactivated, signaling does not occur. However, this signaling blockage can be overcome by the addition of an overactive form 

of Ras, such that the pathway is active even in the absence of the extracellular signal molecule. This result indicates that Ras acts 
downstream of protein X in the pathway. (B) Signaling is also disrupted in cells in which protein Y has been inactivated. In this case, 
introduction of an overactive Ras does not restore normal signaling, indicating that protein Y operates downstream of Ras. (C) Based 
on these results, the deduced order of the signaling pathway is shown. 
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Some Receptors Activate a Fast Track to the Nucleus 


Not all receptors trigger complex signaling cascades that use multiple 
components to carry a message to the nucleus. Some take a more direct 
route to control gene expression. One such receptor is the protein Notch. 
Notch is a crucially important receptor in all animals, both during devel- 
opment and in adults. Among other things, it controls the development 
of neural cells in Drosophila (Figure 16-38). 


In this simple signaling pathway, the receptor itself acts as a transcrip- 
tion regulator. When activated by the binding of Delta, a transmembrane 
signal protein on the surface of a neighboring cell, the Notch receptor is 
cleaved. This cleavage releases the cytosolic tail of the receptor, which is 
then free to move to the nucleus, where it helps to activate the appropri- 
ate set of Notch-responsive genes (Figure 16-39). 


Some Extracellular Signal Molecules Cross the Plasma 
Membrane and Bind to Intracellular Receptors 


Another direct route to the nucleus is taken by extracellular signal mol- 
ecules that rely on intracellular receptor proteins (see Figure 16-4B). 
These molecules include the steroid hormones—cortisol, estradiol, and 
testosterone—and the thyroid hormones such as thyroxine (Figure 16-40). 
All of these hydrophobic molecules pass through the plasma membrane 
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Figure 16-38 Notch signaling controls 
nerve-cell production in the fruit fly 
Drosophila. The fly nervous system 
originates in the embryo from a sheet of 
epithelial cells. Isolated cells in this sheet 
begin to specialize as neurons (blue), while 
their neighbors remain non-neuronal and 
maintain the structure of the epithelial 
sheet. The signals that control this process 
are transmitted via direct cell-cell contacts: 
each future neuron delivers an inhibitory 
signal to the cells next to it, deterring them 
from specializing as neurons too—a process 
called lateral inhibition. Both the signal 
molecule (Delta) and the receptor molecule 
(Notch) are transmembrane proteins, and 
the pathway represents a form of contact- 
dependent signaling (see Figure 16—-3D). 


Figure 16-39 The Notch receptor 

itself is a transcription regulator. When 
the membrane-bound signal protein 

Delta binds to its receptor, Notch, on a 
neighboring cell, the receptor is cleaved 

by a protease. The released part of the 
cytosolic tail of Notch migrates to the 
nucleus, where it activates Notch-responsive 
genes. One consequence of this signaling 
process is shown in Figure 16-38. 
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Figure 16-40 Some small, hydrophobic 
hormones bind to intracellular receptors 
that act as transcription regulators. 
Although these signal molecules differ in 
their chemical structures and functions, they 
all act by binding to intracellular receptor 
proteins that act as transcription regulators. 
Their receptors are not identical, but they 
are evolutionarily related, belonging to the 
nuclear receptor superfamily. The sites of 
origin and functions of these hormones are 
given in Table 16-1 (p. 536). 


Figure 16-41 The steroid hormone 
cortisol acts by activating a transcription 
regulator. Cortisol is one of the hormones 
produced by the adrenal glands in response 
to stress. It crosses the plasma membrane 
of a target cell and binds to its receptor 
protein, which is located in the cytosol. 

The receptor-hormone complex is then 
transported into the nucleus via the 

nuclear pores. Cortisol binding activates 
the receptor protein, which is then able to 
bind to specific regulatory sequences in 
DNA and activate (or repress, not shown) 
the transcription of specific target genes. 
Whereas the receptors for cortisol and some 
other steroid hormones are located in the 
cytosol, those for other steroid hormones 
and for thyroid hormones are already bound 
to DNA in the nucleus even in the absence 
of hormone. 
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of the target cell and bind to receptor proteins located in either the cyto- 
sol or the nucleus. Regardless of their initial location, these intracellular 
receptor proteins are referred to as nuclear receptors because, when 
activated by hormone binding, they enter the nucleus, where they regu- 
late the transcription of genes. In unstimulated cells, nuclear receptors 
are typically present in an inactive form. When a hormone binds, the 
receptor undergoes a large conformational change that activates the pro- 
tein, allowing it to promote or inhibit the transcription of specific target 
genes (Figure 16-41). Each hormone binds to a different nuclear recep- 
tor, and each receptor acts at a different set of regulatory sites in DNA 
(discussed in Chapter 8). Moreover, a given hormone usually regulates 
different sets of genes in different cell types, thereby evoking different 
physiological responses in different target cells. 


Nuclear receptors and the hormones that activate them have essential 
roles in human physiology (see Table 16-1, p. 536). Loss of these signal- 
ing systems can have dramatic consequences, as illustrated by the effects 
of mutations that eliminate the receptor for the male sex hormone tes- 
tosterone. Testosterone in humans shapes the formation of the external 
genitalia and influences brain development in the fetus; at puberty, the 
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hormone triggers the development of male secondary sexual character- 
istics. Some very rare individuals are genetically male—that is, they have 
both an X and a Y chromosome—but lack the testosterone receptor as a 
result of a mutation in the corresponding gene; thus, they make testos- 
terone, but their cells cannot respond to it. As a result, these individuals 
develop as females, which is the path that sexual and brain develop- 
ment would take if no male or female hormones were produced. Such a 
sex reversal demonstrates the crucial role of the testosterone receptor in 
sexual development, and it also shows that the receptor is required not 
just in one cell type to mediate one effect of testosterone, but in many cell 
types to help produce the whole range of features that distinguish men 
from women. 


Plants Make Use of Receptors and Signaling Strategies 
That Differ from Those Used by Animals 


Plants and animals have been evolving independently for more than a bil- 
lion years, the last common ancestor being a single-celled eukaryote that 
most likely lived on its own. Because these kingdoms diverged so long 
ago—when it was still “every cell for itself’—each has evolved its own 
molecular solutions to the complex problem of becoming multicellular. 
Thus the mechanisms for cell-cell communication in plants and animals 
are in some ways quite different. At the same time, however, plants and 
animals started with a common set of eukaryotic genes—including some 
used by single-celled organisms to communicate among themselves—so 
their signaling systems also show some similarities. 


Like animals, plants make extensive use of transmembrane cell-surface 
receptors—especially enzyme-coupled receptors. The spindly weed 
Arabidopsis thaliana (see Figure 1-33) has hundreds of genes encoding 
receptor serine/threonine kinases. These are, however, structurally 
distinct from the receptor serine/threonine kinases found in animal cells 
(which we do not discuss in this chapter). The plant receptors are thought 
to play an important part in a large variety of cell signaling processes, 
including those governing plant growth, development, and disease 
resistance. In contrast to animal cells, plant cells seem not to use RTKs, 
steroid-hormone-type nuclear receptors, or cyclic AMP, and they seem to 
use few GPCRs. 


One of the best-studied signaling systems in plants mediates the response 
of cells to ethylene—a gaseous hormone that regulates a diverse array 
of developmental processes, including seed germination and fruit ripen- 
ing. Tomato growers use ethylene to ripen their fruit, even after it has 
been picked. Although ethylene receptors are not evolutionarily related 
to any of the classes of receptor proteins that we have discussed so far, 
they function as enzyme-coupled receptors. Surprisingly, it is the empty 
receptor that is active: in the absence of ethylene, the empty recep- 
tor activates an associated protein kinase that ultimately shuts off the 
ethylene-responsive genes in the nucleus; when ethylene is present, the 
receptor and kinase are inactive, and the ethylene-responsive genes are 
transcribed (Figure 16-42). This strategy, whereby signals act to relieve 
transcriptional inhibition, is commonly used in plants. 


Protein Kinase Networks Integrate Information to 
Control Complex Cell Behaviors 


Whether part of a plant or an animal, a cell receives messages from many 
sources, and it must integrate this information to generate an appropri- 
ate response: to live or die, to divide, to differentiate, to change shape, to 
move, to send out a chemical message of its own, and so on (see Figure 
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Figure 16-42 The ethylene signaling 
pathway turns on genes by relieving 
inhibition. (A) In the absence of ethylene, 
the receptor directly activates an associated 
protein kinase, which then indirectly 
promotes the destruction of the transcription 
regulator that switches on ethylene- 
responsive genes. As a result, the genes 
remain turned off. (B) In the presence 

of ethylene, the receptor and kinase 

are both inactive, and the transcription 
regulator remains intact and stimulates the 
transcription of the ethylene-responsive 
genes. The kinase that ethylene receptors 
interact with is a serine/threonine kinase that 
is closely related to the MAP kinase kinase 
kinase found in animal cells (see Figure 
16-31). Note that the ethylene receptor 

is located in the endoplasmic reticulum; 
because ethylene is hydrophobic, it passes 
easily into the cell interior to reach its 
receptor. 
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16-6, Movie 16.8, and Movie 16.9). This integration is made possible 
by connections and interactions that occur between different signal- 
ing pathways. Such cross-talk allows the cell to bring together multiple 
streams of information and react to a rich combination of signals. 


The most extensive links among the pathways are mediated by the pro- 
tein kinases present in each. These kinases often phosphorylate, and 
hence regulate, components in other signaling pathways, in addition to 
components in their own pathway (see Figure 16-35). To give an idea 
of the scale of the complexity, genome sequencing studies suggest that 
about 2% of our ~19,000 protein-coding genes code for protein kinases; 
moreover, hundreds of distinct types of protein kinases are thought to be 
present in a single mammalian cell. 


Many intracellular signaling proteins have several potential phosphoryla- 
tion sites, each of which can be phosphorylated by a different protein 
kinase. These proteins can thus act as integrating devices. Information 
received from different intracellular signaling pathways can converge on 
such proteins, which then convert a multicomponent input to a single 
outgoing signal (Figure 16-43, and see Figure 16-9). These integrating 
proteins, in turn, can deliver a signal to many downstream targets. In this 
way, the intracellular signaling system may act like a network of nerve 
cells in the brain—or like a collection of microprocessors in a computer— 
interpreting complex information and generating complex responses. 


Our understanding of these intricate networks is still evolving: we are still 
discovering new links in the chains, new signaling partners, new con- 
nections, and even new pathways. Unraveling the intracellular signaling 
pathways—in both animals and plants—is one of the most active areas 
of research in cell biology, and new discoveries are being made every 
day. Genome sequencing projects continue to provide long lists of com- 
ponents involved in signal transduction in a large variety of organisms. 
Yet even if we could identify every single component in this elaborate 
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network of signaling pathways, it will remain a major challenge to figure 
out exactly how they all work together to allow cells—and organisms—to 
integrate the diverse array of information that inundates them constantly 
and to respond in a way that enhances their ability to adapt and survive. 


ESSENTIAL CONCEPTS 


Cells in multicellular organisms communicate through a huge variety 
of extracellular chemical signals. 


In animals, hormones are carried in the blood to distant target cells, 
but most other extracellular signal molecules act over only a short 
distance. Neighboring cells often communicate through direct cell- 
cell contact. 


For an extracellular signal molecule to influence a target cell it must 
interact with a receptor protein on or in the target cell. Each receptor 
protein recognizes a particular signal molecule. 


Most extracellular signal molecules bind to cell-surface receptor 
proteins that convert (transduce) the extracellular signal into differ- 
ent intracellular signals, which are usually organized into signaling 
pathways. 


There are three main classes of cell-surface receptors: (1) ion- 
channel-coupled receptors, (2) G-protein-coupled receptors (GPCRs), 
and (3) enzyme-coupled receptors. 


GPCRs and enzyme-coupled receptors respond to extracellular sig- 
nals by activating one or more intracellular signaling pathways, 
which, in turn, activate effector proteins that alter the behavior of 
the cell. 


Turning off signaling pathways is as important as turning them on. 
Each activated component in a signaling pathway must be subse- 
quently inactivated or removed for the pathway to function again. 


GPCRs activate trimeric GTP-binding proteins called G proteins; these 
act as molecular switches, transmitting the signal onward for a short 
period before switching themselves off by hydrolyzing their bound 
GTP to GDP. 


G proteins directly regulate ion channels or enzymes in the plasma 
membrane. Some directly activate (or inactivate) the enzyme adenylyl 
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Figure 16-43 Intracellular signaling 
proteins serve to integrate incoming 
signals. Extracellular signals A, B, C, and D 
activate different receptors in the plasma 
membrane. The receptors act upon two 
protein kinases, which they either activate 
(black arrow) or inhibit (red crossbar). The 
kinases phosphorylate the same target 
protein and, when it is fully phosphorylated, 
this target protein triggers a cell response. 

It can be seen that signal molecule B 
activates both protein kinases and therefore 
produces a strong output response. Signals 
A and D each activate a different kinase and 
therefore produce a response only if they 
are simultaneously present. Signal molecule 
C inhibits the cell response and will 
compete with the other signal molecules. 
The net outcome will depend both on the 
numbers of signaling molecules and the 
strengths of their connections. In a real cell, 
these parameters would be determined by 
evolution. 
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cyclase, which increases (or decreases) the intracellular concentra- 
tion of the second messenger molecule cyclic AMP; others directly 
activate the enzyme phospholipase C, which generates the second 
messenger molecules inositol trisphosphate (IP3) and diacylglycerol. 


IP3 opens Ca?* channels in the membrane of the endoplasmic reticu- 
lum, releasing a flood of free Ca?* ions into the cytosol. The Ca?* itself 
acts as a second messenger, altering the activity of a wide range of 
Ca?+-responsive proteins. These include calmodulin, which activates 
various target proteins such as Ca?*/calmodulin-dependent protein 
kinases (CaM-kinases). 


A rise in cyclic AMP activates protein kinase A (PKA), while Ca?+ and 
diacylglycerol in combination activate protein kinase C (PKC). 


PKA, PKC, and CaM-kinases phosphorylate selected signaling and 
effector proteins on serines and threonines, thereby altering their 
activity. Different cell types contain different sets of signaling and 
effector proteins and are therefore affected in different ways. 


Enzyme-coupled receptors have intracellular protein domains that 
function as enzymes or are associated with intracellular enzymes. 
Many enzyme-coupled receptors are receptor tyrosine kinases 
(RTKs), which phosphorylate themselves and selected intracellular 
signaling proteins on tyrosines. The phosphotyrosines on RTKs then 
serve as docking sites for various intracellular signaling proteins. 


Most RTKs activate the monomeric GTPase Ras, which, in turn, acti- 
vates a three-protein MAP-kinase signaling module that helps relay 
the signal from the plasma membrane to the nucleus. 


Ras mutations stimulate cell proliferation by keeping Ras (and, con- 
sequently, the Ras-MAP kinase signaling pathway) constantly active 
and are a common feature of many human cancers. 


Some RTKs stimulate cell growth and cell survival by activating 
PI 3-kinase, which phosphorylates specific inositol phospholipids in 
the cytosolic leaflet of the plasma membrane lipid bilayer. This ino- 
sitol phosphorylation creates lipid docking sites that attract specific 
signaling proteins from the cytosol, including the protein kinase Akt, 
which becomes active and relays the signal onward. 


Other receptors, such as Notch, have a direct pathway to the nucleus. 
When activated, part of the receptor migrates from the plasma mem- 
brane to the nucleus, where it regulates the transcription of specific 
genes. 


Some extracellular signal molecules, such as steroid hormones and 
nitric oxide, are small or hydrophobic enough to cross the plasma 
membrane and activate intracellular proteins, which are usually 
either transcription regulators or enzymes. 


Plants, like animals, use enzyme-coupled cell-surface receptors to 
recognize the extracellular signal molecules that control their growth 
and development; these receptors often act by relieving the tran- 
scriptional repression of specific genes. 


Different intracellular signaling pathways interact, enabling each cell 
type to produce the appropriate response to a combination of extra- 
cellular signals. In the absence of such signals, most animal cells 
have been programmed to kill themselves by undergoing apoptosis. 


We are far from understanding how a cell integrates all of the many 
extracellular signals that bombard it to generate an appropriate 
response. 
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adaptation 

adenylyl cyclase 

Ca?*/calmodulin-dependent 
protein kinase (CaM-kinase) 


hormone 


(IP3) 


GTP-binding protein 


inositol 1,4,5-trisphosphate 


phosphoinositide 3-kinase 
(PI 3-kinase) 

phospholipase C 

protein kinase 


calmodulin inositol phospholipid protein kinase C (PKC) 
cell signaling intracellular signaling pathway protein phosphatase 
cyclic AMP ion-channel-coupled receptor Ras 
cyclic-AMP-dependent local mediator receptor 


protein kinase (PKA) MAP kinase 
diacylglycerol (DAG) 
enzyme-coupled receptor 


extracellular signal molecule 


molecular switch 


G protein neurotransmitter 
G-protein-coupled receptor nitric oxide (NO) 
(GPCR) nuclear receptor 


QUESTIONS 


QUESTION 16-8 


Which of the following statements are correct? Explain your 
answers. 


A. The extracellular signal molecule acetylcholine has 
different effects on different cell types in an animal and 
often binds to different cell-surface receptor molecules on 
different cell types. 


B. After acetylcholine is secreted from cells, it is long-lived, 
because it has to reach target cells all over the body. 


C. Both the GTP-bound a subunits and nucleotide-free 
By complexes—but not GDP-bound, fully assembled 

G proteins—can activate other molecules downstream of 
GPCRs. 


D. IP3 is produced directly by cleavage of an inositol 
phospholipid without incorporation of an additional 
phosphate group. 

E. Calmodulin regulates the intracellular Ca2* 
concentration. 


F. Different signals originating from the plasma membrane 
can be integrated by cross-talk between different signaling 
pathways inside the cell. 


G. Tyrosine phosphorylation serves to build binding sites 
for other proteins to bind to RTKs. 


QUESTION 16-9 


The Ras protein functions as a molecular switch that is set 
to its “on” state by other proteins that cause it to release 
its bound GDP and bind GTP. A GTPase-activating protein 


MAP-kinase signaling module 


monomeric GTPase 


receptor serine/threonine kinase 
receptor tyrosine kinase (RTK) 
serine/threonine kinase 

signal transduction 

steroid hormone 

tyrosine kinase 


helps reset the switch to the “off” state by inducing Ras 

to hydrolyze its bound GTP to GDP much more rapidly 
than it would without this encouragement. Thus, Ras works 
like a light switch that one person turns on and another 
turns off. You are studying a mutant cell that lacks the 
GTPase-activating protein. What abnormalities would you 
expect to find in the way in which Ras activity responds to 
extracellular signals? 


QUESTION 16-10 


A. Compare and contrast signaling by neurons, which 
secrete neurotransmitters at synapses, with signaling carried 
out by endocrine cells, which secrete hormones into the 
blood. 


B. Discuss the relative advantages of the two mechanisms. 


QUESTION 16-11 


Two intracellular molecules, X and Y, are both normally 
synthesized at a constant rate of 1000 molecules per second 
per cell. Molecule X is broken down slowly: each molecule 
of X survives on average for 100 seconds. Molecule Y is 
broken down 10 times faster: each molecule of Y survives 
on average for 10 seconds. 


A. Calculate how many molecules of X and Y the cell 
contains at any time. 


B. If the rates of synthesis of both X and Y are suddenly 
increased tenfold to 10,000 molecules per second per cell— 
without any change in their degradation rates—how many 
molecules of X and Y will there be after one second? 


C. Which molecule would be preferred for rapid signaling? 


572 CHAPTER 16 Cell Signaling 


QUESTION 16-12 


In a series of experiments, genes that code for mutant 
forms of an RTK are introduced into cells. The cells also 
express their own normal form of the receptor from their 
normal gene, although the mutant genes are constructed 
so that the mutant RTK is expressed at considerably higher 
concentration than the normal RTK. What would be the 
consequences of introducing a mutant gene that codes for 
an RTK (A) lacking its extracellular domain, or (B) lacking its 
intracellular domain? 


QUESTION 16-13 


Discuss the following statement: “Membrane proteins 

that span the membrane many times can undergo a 
conformational change upon ligand binding that can be 
sensed on the other side of the membrane. Thus, individual 
protein molecules can transmit a signal across a membrane. 
In contrast, individual single-span membrane proteins 
cannot transmit a conformational change across the 
membrane but require oligomerization.” 


QUESTION 16-14 


What are the similarities and differences between the 
reactions that lead to the activation of G proteins and the 
reactions that lead to the activation of Ras? 


QUESTION 16-15 


Why do you suppose cells use Ca** (which is kept by 

Ca** pumps at a cytosolic concentration of 10-7 M) for 
intracellular signaling and not another ion such as Na* 
(which is kept by the Na* pump at a cytosolic concentration 
of 10-3 M)? 


QUESTION 16-16 


It seems counterintuitive that a cell, having a perfectly 
abundant supply of nutrients available, would commit 
suicide if not constantly stimulated by signals from other 
cells (see Figure 16-6). What do you suppose might be the 
advantages of such regulation? 


QUESTION 16-17 


The contraction of the myosin-actin system in cardiac 
muscle cells is triggered by a rise in intracellular Ca?*. 
Cardiac muscle cells have specialized Ca?* channels—called 
ryanodine receptors because of their sensitivity to the 
drug ryanodine—that are embedded in the membrane 
of the sarcoplasmic reticulum, a specialized form of the 
endoplasmic reticulum. In contrast to the IP3-gated Ca?+ 
channels in the endoplasmic reticulum shown in Figure 
16-23, the signaling molecule that opens ryanodine 
receptors is Ca** itself. Discuss the consequences of 
this feature of ryanodine channels for cardiac muscle cell 
contraction. 


QUESTION 16-18 


Two protein kinases, K1 and K2, function in an intracellular 
signaling pathway. If either kinase contains a mutation that 
permanently inactivates its function, no response is seen 

in cells when an extracellular signal is received. A different 
mutation in K1 makes it permanently active, so that in cells 
containing that mutation, a response is observed even in 
the absence of an extracellular signal. You characterize a 
double-mutant cell that contains K2 with the inactivating 
mutation and K1 with the activating mutation. You observe 
that the response is seen even in the absence of an 
extracellular signal. In the normal signaling pathway, does 
K1 activate K2 or does K2 activate K1? Explain your answer. 


QUESTION 16-19 


A. Trace the steps of a long and indirect signaling pathway 
from a cell-surface receptor to a change in gene expression 
in the nucleus. 


B. Compare this pathway with an example of a short and 
direct pathway from the cell surface to the nucleus. 


QUESTION 16-20 


How does PI 3-kinase activate the Akt kinase after activation 
of an RTK? 


QUESTION 16-21 


Consider the structure of cholesterol, a small, hydrophobic 
molecule with a sterol backbone similar to that of three of 
the hormones shown in Figure 16—40, but possessing fewer 
polar groups such as -OH, =O, and -COO-. If cholesterol 
were not normally found in cell membranes, could it be 
used effectively as a hormone if an appropriate intracellular 
receptor evolved? 


QUESTION 16-22 


The signaling mechanisms used by a steroid-hormone-type 
nuclear receptor and by an ion-channel-coupled receptor 
are relatively simple as they have few components. Can they 
lead to an amplification of the initial signal, and, if so, how? 


QUESTION 16-23 


If some cell-surface receptors, including Notch, can rapidly 
signal to the nucleus by activating latent transcription 
regulators at the plasma membrane, why do most cell- 
surface receptors use long, indirect signaling cascades to 
influence gene transcription in the nucleus? 


QUESTION 16-24 


Animal cells and plant cells have some very different 
intracellular signaling mechanisms but also share some 
common mechanisms. Why do you think this is so? 
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Cytoskeleton 


The ability of eukaryotic cells to organize the many components in INTERMEDIATE FILAMENTS 
their interior, adopt a variety of shapes, interact mechanically with the 
environment, and carry out coordinated movements depends on the 
cytoskeleton—an intricate network of protein filaments that extends 
throughout the cytoplasm ( ). This filamentous architecture 
helps to support the large volume of cytoplasm, a function that is particu- ACTIN FILAMENTS 

larly important in animal cells, which have no cell walls. Although some 

cytoskeletal components are present in bacteria, the cytoskeleton is most 

prominent in the large and structurally complex eukaryotic cell. MUSCLE CONTRACTION 


MICROTUBULES 


Unlike our own bony skeleton, however, the cytoskeleton is a highly 
dynamic structure that is continuously reorganized as a cell changes 
shape, divides, and responds to its environment. The cytoskeleton is not 
only the “bones” of a cell but its “muscles” too, and it is directly respon- 
sible for large-scale movements, including the crawling of cells along a 
surface, the contraction of muscle cells, and the changes in cell shape 
that take place as an embryo develops. Without the cytoskeleton, wounds 
would never heal, muscles would not contract, and sperm would never 
reach the egg. 


While the interior of the eukaryotic cell is dynamic, it is also highly organ- 
ized, with organelles that carry out specialized functions concentrated in 
different areas and linked by transport systems (discussed in Chapter 15). 
It is the cytoskeleton that controls the location of the organelles and pro- 
vides the machinery for transport between them. A cytoskeletal machine 
is also responsible for the segregation of chromosomes into two daugh- 
ter cells at cell division and for pinching apart those two new cells, as we 
discuss in Chapter 18. 
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Figure 17-1 The cytoskeleton gives a cell its shape and allows the 
cell to organize its internal components and to move. An animal 
cell in culture has been labeled to show two of its major cytoskeletal 
systems, the microtubules (green) and the actin filaments (red). Where 
the two filaments overlap, they appear yellow. The DNA in the nucleus 
is labeled in blue. (Courtesy of Albert Tousson.) 


The cytoskeleton is built on a framework of three types of protein fila- 
ments: intermediate filaments, microtubules, and actin filaments. Each type 
of filament has distinct mechanical properties and is formed from a dif- 
ferent protein subunit. A family of fibrous proteins forms the intermediate 
filaments; globular tubulin subunits form microtubules; and globular actin 
subunits form actin filaments (Figure 17-2). In each case, thousands of 
these subunits assemble into fine threads that sometimes extend across 
the entire cell. 


10 um 


In this chapter, we consider the structure and function of each of these 
protein filament networks. We begin with intermediate filaments, which 
provide cells with mechanical strength. We then see how microtubules 
organize the cytoplasm of eukaryotic cells and form the hairlike, motile 
appendages that enable cells like protozoa and sperm to swim. We next 
consider how the actin cytoskeleton supports the cell surface and allows 
fibroblasts and other cells to crawl. Finally, we discuss how the actin 
cytoskeleton enables our muscles to contract. 


25 um 
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Figure 17-2 The three types of protein 
filaments that form the cytoskeleton 
differ in their composition, mechanical 
properties, and roles inside the cell. They 
are shown here in epithelial cells, but they 
are all found in almost all animal cells. 
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Intermediate filaments are ropelike fibers with a 
diameter of about 10 nm; they are made of 
fibrous intermediate filament proteins. One 
type of intermediate filament forms a 
meshwork called the nuclear lamina just 
beneath the inner nuclear membrane. Other 
types extend across the cytoplasm, giving cells 
mechanical strength and distributing the 
mechanical stresses in an epithelial tissue by 
spanning the cytoplasm from one cell-cell 
junction to another. Intermediate filaments are 
very flexible and have great tensile strength. 
They deform under stress but do not rupture. 
(L. Norlen et al. Exper. Cell Res. 313:2217-2227, 
2007. With permission from Elsevier.) 
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Microtubules are hollow cylinders made of the 
protein tubulin. They are long and straight and 
typically have one end attached to a single 
microtubule-organizing center called a 
centrosome. With an outer diameter of 25 nm, 
microtubules are more rigid than actin filaments 
or intermediate filaments, and they rupture 
when stretched. (Micrograph courtesy of 
Richard Wade.) 


25 nm 


Actin filaments (also known as microfilaments) 
are helical polymers of the protein actin. They 
are flexible structures, with a diameter of about 
7 nm, that are organized into a variety of linear 
bundles, two-dimensional networks, and 
three-dimensional gels. Although actin 
filaments are dispersed throughout the cell, 
they are most highly concentrated in the cortex, 
the layer of cytoplasm just beneath the plasma 
membrane. (Micrograph courtesy of Roger 
Craig.) 


INTERMEDIATE FILAMENTS 


Intermediate filaments have great strength, and their main function 
is to enable cells to withstand the mechanical stress that occurs when 
cells are stretched. The filaments are called “intermediate” because, in 
the smooth muscle cells where they were first discovered, their diam- 
eter (about 10 nm) is between that of the thinner actin filaments and 
the thicker myosin filaments. Intermediate filaments are the toughest and 
most durable of the cytoskeletal filaments: when cells are treated with 
concentrated salt solutions and nonionic detergents, the intermediate 
filaments survive, while most of the rest of the cytoskeleton is destroyed. 


Intermediate filaments are found in the cytoplasm of most animal cells. 
They typically form a network throughout the cytoplasm, surround- 
ing the nucleus and extending out to the cell periphery. There, they are 
often anchored to the plasma membrane at cell-cell junctions called 
desmosomes (discussed in Chapter 20), where the plasma membrane is 
connected to that of another cell (Figure 17-3). Intermediate filaments 
are also found within the nucleus of animal cells. There, they form a 
meshwork called the nuclear lamina, which underlies and strengthens the 
nuclear envelope. In this section, we see how the structure and assembly 
of intermediate filaments makes them particularly suited to strengthen- 
ing cells and protecting them from tearing. 


Intermediate Filaments Are Strong and Ropelike 


An intermediate filament is like a rope in which many long strands are 
twisted together to provide tensile strength—an ability to withstand ten- 
sion without breaking (Movie 17.1). The strands of this cable are made 
of intermediate filament proteins, fibrous subunits each containing a cen- 
tral elongated rod domain with distinct unstructured domains at either 


bundles of 
intermediate 
filaments 


desmosome 
connecting 
two cells 
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Intermediate Filaments 


Figure 17-3 Intermediate filaments form a strong, durable network in the cytoplasm of the cell. (A) Immunofluorescence 


micrograph of a sheet of epithelial cells in culture stained to show the lacelike network of intermediate keratin filaments (blue), which 
surround the nuclei and extend through the cytoplasm of the cells. The filaments in each cell are indirectly connected to those of 
neighboring cells through the desmosomes, establishing a continuous mechanical link from cell to cell throughout the epithelial sheet. 
A second protein (red) has been stained to show the locations of the cell boundaries. (B) Drawing from an electron micrograph of a 
section of a skin cell, showing the bundles of intermediate filaments that traverse the cytoplasm and are inserted at desmosomes. 

(A, from K.J. Green and C.A. Gaudry, Nat. Rev. Mol. Cell. Biol. 1:208-216, 2000. With permission from Macmillan Publishers Ltd; B, from 


R.V. Krstić, Ultrastructure of the Mammalian Cell: An Atlas. Berlin: Springer, 1979. With permission from Springer-Verlag.) 
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Figure 17-4 Intermediate filaments are 
like ropes made of long, twisted strands 
of protein. (A) The intermediate filament 
monomer consists of an a-helical central 
rod domain shown with unstructured 
terminal domains at either end (not shown). 
The C-terminal end of the monomer is 
marked in dark blue to make its position 
within the assembled filament. (B) Pairs 

of monomers associate to form a dimer, 
and (C) two dimers then line up to form a 
staggered, antiparallel tetramer. 

(D) Tetramers can pack together into a 
helical array containing eight tetramer 
strands; (E) these in turn assemble into 

the final ropelike intermediate filament. 
These filaments can elongate by the 
addition of tetramer arrays to either end. 
An electron micrograph of intermediate 
filaments is shown on the upper left. 


(U. Aebi et al. Protoplasma 145:73-81, 1988. 


With permission from Springer Science and 
Business Media.) 


NH2 COOH 
(A) ggg, 


a-helical region of monomer 


Bie COOH 
B) = 
NH2 coiled-coil dimer COOH 


e 48 nmm ——__ > 


lateral association of 8 tetramers 


S aS 


Cee eee Oo 


SS SSS  —¥—S 
a eee SS FS 
— i 4 SS 


addition of 8 tetramers to growing filament 


end (Figure 17-4A). The rod domain consists of an extended a-helical 
region that enables pairs of intermediate filament proteins to form sta- 
ble dimers by wrapping around each other in a coiled-coil configuration 
(Figure 17—4B), as described in Chapter 4. Two of these coiled-coil dimers, 
running in opposite directions, associate to form a staggered tetramer 
(Figure 17-4C). These dimers and tetramers are the soluble subunits of 
intermediate filaments. The tetramers associate with each other side- 
by-side (Figure 17-4D) and then assemble to generate the final ropelike 
intermediate filament (Figure 17-4). 


Because the two dimers point in opposite directions, both ends of the 
tetramer are the same, as are the two ends of assembled intermediate 
filaments; as we will see, this distinguishes these filaments from micro- 
tubules and actin filaments, whose structural polarity is crucial for their 
function. Almost all of the interactions between the intermediate filament 
proteins depend on noncovalent bonding; it is the combined strength of 
the overlapping lateral interactions along the length of the proteins that 
gives intermediate filaments their great tensile strength. 


The central rod domains of different intermediate filament proteins are all 
similar in size and amino acid sequence, so that when they pack together 
they always form filaments of similar diameter and internal structure. By 
contrast, the terminal head and tail domains vary greatly in both size and 
amino acid sequence from one type of intermediate filament protein to 
another. These unstructured domains are exposed on the surface of the 
filament, where they allow it to interact with specific components in the 
cytoplasm. 


Intermediate Filaments Strengthen Cells Against 
Mechanical Stress 


Intermediate filaments are particularly prominent in the cytoplasm of cells 
that are subject to mechanical stress. They are present in large numbers, 
for example, along the length of nerve cell axons, providing essential 
internal reinforcement to these extremely long and fine cell extensions. 
They are also abundant in muscle cells and in epithelial cells such as 
those of the skin. In all these cells, intermediate filaments distribute the 
effects of locally applied forces, thereby keeping cells and their mem- 
branes from tearing in response to mechanical shear. A similar principle 
is used to strengthen composite materials such as fiberglass or reinforced 
concrete, in which tension-bearing linear elements such as carbon fibers 
(in fiberglass) or steel bars (in concrete) are embedded in a space-filling 
matrix to give the material strength. 


Intermediate filaments can be grouped into four classes: (1) keratin fil- 
aments in epithelial cells; (2) vimentin and vimentin-related filaments in 
connective-tissue cells, muscle cells, and supporting cells of the nerv- 
ous system (glial cells); (3) neurofilaments in nerve cells; and (4) nuclear 
lamins, which strengthen the nuclear envelope. The first three filament 
types are found in the cytoplasm, whereas the fourth is found in the 
nucleus (Figure 17-5). Filaments of each class are formed by polymeriza- 
tion of their corresponding intermediate filament subunits. 


The keratin filaments are the most diverse class of intermediate fila- 
ment. Every kind of epithelium in the vertebrate body—whether in the 
tongue, the cornea, or the lining of the gut—has its own distinctive mix- 
ture of keratin proteins. Specialized keratins also occur in hair, feathers, 
and claws. In each case, the keratin filaments are formed from a mixture 
of different keratin subunits. Keratin filaments typically span the interiors 
of epithelial cells from one side of the cell to the other, and filaments in 
adjacent epithelial cells are indirectly connected through desmosomes 
(see Figure 17-3B). The ends of the keratin filaments are anchored to the 
desmosomes, and the filaments associate laterally with other cell compo- 
nents through the globular head and tail domains that project from their 
surface. These strong cables, formed by the filaments throughout the epi- 
thelial sheet, distribute the stress that occurs when the skin is stretched. 
The importance of this function is illustrated by the rare human genetic 
disease epidermolysis bullosa simplex, in which mutations in the keratin 
genes interfere with the formation of keratin filaments in the epidermis. 
As a result, the skin is highly vulnerable to mechanical injury, and even a 
gentle pressure can rupture its cells, causing the skin to blister. The dis- 
ease can be reproduced in transgenic mice expressing a mutant keratin 
gene in their skin (Figure 17-6). 
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Figure 17-5 Intermediate filaments are 
divided into four major classes. These 
classes can include numerous subtypes. 
Humans, for example, have more than 
50 keratin genes. 


577 


578 CHAPTER 17 Cytoskeleton 


Figure 17-6 A mutant form of keratin 
makes skin more prone to blistering. 

A mutant gene encoding a truncated 
keratin protein was introduced into a 
mouse. The defective protein assembles 
with the normal keratins and thereby 
disrupts the keratin filament network in 

the skin. (A) Light micrograph of a cross 
section of normal skin, which is resistant 

to mechanical pressure. (B) Cross section 

of skin from a mutant mouse showing the 
formation of a blister, which results from the 
rupturing of cells in the basal layer of the 
mutant epidermis (short red arrow). (C) A 
sketch of three cells in the basal layer of the 
mutant epidermis. As indicated by the red 
arrow, the cells rupture between the nucleus 
and the hemidesmosomes that connect 
the cells—via their keratin filaments—to 
the underlying basal lamina. (From P.A. 
Coulombe et al., J. Cell Biol. 115:1661-1674, 
1991. With permission from The Rockefeller 
University Press.) 


Figure 17-7 Intermediate filaments 
support and strengthen the nuclear 
envelope. (A) Schematic cross section 
through the nuclear envelope. The 
intermediate filaments of the nuclear 
lamina line the inner face of the nuclear 
envelope and are thought to provide 
attachment sites for the chromosomes. 
(B) Electron micrograph of a portion 

of the nuclear lamina from a frog egg. 
The lamina is formed from a lattice of 
intermediate filaments composed of 
lamins. The nuclear lamina in other cell 
types is not always as regularly organized 
as the one shown here. (B, from U. Aebi 
et al., Nature 323:560-564, 1986. With 
permission from Macmillan Publishers Ltd.) 
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Defects in neurofilaments can also lead to disease. Neurofilaments are 
intermediate filaments that are found along the axons of vertebrate neu- 
rons, where they provide strength and stability to the long axons that 
nerve cells use to transmit information. The neurodegenerative disease 
amyotrophic lateral sclerosis (ALS, also known as Lou Gehrig’s disease) 
is associated with an abnormal accumulation of neurofilaments in the 
cell bodies and axons of motor neurons. This accretion may precipitate 
the axon degeneration and muscle weakness seen in these patients. 


The Nuclear Envelope Is Supported by a Meshwork of 
Intermediate Filaments 


Whereas cytoplasmic intermediate filaments form ropelike structures, the 
intermediate filaments lining and strengthening the inside surface of the 
inner nuclear membrane are organized as a two-dimensional meshwork 
(Figure 17-7). As mentioned earlier, the intermediate filaments that form 
this tough nuclear lamina are constructed from a class of intermediate 
filament proteins called Jamins (not to be confused with laminin, which 
is an extracellular matrix protein). The nuclear lamina disassembles and 
re-forms at each cell division, when the nuclear envelope breaks down 
during mitosis and then re-forms in each daughter cell (discussed in 
Chapter 18). 


The collapse and reassembly of the nuclear lamina is controlled by the 
phosphorylation and dephosphorylation of the lamins. Phosphorylation 
of lamins by protein kinases (discussed in Chapter 4) weakens the inter- 
actions between the lamin tetramers and causes the filaments to fall 
apart. Dephosphorylation by protein phosphatases at the end of mitosis 
allows the lamins to reassemble (see Figure 18-30). 
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Figure 17-8 Defects in a nuclear lamin can cause a rare class of 
premature aging disorders called progeria. (A) In a normal cell, 
the protein lamin A (green) is assembled into a uniform nuclear 
lamina inside the nuclear envelope. (B) In a cell with a lamin A 
mutant that is found in patients with progeria, the nuclear lamina is 
defective, resulting in structural defects in the nuclear envelope. 
(C) Children with progeria begin to show features of advanced 
aging early in life. (A and B, from P. Taimen et al., Proc. Natl Acad. 
Sci. USA 106:20788-20793, 2009. With permission from National 
Academy of Sciences; C, courtesy of The Progeria Research 
Foundation, www.progeriaresearch.org.) 


Defects in a particular nuclear lamin are associated with certain types 
of progeria—rare disorders that cause affected individuals to age prema- 
turely. Children with progeria have wrinkled skin, lose their teeth and hair, 
and often develop severe cardiovascular disease by the time they reach 
their teens (Figure 17-8). How the loss of a nuclear lamin could lead to 
this devastating condition is not yet clear, but it may be that the resulting 
nuclear instability leads to impaired cell division, increased cell death, 
a diminished capacity for tissue repair, or some combination of these. 
Because the nuclear lamina also helps properly position chromosomes, 
defects in lamins might also lead to altered chromosome movement and, 
ultimately, changes in gene expression. 


Linker Proteins Connect Cytoskeletal Filaments and 
Bridge the Nuclear Envelope 


Many intermediate filaments are further stabilized and reinforced by 
accessory proteins, such as plectin, that cross-link the filaments into 
bundles and connect them to microtubules, to actin filaments, and to 
adhesive structures in desmosomes (Figure 17-9). Mutations in the gene 
for plectin cause a devastating human disease that combines features 
of epidermolysis bullosa simplex (caused by disruption of skin keratin), 
muscular dystrophy (caused by disruption of intermediate filaments in 
muscle), and neurodegeneration (caused by disruption of neurofila- 
ments). Mice lacking a functional plectin gene die within a few days of 
birth, with blistered skin and abnormal skeletal and heart muscle. Thus, 
although plectin may not be necessary for the initial formation of inter- 
mediate filaments, its cross-linking action is required to provide cells 
with the strength they need to withstand mechanical stress. 


Plectin and other proteins also interact with protein complexes that link 
the cytoplasmic cytoskeleton to structures in the nuclear interior, includ- 
ing chromosomes and the nuclear lamina (Figure 17-10). These bridges, 
which span the nuclear envelope, mechanically couple the nucleus to 
the cytoskeleton, and they are involved in many processes, including the 
movement and positioning of the nucleus within the cell interior and the 
overall organization of the cytoskeleton. 
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QUESTION 17-1 


Which of the following types of cells 

would you expect to contain a high 

density of intermediate filaments 

in their cytoplasm? Explain your 

answers. 

A. Amoeba proteus (a free-living 
amoeba) 

B. Skin epithelial cell 

C. Smooth muscle cell in the 
digestive tract 


D. Escherichia coli 

E. Nerve cell in the spinal cord 
F. Sperm cell 

G. Plant cell 


Figure 17-9 Plectin aids in the bundling 
of intermediate filaments and links 
these filaments to other cytoskeletal 
protein networks. In this scanning 
electron micrograph of the cytoskeletal 
protein network from cultured fibroblasts, 
the actin filaments have been removed, 
and the plectin, intermediate filaments, 
and microtubules have been artificially 
colored. Note how the plectin (orange) 
links an intermediate filament (blue) to 
microtubules (green). The yellow dots are 
gold particles linked to antibodies that 
recognize plectin. (From T.M. Svitkina, 
A.B. Verkhovsky, and G.G. Borisy, J. Cell 
Biol. 135:991-1007, 1996. With permission 
from The Rockefeller University Press.) 
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Figure 17-10 Protein complexes bridge 
the nucleus and cytoplasm through 
the nuclear envelope. The cytoplasmic 


cytoskeleton is connected across the 


nuclear envelope to the nuclear lamina 
or chromosomes through sets of linker 
proteins of the SUN (orange) and KASH 


(purple) families. 
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MICROTUBULES 


Microtubules have a crucial organizing role in all eukaryotic cells. These 
long and relatively stiff, hollow tubes of protein can rapidly disassem- 
ble in one location and reassemble in another. In a typical animal cell, 
microtubules grow out from a small structure near the center of the cell 
called the centrosome (Figure 17-11A and B). Extending out toward the 
cell periphery, they create a system of tracks within the cell, along which 
vesicles, organelles, and other cell components can be transported. 
These cytoplasmic microtubules are the part of the cytoskeleton mainly 
responsible for transporting and positioning membrane-enclosed orga- 
nelles within the cell and for guiding the intracellular transport of various 
cytosolic macromolecules. 


When a cell enters mitosis, the cytoplasmic microtubules disassemble 
and then reassemble into an intricate structure called the mitotic spindle. 
As we discuss in Chapter 18, the mitotic spindle provides the machinery 
that will segregate the chromosomes equally into the two daughter cells 
just before a cell divides (Figure 17-11C). Microtubules can also form 
stable structures, such as rhythmically beating cilia and flagella (Figure 
17-11D). These hairlike structures extend from the surface of many 
eukaryotic cells, which use them either to swim or to sweep fluid over 
their surface. The core of a eukaryotic cilium or flagellum consists of a 
highly organized and stable bundle of microtubules. (Bacterial flagella 
have an entirely different structure and allow the cells to swim by a very 
different mechanism.) 


In this section, we first consider the structure and assembly of microtu- 
bules. We then discuss their role in organizing the cytoplasm—an ability 
that depends on their association with accessory proteins, especially the 
motor proteins that propel organelles along cytoskeletal tracks. Finally, 
we discuss the structure and function of cilia and flagella, in which micro- 
tubules are stably associated with motor proteins that power the beating 
of these mobile appendages. 


Figure 17-11 Microtubules usually grow out from an organizing 
center. (A) Fluorescence micrograph of a cytoplasmic array of 
microtubules in a cultured fibroblast. Unlike intermediate filaments, 
microtubules (green) extend from organizing centers such as (B) a 
centrosome, (C) the two poles of a mitotic spindle, or (D) the basal 
body of a cilium. They can also grow from fragments of existing 
microtubules (not shown). (A, reprinted with permission from Olympus 
Corporation of the Americas Scientific Solutions Group.) 
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Figure 17-12 Microtubules are hollow 
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Microtubules Are Hollow Tubes with Structurally 
Distinct Ends 


Microtubules are built from subunits—molecules of tubulin—each of 
which is a dimer composed of two very similar globular proteins called 
a-tubulin and f-tubulin, bound tightly together by noncovalent interac- 
tions. The tubulin dimers stack together, again by noncovalent bonding, 
to form the wall of the hollow, cylindrical microtubule. This tubelike struc- 
ture is made of 13 parallel protofilaments, each a linear chain of tubulin 
dimers with a- and B-tubulin alternating along its length (Figure 17-12). 
Each protofilament has a structural polarity, with a-tubulin exposed at 
one end and f-tubulin at the other, and this polarity is the same for all 
the protofilaments in the microfilament. Thus the microtubule as a whole 
has a structural polarity: the end with B-tubulin showing is called its plus 
end, and the opposite end, which contains exposed a-tubulin, is called 
the minus end. 


In a concentrated solution of pure tubulin in a test tube, tubulin dimers 
will add to either end of a growing microtubule. However, they add 
more rapidly to the plus end than to the minus end, which is why the 
ends were originally named this way—not because they are electrically 
charged. The polarity of the microtubule—the fact that its structure has 
a definite direction, with the two ends being chemically and functionally 
distinct—is crucial, both for the assembly of microtubules and for their 
role once they are formed. If microtubules had no polarity, they could not, 
for example, guide directional intracellular transport. 


The Centrosome Is the Major Microtubule-organizing 
Center in Animal Cells 
Inside cells, microtubules grow from specialized organizing centers that 


control the location, number, and orientation of the microtubules. In 
most animal cells, for example, the centrosome—which is typically close 
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Figure 17-13 Tubulin polymerizes from 
nucleation sites on a centrosome. 

(A) Schematic drawing showing that an 
animal cell centrosome consists of an 
amorphous matrix of various proteins, 
including the y-tubulin rings (red) that 
nucleate microtubule growth, surrounding 
a pair of centrioles, oriented at right angles 
to each other. Each member of the centriole 
pair is made up of a cylindrical array of short 
microtubules. (B) Diagram of a centrosome 
with attached microtubules. The minus end 
of each microtubule is embedded in the 
centrosome, having grown from a y-tubulin 
ring complex, whereas the plus end of each 
microtubule extends into the cytoplasm. 
(C) An image of a centrosome, 
reconstructed from serial sections of a 
Caenorhabditis elegans cell, showing 

a dense thicket of microtubules (green) 
emanating from y-tubulin ring complexes 
(red). A pair of centrioles, themselves 
made of short microtubules (blue), can be 
seen at the center. (C, from E.T. O'Toole 

et al., J. Cell Biol. 163:451-456, 2003. With 
permission from The Rockefeller University 
Press.) 
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Figure 17-14 Each microtubule grows and 
shrinks independently of its neighbors. 
The array of microtubules anchored in a 
centrosome is continually changing, as 
some microtubules grow (red arrows) and 
others shrink (blue arrows). 
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to the cell nucleus when the cell is not in mitosis—organizes an array of 
microtubules that radiates outward through the cytoplasm (see Figure 
17-11B). The centrosome consists of a pair of centrioles, surrounded by 
a matrix of proteins. The centrosome matrix includes hundreds of ring- 
shaped structures formed from a special type of tubulin called y-tubulin, 
and each y-tubulin ring complex serves as the starting point, or nucleation 
site, for the growth of one microtubule (Figure 17-13A). The aß-tubulin 
dimers add to each y-tubulin ring complex in a specific orientation, with 
the result that the minus end of each microtubule is embedded in the 
centrosome, and growth occurs only at the plus end that extends into the 
cytoplasm (Figure 17-13B and C). 


The paired centrioles at the center of an animal cell centrosome are 
curious structures. Each centriole, sitting perpendicular to its partner, is 
made of a cylindrical array of short microtubules (see Figure 17-13C). 
Yet centrioles have no role in the nucleation of microtubules from the 
centrosome: the y-tubulin ring complex alone is sufficient. Thus, their 
function remains something of a mystery, especially as most plant cells 
lack them. Centrioles do, however, act as the organizing centers for the 
microtubules in cilia and flagella, where they are called basal bodies (see 
Figure 17-11D), as we discuss later. 


Why do microtubules need nucleating sites such as those provided by the 
y-tubulin rings in the centrosome? The answer is that it is much harder 
to start a new microtubule from scratch, by first assembling a ring of 
aß-tubulin dimers, than it is to add such dimers to a preexisting y-tubulin 
ring complex. Although purified aB-tubulin dimers at a high concentration 
can polymerize into microtubules spontaneously in vitro, the concentra- 
tion of free aß-tubulin in a living cell is too low to drive the difficult first 
step of assembling the initial ring of a new microtubule. By providing 
organizing centers at specific sites, and keeping the concentration of free 
aB-tubulin dimers low, cells can control more precisely where microtu- 
bules form. 


Microtubules Display Dynamic Instability 


Once a microtubule has been nucleated, it typically grows outward from 
the organizing center for many minutes by the addition of aB-tubulin 
dimers to its free plus end. Then, without warning, the microtubule can 
suddenly undergo a transition that causes it to shrink rapidly by losing 
tubulin dimers from its plus end (Movie 17.2). The microtubule may 
shrink partially and then, no less suddenly, start growing again, or it may 
disappear completely, to be replaced by a new microtubule that grows 
from the same y-tubulin ring complex (Figure 17-14). 
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growing microtubule unstable stable Figure 17-15 Microtubules can be 
nucleus centrosome microtubule capping microtubules microtubules stabilized by attachment to capping 


proren proteins. A newly formed microtubule will 
persist only if both its ends are protected 
from depolymerization. In cells, the 
minus ends of microtubules are generally 
protected by the organizing centers from 
which the microtubules grow. The plus ends 
(A) (B) © (D) are initially free but can be stabilized by 


binding to specific capping proteins. 
This remarkable behavior—switching back and forth between polym- ae H a sk ee cell 
erization and depolymerization—is known as dynamic instability. It cle pictec wita newimictotubules crowing 
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function. In a normal cell, the centrosome (or other organizing center) is plus end happens to encounter a capping 
continually shooting out new microtubules in different directions in an protein in a specific region of the cell cortex, 
exploratory fashion, many of which then retract. A microtubule growing that microtubule will be stabilized. (C and D) 
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ifits plus end is stabilized by attachment to another molecule or cell struc- array, such that an organized system of 
ture so as to prevent its depolymerization. If stabilized by attachment to a microtubules will be set up selectively in 
structure in a more distant region of the cell, the microtubule will estab- one part of the cell. 

lish a relatively stable link between that structure and the centrosome 

(Figure 17-15). The centrosome can thus be compared to a fisherman 

casting a line: if there is no bite at the end of the line, the line is quickly 

withdrawn, and a new cast is made; but, if a fish bites, the line remains in 

place, tethering the fish to the fisherman. This simple strategy of random 

exploration and selective stabilization enables the centrosome and other 

nucleating centers to set up a highly organized system of microtubules in 

selected parts of the cell. The same strategy is used to position organelles 

relative to one another. 


Dynamic Instability Is Driven by GTP Hydrolysis 


The dynamic instability of microtubules stems from the intrinsic capacity 
of tubulin dimers to hydrolyze GTP. This energetically favorable reaction, 
which generates GDP and inorganic phosphate, is similar to the hydroly- 
sis of ATP (see Figure 3-30). 


Each free tubulin dimer contains one GTP molecule tightly bound to 
B-tubulin, which hydrolyzes the GTP to GDP shortly after the dimer is 
added to a growing microtubule. The GDP produced by this hydrolysis 
remains tightly bound to the B-tubulin. When polymerization is proceed- 
ing rapidly, tubulin dimers add to the end of the microtubule faster than 
the GTP they carry is hydrolyzed. As a result, the end of a rapidly growing 
microtubule is composed entirely of GTP-tubulin dimers, which form a 
“GTP cap." GTP-associated dimers bind more strongly to their neighbors 
in the microtubule than do dimers that bear GDP, and they pack together 
more efficiently. Thus the microtubule will continue to grow (Figure 
17-16A). 


Because of the randomness of chemical processes, however, it will occa- 
sionally happen that the tubulin dimers at the free end of the microtubule 
will hydrolyze their GTP before the next dimers are added, so that the 
free ends of protofilaments are now composed of GDP-tubulin. These 
GDP-bearing dimers associate less tightly, tipping the balance in favor of 
disassembly (Figure 17-16B). Because the rest of the microtubule is com- 
posed of GDP-tubulin, once depolymerization has started, it will tend to 
continue; the microtubule starts to shrink rapidly and continuously and 
may even disappear. 


The GDP-tubulin that is freed as the microtubule depolymerizes joins the 
pool of unpolymerized tubulin already in the cytosol. In a typical fibro- 
blast, for example, about half of the tubulin in the cell is in microtubules, 
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QUESTION 17-2 TABLE 17-1 DRUGS THAT AFFECT MICROTUBULES 


Why do you suppose it is much 
easier to add tubulin to existing 
microtubules than to start a new 
microtubule from scratch? Explain 
how y-tubulin in the centrosome 
helps to overcome this hurdle. 


% 


Figure 17-16 GTP hydrolysis controls the dynamic instability of 
microtubules. (A) Tubulin dimers carrying GTP (red) bind more tightly 
to one another than do tubulin dimers carrying GDP (dark green). The 
rapidly growing plus ends of microtubules, capped by newly added 
GTP-tubulin, therefore tend to keep growing. (B) From time to time, 
however, especially if microtubule growth is slow, the dimers in this 
GTP cap will hydrolyze their GTP to GDP before fresh dimers loaded 
with GTP have time to bind. The GTP cap is thereby lost. Because 

the GDP-carrying dimers are less tightly bound in the polymer, the 
protofilaments peel away from the plus end, and the dimers are 
released, causing the microtubule to shrink (Movie 17.3). 


while the remainder is free in the cytosol, where it is available for micro- 
tubule growth. Tubulin dimers joining this cytosolic pool rapidly exchange 
their bound GDP for GTP, thereby becoming competent to add to another 
growing microtubule. 


Microtubule Dynamics Can Be Modified by Drugs 


Drugs that prevent the polymerization or depolymerization of tubulin 
dimers can have a rapid and profound effect on the organization of micro- 
tubules—and thereby on the behavior of the cell. Consider the mitotic 
spindle, the microtubule-based apparatus that guides the chromosomes 
during mitosis (see Figure 17-11C). If a cell in mitosis is exposed to the 
drug colchicine, which binds tightly to free tubulin dimers and prevents 
their polymerization into microtubules, the mitotic spindle rapidly disap- 
pears, and the cell stalls in the middle of mitosis, unable to partition the 
chromosomes into two groups. This observation, and others like it, dem- 
onstrates that the mitotic spindle is normally maintained by a balanced 
addition and loss of tubulin subunits: when tubulin addition is blocked by 
colchicine, tubulin loss continues until the spindle disappears. 


The drug Taxol has the opposite effect on microtubule growth. It binds 
tightly to microtubules and prevents them from losing subunits. Because 
new subunits can still be added, the microtubules can grow but cannot 
shrink. Despite this difference in the mechanism of action, Taxol has 
the same overall effect as colchicine—arresting dividing cells in mitosis. 
These experiments show that for the mitotic spindle to function, microtu- 
bules must be able to assemble and disassemble. We discuss the behavior 
of the spindle in more detail in Chapter 18, when we consider mitosis. 


The inactivation or destruction of the mitotic spindle eventually kills 
dividing cells. Because cancer cells divide in a less controlled way than 
do normal cells of the body, they can sometimes be destroyed prefer- 
entially by drugs that either stabilize or destabilize microtubules. Such 
antimitotic drugs, which include colchicine and Taxol, are used to treat 
human cancers (Table 17—1). As we discuss shortly, there are also drugs 
that affect the polymerization of actin filaments. 


Microtubules Organize the Cell Interior 


Cells are able to modify the dynamic instability of their microtubules for 
particular purposes. As cells enter mitosis, for example, microtubules 


Microtubule-specific Action 
Drugs 


Binds and stabilizes microtubules 


Taxol 
Colchicine, colcemid Binds tubulin dimers and prevents their polymerization 


Vinblastine, vincristine Binds tubulin dimers and prevents their polymerization 


nerve cell body 


axon 
terminal 


become more dynamic, switching between growing and shrinking much 
more frequently than cytoplasmic microtubules normally do. This change 
enables microtubules to disassemble rapidly and then reassemble into 
the mitotic spindle. On the other hand, when a cell has differentiated into 
a specialized cell type, the dynamic instability of its microtubules is often 
suppressed by proteins that bind to the ends or the sides of the microtu- 
bules and protect them against disassembly. The stabilized microtubules 
then serve to maintain the organization of the differentiated cell. 


Most differentiated animal cells are polarized; that is, one end of the cell 
is structurally or functionally different from the other. Nerve cells, for 
example, put out an axon from one end of the cell and dendrites from 
the other (see Figure 12-30). Cells specialized for secretion have their 
Golgi apparatus positioned toward the site of secretion, and so on. The 
cell's polarity is a reflection of the polarized systems of microtubules in 
its interior, which help to position organelles in their required location 
within the cell and to guide the streams of vesicular and macromolecular 
traffic moving between one part of the cell and another. In the nerve cell, 
for example, all the microtubules in the axon point in the same direc- 
tion, with their plus ends toward the axon terminals; along these oriented 
tracks, the cell is able to transport organelles, membrane vesicles, and 
macromolecules—either from the cell body to the axon terminals or in 
the opposite direction (Figure 17-17). 


Although some of the traffic along axons travels at speeds in excess of 10 
cm per day (Figure 17-18), it could still take a week or more for materials 
to reach the end of a long axon in larger animals. Nonetheless, move- 
ment guided by microtubules is immeasurably faster and more efficient 
than movement driven by free diffusion. A protein molecule traveling 
by free diffusion could take years to reach the end of a long axon—if it 
arrived at all (see Question 17-12). 


The microtubules in living cells do not act alone. Their activity, like those 
of other cytoskeletal filaments, depends on a large variety of accessory 
proteins that bind to them. Some of these microtubule-associated pro- 
teins stabilize microtubules against disassembly, for example, while 
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Figure 17-17 Microtubules guide the 
transport of organelles, vesicles, and 
macromolecules in both directions along 
a nerve cell axon. All of the microtubules 
in the axon point in the same direction, with 
their plus ends toward the axon terminal. 
The oriented microtubules serve as tracks 
for the directional transport of materials 
synthesized in the cell body but required 
at the axon terminal. For an axon passing 
from your spinal cord to a muscle in your 
shoulder, the journey takes about two 
days. In addition to this outward traffic 
(red circles), which is driven by one set 

of motor proteins, there is traffic in the 
reverse direction (blue circles), which is 
driven by another set of motor proteins. 
The backward traffic includes worn-out 
mitochondria and materials ingested by 
the axon terminals. 


Figure 17-18 Organelles can move 
rapidly and unidirectionally in a nerve 
cell axon. In this series of video-enhanced 
images of a flattened area of an invertebrate 
nerve axon, numerous membrane vesicles 
and mitochondria are present, many of 
which can be seen to move. The white circle 
provides a fixed frame of reference. These 
images were recorded at intervals of 400 
milliseconds. The two vesicles in the circle 
are moving along microtubules toward the 
axon terminal. (Courtesy of P. Forscher.) 
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QUESTION 17-3 


Dynamic instability causes 
microtubules either to grow or to 
shrink rapidly. Consider an individual 
microtubule that is in its shrinking 
phase. 

A. What must happen at the end 

of the microtubule in order for it to 
stop shrinking and to start growing 
again? 

B. How would a change in the 
tubulin concentration affect this 
switch? 

C. What would happen if only GDP, 
but no GTP, were present in the 
solution? 

D. What would happen if the 
solution contained an analog of GTP 
that cannot be hydrolyzed? 


Figure 17-19 Motor proteins move 
along microtubules using their globular 
heads. (A) Kinesins and cytoplasmic 
dyneins are microtubule motor proteins 
that generally move in opposite directions 
along a microtubule. Most kinesins move 
toward the plus end of a microtubule, 
whereas dyneins move toward the minus 
end (Movie 17.4). Each of these proteins 
(drawn here roughly to scale) is a dimer 
composed of two identical subunits. Each 
dimer has two globular heads at one end, 
which bind and hydrolyze ATP and interact 
with microtubules, and a single tail at the 
other end, which interacts with cargo, 
either directly or indirectly through adaptor 
proteins (see Figure 17-20). (B) Schematic 


diagram of a kinesin motor protein “walking” 


hand-over-hand along a filament. The two 
heads use the energy of ATP binding and 
hydrolysis to move in one direction along 
the filament (ATP in red, ADP in pink). As 
shown here, ATP hydrolysis and phosphate 
release by the rear motor head loosens 

its attachment to the microtubule. ADP 
release and ATP binding by the front motor 
head then cause a dramatic conformational 
change that flips the rear motor head to the 
front, thereby completing a single step. (See 
also Figure 17-23B.) 


others link microtubules to other cell components, including the other 
types of cytoskeletal filaments (see Figure 17-9). Still others are motor 
proteins that actively transport organelles, vesicles, and other macromol- 
ecules along microtubules, as we discuss next. 


Motor Proteins Drive Intracellular Transport 


If a living cell is observed in a light microscope, its cytoplasm is seen 
to be in continual motion. Mitochondria and the smaller membrane- 
enclosed organelles and vesicles travel in small, jerky steps—moving for 
a short period, stopping, and then moving again. This saltatory move- 
ment is much more sustained and directional than the continual, small, 
Brownian movements caused by random thermal motions. Saltatory 
movements can occur along either microtubules or actin filaments. In 
both cases, the movements are driven by motor proteins, which use the 
energy derived from repeated cycles of ATP hydrolysis to travel steadily 
along the microtubule or actin filament in a single direction (see Figure 
4-50). Because the motor proteins also attach to other cell components, 
they can transport this cargo along the filaments. 


The motor proteins that move along cytoplasmic microtubules, such as 
those in the axon of a nerve cell, belong to two families: the kinesins 
generally move toward the plus end of a microtubule (outward from 
the cell body in Figure 17-17); the dyneins move toward the minus end 
(toward the cell body in Figure 17-17). Kinesins and cytoplasmic dyneins 
are generally dimers that have two globular ATP-binding heads and a 
single tail (Figure 17-19A); members of a second class of dyneins, the 
ciliary dyneins, have a different structure and will be discussed later. The 
heads of kinesin and cytoplasmic dynein interact with microtubules in a 
stereospecific manner, so that the motor protein will attach to a micro- 
tubule in only one direction. The tail of a motor protein generally binds 
stably to some cell component, such as a vesicle or an organelle, and 
thereby determines the type of cargo that the motor protein can transport 
(Figure 17-20). The globular heads of kinesin and dynein are enzymes 
with ATP-hydrolyzing (ATPase) activity. This reaction provides the energy 
for driving a directed series of conformational changes in the head that 
enable the motor protein to move along the microtubule by a cycle of 
binding, release, and rebinding to the microtubule (Figure 17-19B and 
see Figure 4-50). For a discussion of the discovery and study of motor 
proteins, see How We Know, pp. 588-589. 
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Microtubules and Motor Proteins Position Organelles in 
the Cytoplasm 


Microtubules and motor proteins play an important part in positioning 
organelles within a eukaryotic cell. In most animal cells, for example, the 
tubules of the endoplasmic reticulum (ER) reach almost to the edge of 
the cell (Movie 17.5), whereas the Golgi apparatus is located in the cell 
interior, near the centrosome (Figure 17-21A). The ER extends out from 
its points of connection with the nuclear envelope along microtubules, 
which reach from the centrally located centrosome out to the plasma 
membrane. As a cell grows, kinesins attached to the outside of the ER 
membrane (via adaptor proteins) pull the ER outward along microtubules, 
stretching it like a net (Figure 17-21B). Cytoplasmic dyneins attached to 
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Figure 17-20 Different motor proteins 
transport different types of cargo along 
microtubules. The transport of cargo 
toward the plus end of a microtubule is 
carried out by different types of kinesin 
motors, each of which is thought to 
transport a specific set of vesicles, 
organelles, or molecules. In some cases, the 
tail of the kinesin binds directly to the cargo, 
while in other cases, different adaptor 
proteins allow the same type of kinesin to 
carry different cargos. Transport toward 

the minus end is mediated by cytoplasmic 
dynein, which generally uses adaptor 
proteins to interact with its selected cargo. 
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Figure 17-21 Microtubules help position organelles in a eukaryotic cell. (A) Schematic diagram of a cell showing the typical 
arrangement of cytoplasmic microtubules (green), endoplasmic reticulum (blue), and Golgi apparatus (yellow). The nucleus is shown 

in brown, and the centrosome in light green. (B) One part of a cell in culture stained with antibodies to the endoplasmic reticulum 
(blue, upper panel) and to microtubules (green, lower panel). Kinesin motor proteins pull the endoplasmic reticulum outward along the 
microtubules. (C) A different cell in culture stained with antibodies to the Golgi apparatus (yellow, upper panel) and to microtubules 
(green, lower panel). In this case, cytoplasmic dyneins pull the Golgi apparatus inward along the microtubules to its position near 

the centrosome, which is not visible but is located on the Golgi side of the nucleus. (B, courtesy of Mark Terasaki, Lan Bo Chen, and 


Keigi Fujiwara; C, courtesy of Viki Allan and Thomas Kreis.) 


HOW WE KNOW 


PURSUING MICROTUBULE-ASSOCIATED MOTOR PROTEINS 


The movement of organelles throughout the cell cyto- 
plasm has been a subject of observation, measurement, 
and speculation since the middle of the nineteenth 
century. But it was not until the mid-1980s that biolo- 
gists identified the molecules that drive this movement 
of organelles and vesicles from one part of the cell to 
another. 


Why the lag between observation and understanding? 
The problem was in the proteins—or, more precisely, in 
the difficulty of studying them in isolation outside the cell. 
To investigate the activity of an enzyme, for example, 
biochemists first purify the protein: they break open cells 
or tissues and separate the enzyme from other molecu- 
lar components (see Panels 4-4 and 4-5, pp. 166-167). 
They can then study the enzyme in a test tube (in vitro), 
controlling its exposure to substrates, inhibitors, ATP, 
and so on. Unfortunately, this approach did not seem to 
work for studies of the motile machinery that underlies 
intracellular transport. It is not possible to break open a 
cell and pull out an intact, fully active transport system, 
free of extraneous material, that continues to carry mito- 
chondria and vesicles from place to place. 


That problem was solved by technical advances in 
two separate fields. First, improvements in microscopy 
allowed biologists to see that an operational transport 
system (with extraneous material still attached) could be 
squeezed from the right kind of living cell. At the same 
time, biochemists realized that they could assemble a 
working transport system from scratch—using purified 
cytoskeletal filaments, motors, and cargo—outside the 
cell. One such breakthrough started with a squid. 


Teeming cytoplasm 


Neuroscientists interested in the electrical properties of 
nerve cell membranes have long studied the giant axon 
from squid (see How We Know, pp. 412-413). Because 
of its large size, researchers found that they could 
squeeze the cytoplasm from the axon like toothpaste, 
and then study how ions move back and forth through 
various channels in the empty, tubelike plasma mem- 
brane (see Figure 12-33). These investigators discarded 
the extruded cytoplasmic jelly, as it appeared to be inert 
(and thus uninteresting) when examined under a stand- 
ard light microscope. 


Then along came video-enhanced microscopy. This 
type of microscopy, developed by Shinya Inoué, Robert 
Allen, and others, allows one to detect structures that 
are smaller than the resolving power of standard light 
microscopes, which is only about 0.2 um, or 200 nm 
(see Panel 1-1, pp. 12-13). The resulting images are 
captured by a video camera and then enhanced by com- 
puter processing to reduce the background and heighten 


contrast. When researchers in the early 1980s applied 
this new technique to preparations of squid axon cyto- 
plasm (axoplasm), they observed, for the first time, the 
motion of vesicles and other organelles along cytoskel- 
etal filaments. 


Under the video-enhanced microscope, extruded axo- 
plasm is seen to be teeming with tiny particles—from 
vesicles 30-50 nm in diameter to mitochondria some 
5000 nm long—all moving to and fro along cytoskeletal 
filaments at speeds of up to 5 um per second. If the axo- 
plasm is spread thinly enough, individual filaments can 
be seen. 


The movement continues for hours, allowing research- 
ers to manipulate the preparation and study the effects. 
Ray Lasek and Scott Brady discovered, for example, that 
the organelle movement requires ATP. Substitution of 
ATP analogs, such as AMP-PNP, which resemble ATP 
but cannot be hydrolyzed (and thus provide no energy), 
inhibit the translocation. 


Snaking tubes 


More work was needed to identify the individual com- 
ponents that comprise the transport system in squid 
axoplasm. What kind of filaments support this move- 
ment? What are the molecular motors that shuttle the 
vesicles and organelles along these filaments? Identifying 
the filaments was relatively easy: antibodies to tubulin 
revealed that they are microtubules. But what about the 
motor proteins? To find these, Ron Vale, Thomas Reese, 
and Michael Sheetz set up a system in which they could 
fish for proteins that power organelle movement. 


Their strategy was simple yet elegant: add together 
microtubules and organelles and then look for molecules 
that induce motion. They used purified microtubules 
from squid brain, added organelles isolated from squid 
axons, and showed that organelle movement could be 
triggered by the addition of an extract from squid axo- 
plasm. In this preparation, the researchers could either 
watch the organelles travel along the microtubules or 
watch the microtubules glide snakelike over the surface 
of a glass cover slip that had been coated with an axo- 
plasm extract (see Question 17-18). Their challenge was 
to isolate the protein responsible for movement in this 
reconstituted system. 


To do that, Vale and his colleagues took advantage 
of the earlier work with the ATP analog AMP-PNP. 
Although this analog inhibits the movement of vesicles 
along microtubules, it still allows organelles to attach to 
the microtubule filaments. So the researchers incubated 
the axoplasm extract with microtubules and organelles 
in the presence of AMP-PNP; they then pulled out the 
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microtubules with what they hoped were the motor 
proteins still attached. Vale and his team then added 
ATP to release the attached proteins, and they found a 
110-kilodalton polypeptide that could stimulate the 
gliding of microtubules along a glass cover slip (Figure 
17-22). They dubbed the molecule kinesin (from the 
Greek kinein, “to move’). 


Similar in vitro motility assays have been instrumental 
in the study of other motor proteins—such as myosins, 
which move along actin filaments, as we discuss later. 
Subsequent studies showed that kinesin moves along 
microtubules from the minus end to the plus end; they 
also identified many other motor proteins of the kinesin 
family. 


Lights, camera, action 
Combining such assays with ever more refined micro- 


scopic techniques, researchers can now monitor the 
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Figure 17-22 Kinesin causes microtubule gliding in vitro. 
In an in vitro motility assay, purified kinesin is mixed with 
microtubules in the presence of ATP. When a drop of the 
mixture is placed on a glass slide and examined by video- 
enhanced microscopy, individual microtubules (artificially 
colored) can be seen gliding over the slide. They are driven 
by kinesin molecules, which attach to the glass slide by 
their tails. Images were recorded at 1-second intervals. The 
microtubules moved at about 1-2 um/sec. (Courtesy of Nick 
Carter and Rob Cross.) 


movement of individual motor proteins along single 
microtubules, even in living cells. 


Observation of kinesin molecules, labeled with a fluo- 
rescent marker protein, revealed that this motor protein 
marches along microtubules processively—that is, each 
molecule takes multiple “steps” along the filament (100 
or so) before falling off. The length of each step is 8 nm, 
which corresponds to the spacing of individual tubulin 
dimers along the microtubule. Combining these obser- 
vations with assays of ATP hydrolysis, researchers have 
confirmed that one molecule of ATP is hydrolyzed per 
step. Kinesin can move in a processive manner because 
it has two heads. This enables it to walk toward the plus 
end of the microtubule in a “hand-over-hand” fashion, 
each head repetitively binding and releasing the fila- 
ment as it swings past the bound head in front (Figure 
17-23). Such studies now allow us to follow the foot- 
steps of these fascinating and industrious proteins—step 
by molecular step. 


Figure 17-23 A single molecule of 
kinesin moves along a microtubule. 
(A) Three frames, separated by intervals 
of 1 second, record the movement of an 
individual, fluorescently labeled kinesin 
molecule (red) along a microtubule 
(green); the labeled kinesin moves at 

a speed of 0.3 um/sec. (B) A series of 
molecular models of the two heads of 
a kinesin molecule, showing how they 
are thought to walk processively (left to 
right) along a microtubule in a series of 
8-nm steps in which one head swings 
past the other (Movie 17.6). (A and B, 
courtesy of Ron Vale.) 
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Figure 17-24 Many hairlike cilia project 
from the surface of the epithelial cells 
that line the human respiratory tract. In 
this scanning electron micrograph, thick 
tufts of cilia can be seen extended from 
these ciliated cells, which are interspersed 
with the dome-shaped surfaces of 
nonciliated epithelial cells. (Reproduced 
from R.G. Kessel and R.H. Kardon, Tissues 
and Organs. San Francisco: W.H. Freeman 
& Co., 1979.) 


power stroke 


Figure 17-25 A cilium beats by 
performing a repetitive cycle of 
movements, consisting of a power stroke 
followed by a recovery stroke. In the fast 
power stroke, the cilium is fully extended 
and fluid is driven over the surface of the 
cell; in the slower recovery stroke, the 
cilium curls back into position with minimal 
disturbance to the surrounding fluid. Each 
cycle typically requires 0.1-0.2 second and 


generates a force parallel to the cell surface. 


the Golgi membranes pull the Golgi apparatus along microtubules in the 
opposite direction, inward toward the nucleus (Figure 17-210C). In this 
way, the regional differences in these internal membranes—crucial for 
their respective functions—are created and maintained. 


When cells are treated with colchicine—a drug that promotes micro- 
tubule disassembly—both the ER and the Golgi apparatus change their 
location dramatically. The ER, which is physically connected to the 
nuclear envelope, collapses around the nucleus; the Golgi apparatus, 
which is not attached to any other organelle, fragments into small vesi- 
cles, which then disperse throughout the cytoplasm. When the colchicine 
is removed, the organelles return to their original positions, dragged by 
motor proteins moving along the re-formed microtubules. 


Cilia and Flagella Contain Stable Microtubules Moved 
by Dynein 


We mentioned earlier that many microtubules in cells are stabilized 
through their association with other proteins and therefore do not show 
dynamic instability. Cells use such stable microtubules as stiff supports in 
the construction of a variety of polarized structures, including motile cilia 
and flagella. Cilia are hairlike structures about 0.25 um in diameter, cov- 
ered by plasma membrane, that extend from the surface of many kinds 
of eukaryotic cells. Each cilium contains a core of stable microtubules, 
arranged in a bundle, that grow from a cytoplasmic basal body, which 
serves as an organizing center (see Figure 17-11D). 


Motile cilia beat in a whiplike fashion, either to move fluid over the sur- 
face of a cell or to propel single cells through a fluid. Some protozoa, for 
example, use cilia to collect food particles, and others use them for loco- 
motion. On the epithelial cells lining the human respiratory tract (Figure 
17-24), huge numbers of beating cilia (more than a billion per square 
centimeter) sweep layers of mucus containing trapped dust particles and 
dead cells up toward the throat, to be swallowed and eventually elimi- 
nated from the body. Similarly, beating cilia on the cells of the oviduct 
wall create a current that helps to carry eggs away from the ovary. Each 
cilium acts as a small oar, moving in a repeated cycle that generates the 
movement of fluid over the cell surface (Figure 17-25). 


The flagella (singular flagellum) that propel sperm and many protozoa 
are usually very much longer than cilia are. They are designed to move 
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Figures 17-26 Flagella propel a cell through fluid using 
repetitive wavelike motion. The movement of a single 
flagellum on an invertebrate sperm is seen in a series of images 
captured by stroboscopic illumination at 400 flashes per second. 
(Courtesy of Charles J. Brokaw.) 


the entire cell, rather than moving fluid across the cell surface. Flagella 
propagate regular waves along their length, propelling the attached cell 
along (Figure 17-26). 


Despite these slight differences in operation, cilia and flagella share a 
similar internal structure. The microtubules in both cilia and flagella are 
arranged in an elaborate and distinctive pattern, which was one of the 
most striking revelations of early electron microscopy. A cross section 
through a cilium shows nine doublet microtubules arranged in a ring 
around a pair of single microtubules (Figure 17-27A). This “9 + 2” array 
is characteristic of almost all eukaryotic cilia and flagella—from those of 
protozoa to those in humans. 


The movement of a cilium or a flagellum is produced by bending that 
takes place as its microtubules slide against each other. The microtu- 
bules are associated with numerous accessory proteins (Figure 17-27B), 
which project at regular positions along the length of the microtubule 
bundle. Some of these proteins serve as cross-links to hold the bundle of 
microtubules together; others generate the force that causes the cilium 
or flagellum to bend. 


The most important of the accessory proteins is the motor protein cili- 
ary dynein, which generates the bending motion of the structure. Ciliary 
dynein is attached by its tail to one microtubule in each outer doublet, 
while its globular heads interact with the adjacent microtubule to gener- 
ate a sliding force between the two microtubules. Because of the multiple 
links that hold the adjacent microtubule doublets together, the sliding 
force between adjacent microtubules is converted to a bending motion 
(Figure 17-28). Other ciliary components, including the central pair, 
inner sheath, and radial spokes, control dynein activity, leading to the 
complex wave forms seen in cilia and flagella. 
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Figure 17-27 Microtubules in a cilium or flagellum are arranged in a “9 + 2” array. (A) Electron micrograph of a flagellum of the 
unicellular alga Chlamydomonas shown in cross section, illustrating the distinctive 9 + 2 arrangement of microtubules. (B) Diagram of 
the flagellum in cross section. The nine outer microtubules (each a special paired structure) carry two rows of dynein molecules. The 
heads of each dynein molecule appear in this view like arms reaching toward the adjacent doublet microtubule. In a living cilium, these 
dynein heads periodically make contact with the adjacent doublet microtubule and move along it, thereby producing the force for 
ciliary beating. The various other links and projections shown are proteins that serve to hold the bundle of microtubules together and to 
convert the sliding force produced by dyneins into bending, as illustrated in Figure 17-28. (A, courtesy of Lewis Tilney.) 
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Figure 17-28 The movement of dynein t 
causes the flagellum to bend. (A) If the 
outer doublet microtubules and their 
associated dynein molecules are freed from 
other components of a sperm flagellum and 
then exposed to ATP, the doublets slide 
against each other, telescope-fashion, due 
to the repetitive action of their associated 
dyneins. (B) In an intact flagellum, however, 
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In humans, hereditary defects in ciliary dynein cause Kartagener's syn- 
drome. Men with this disorder are infertile because their sperm are 
nonmotile, and they have an increased susceptibility to bronchial infec- 
tions because the cilia that line their respiratory tract are paralyzed and 
thus unable to clear bacteria and debris from the lungs. 
Many animal cells that lack beating cilia contain a single, nonmotile pri- 
mary cilium. This appendage is much shorter than a beating cilium and 
functions as an antenna for sensing certain extracellular signal molecules. 
ACTIN FILAMENTS 
Actin filaments, polymers of the protein actin, are present in most 
eukaryotic cells and are essential for many of the cell’s movements, 
especially those involving the cell surface. Without actin filaments, for 
example, an animal cell could not crawl along a surface, engulf a large 
QUESTION 17-4 particle by phagocytosis, or divide in two. Like microtubules, many actin 
filaments are unstable, but by associating with other proteins they can 
Dynein arms in a cilium are arranged also form stable structures in cells, such as the contractile apparatus of 
so that, when activated, the heads muscle cells. Actin filaments interact with a large number of actin-binding 
push their neighboring outer proteins that enable the filaments to serve a variety of functions in cells. 
doublet outward toward the tip of Depending on which of these proteins they associate with, actin filaments 
the cilium. Consider a cross section can form stiff and stable structures, such as the microvilli on the epithelial 
of a cilium (see Figure 17-27). Why cells lining the intestine (Figure 17-29A) or the small contractile bundles 
would no bending motion of the that can contract and act like tiny muscles in most animal cells (Figure 


cilium result if all dynein molecules 
were active at the same time? 
What pattern of dynein activity can 
account for the bending of a cilium 
in one direction? 


17-29B and E). They can also form temporary structures, such as the 
dynamic protrusions formed at the leading edge of a crawling cell (Figure 
17-29C) or the contractile ring that pinches the cytoplasm in two when an 
animal cell divides (Figure 17-29D). Actin-dependent movements usu- 
ally require actin’s association with a motor protein called myosin. 


In this section, we see how the arrangements of actin filaments in a cell 
depend on the types of actin-binding proteins present. Even though actin 
filaments and microtubules are formed from unrelated types of subunit 
proteins, we will see that the principles by which they assemble and dis- 
assemble, control cell structure, and work with motor proteins to bring 
about movement are strikingly similar. 


Actin Filaments Are Thin and Flexible 


Actin filaments appear in electron micrographs as threads about 7 nm 
in diameter. Each filament is a twisted chain of identical globular actin 
monomers, all of which “point” in the same direction along the axis of 
the chain. Like a microtubule, therefore, an actin filament has a structural 
polarity, with a plus end and a minus end (Figure 17-30). 


Actin filaments are thinner, more flexible, and usually shorter than micro- 
tubules. There are, however, many more of them, so that the total length 
of all of the actin filaments in a cell is generally many times greater than 
the total length of all of the microtubules. Unlike intermediate filaments 
and microtubules, actin filaments rarely occur in isolation in the cell; they 
are generally found in cross-linked bundles and networks, which are 
much stronger than the individual filaments. 


Actin and Tubulin Polymerize by Similar Mechanisms 


Like microtubles, actin filaments can grow by the addition of monomers 
at either end but their rate of growth is faster at the plus end than at the 
minus end. A naked actin filament, like a microtubule without associated 
proteins, is inherently unstable, and it can disassemble from both ends. In 
living cells, free actin monomers carry a tightly bound nucleoside triphos- 
phate, in this case ATP. The actin monomer hydrolyzes its bound ATP to 
ADP soon after it is incorporated into the filament. As with the hydrolysis 
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Figure 17-29 Actin filaments allow animal 
cells to adopt a variety of shapes and 
perform a variety of functions. The actin 
filaments in four different structures are 
shown here in red: (A) microvilli; 

(B) contractile bundles in the cytoplasm; 

(C) fingerlike filopodia protruding from the 
leading edge of a moving cell; and 

(D) contractile ring during cell division. 

(E) Micrograph of a cell in which contractile 
bundles of actin, like those in (B), are 
stained with fluorescently labeled phalloidin, 
a molecule that binds specifically to actin 
filaments (see Table 17-2, p. 594). 

(E, courtesy of Nikon ® MicroscopyU.) 


Figure 17-30 Actin filaments are thin, 
flexible protein threads. (A) The subunit 
of each actin filament is an actin monomer. 
A cleft in the monomer provides a binding 
site for ATP or ADP. (B) Arrangement of 
actin monomers in an actin filament. Each 
filament may be thought of as a two- 
stranded helix with a twist repeating every 
37 nm. Multiple, lateral interactions between 
the two strands prevent the strands from 
separating. (C) Close-up view showing 

the extensive interactions between the 
two strands of the actin filament. 

(D) Electron micrograph of a negatively 
stained actin filament. (D, courtesy of 
Roger Craig.) 
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Figure 17-31 Actin filaments can undergo 
treadmilling. (A) Actin monomers in the 
cytosol carry ATP, which is hydrolyzed to 
ADP soon after assembly into a growing 
filament. The ADP molecules remain 
trapped within the actin filament, unable to 
exchange with ATP until the actin monomer 
that carries them dissociates from the 
filament. When ATP-actin (dark red) adds 
to the plus end of an actin filament at the 
same rate that ADP-actin (light red) is lost 
from the minus end, treadmilling occurs. 

(B) When the rates of addition and loss are 
equal, the filament stays the same length— 
although individual actin monomers (three 
of which are numbered here) move through 
the filament from the plus to the minus end. 


QUESTION 17-5 


The formation of actin filaments in 
the cytosol is controlled by actin- 
binding proteins. Some actin-binding 
proteins significantly increase the 
rate at which the formation of an 
actin filament is initiated. Suggest 

a mechanism by which they might 
do this. 


(B) TREADMILLING 


of GTP to GDP in a microtubule, hydrolysis of ATP to ADP in an actin fila- 
ment reduces the strength of binding between the monomers, thereby 
decreasing the stability of the polymer. Thus in both cases, nucleotide 
hydrolysis promotes depolymerization, helping the cell to disassemble its 
microtubules and actin filaments after they have formed. 


If the concentration of free actin monomers is very high, an actin fila- 
ment will grow rapidly, adding monomers at both ends. At intermediate 
concentrations of free actin, however, something interesting takes place. 
Actin monomers add to the plus end at a rate faster than the bound ATP 
can be hydrolyzed, so the plus end grows. At the minus end, by contrast, 
ATP is hydrolyzed faster than new monomers can be added; because 
ADP-actin destabilizes the structure, the filament loses subunits from 
its minus end at the same time as it adds them to the plus end (Figure 
17-31A). Individual monomers thus move through the filament from the 
plus to the minus end, a behavior called treadmilling. When the rates of 
addition and loss are equal, the filament remains the same size (Figure 
17-31B). 


Actin filament function can be perturbed experimentally by certain tox- 
ins produced by fungi or marine sponges. Some, such as cytochalasin 
and Jatrunculin, prevent actin polymerization; others, such as phalloidin, 
stabilize actin filaments against depolymerization (Table 17-2). Addition 
of these toxins to cells or tissues, even in low concentrations, instanta- 
neously freezes cell movements such as cell locomotion. Thus, as with 
microtubules, many of the functions of actin filaments depend on the 
ability of the filament to assemble and disassemble, the rates of which 
depend on the dynamic equilibrium between the actin filaments, the pool 
of actin monomers, and various actin-binding proteins. 


Many Proteins Bind to Actin and Modify Its Properties 


About 5% of the total protein in a typical animal cell is actin; about half 
of this actin is assembled into filaments, and the other half remains as 
actin monomers in the cytosol. With such a high concentration of actin 
monomers—much higher than the concentration required for purified 
actin monomers to polymerize spontaneously in a test tube—what, then, 


TABLE 17-2 DRUGS THAT AFFECT FILAMENTS 


Actin-specific Drugs 


Phalloidin Binds and stabilizes filaments 
Cytochalasin Caps filament plus ends, preventing polymerization there 
Latrunculin Binds actin monomers and prevents their polymerization 


keeps the actin monomers in cells from polymerizing totally into fila- 
ments? The answer is that cells contain small proteins, such as thymosin 
and profilin, that bind to actin monomers in the cytosol, preventing them 
from adding to the ends of actin filaments. By keeping actin monomers 
in reserve until they are required, these proteins play a crucial role in 
regulating actin polymerization. When actin filaments are needed, other 
actin-binding proteins such as formins and actin-related proteins (ARPs) 
promote actin polymerization. 


There are a great many actin-binding proteins in cells. Most of these 
bind to assembled actin filaments and control their behavior (Figure 
17-32). Actin-bundling proteins, for example, hold actin filaments 
together in parallel bundles in microvilli; others cross-link actin filaments 
together in a gel-like meshwork within the cell cortex—the specialized 
layer of actin-filament-rich cytoplasm just beneath the plasma mem- 
brane. Filament-severing proteins fragment actin filaments into shorter 
lengths and thus can convert an actin gel to a more fluid state. Actin 
filaments can also associate with myosin motor proteins to form contrac- 
tile bundles, as in muscle cells. And they often form tracks along which 
myosin motor proteins transport organelles, a function that is especially 
conspicuous in plant cells. 


In the remainder of this chapter, we consider some characteristic struc- 
tures that actin filaments can form, and we discuss how different types 
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Figure 17-32 Actin-binding proteins 
control the behavior of actin filaments 

in vertebrate cells. Actin is shown in red, 
and the actin-binding proteins are shown in 
green. 
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of actin-binding proteins are involved in their assembly. We begin with 
the cell cortex and its role in cell shape and movement, and we conclude 
with the contractile apparatus of muscle cells. 


A Cortex Rich in Actin Filaments Underlies the Plasma 
Membrane of Most Eukaryotic Cells 


Although actin is found throughout the cytoplasm of a eukaryotic cell, 
in many cells it is highly concentrated in a layer just beneath the plasma 
membrane. In this region, called the cell cortex, actin filaments are linked 
by actin-binding proteins into a meshwork that supports the plasma 
membrane and gives it mechanical strength. In human red blood cells, 
for example, a simple and regular network of fibrous proteins—includ- 
ing actin and spectrin filaments—attaches to the plasma membrane, 
providing the support necessary for the cells to maintain their simple dis- 
coid shape (see Figure 11-29). In other animal cells, the cortex includes 
a much denser network of actin filaments that are cross-linked into a 
three-dimensional meshwork. The rearrangements of actin filaments 
within the cortex provide much of the molecular basis for changes in 
both cell shape and cell locomotion. 


Cell Crawling Depends on Cortical Actin 


Many eukaryotic cells move by pulling themselves across surfaces, rather 
than by swimming by means of beating cilia or flagella. Carnivorous 
amoebae crawl along in search of food. The advancing tip of a devel- 
oping axon migrates in response to growth factors, following a path of 
chemical signals to its eventual synaptic target cells. White blood cells 
known as neutrophils migrate out of the blood into infected tissues when 
they “smell” small molecules released by bacteria, which the neutrophils 
seek out and destroy. For these cells, chemotactic molecules binding to 
receptors on the cell surface trigger changes in actin filament assembly 
that help steer the cells toward their targets (see Movie 17.7). 


The molecular mechanisms of these and other forms of cell crawling 
entail coordinated changes among many molecules in different regions of 
the cell; there is no single, easily identifiable locomotory organelle, such 
as a flagellum, responsible. In broad terms, however, three interrelated 
processes are known to be essential: (1) the cell sends out protrusions 
at its “front,” or leading edge; (2) these protrusions adhere to the surface 
over which the cell is crawling; and (3) the rest of the cell drags itself for- 
ward by traction on these points of anchorage (Figure 17-33). 


All three processes involve actin, but in different ways. The first step, the 
protrusion of the cell surface, is driven by actin polymerization. A crawling 
fibroblast in culture regularly extends thin, flattened lamellipodia (from 
the Latin for “sheet feet”) at its leading edge. These extensions contain a 
dense meshwork of actin filaments, oriented so that most of the filaments 
have their plus ends close to the plasma membrane. Many cells also 
extend thin, stiff protrusions called filopodia (from the Latin for “thread 
feet”), both at the leading edge and elsewhere on their surface (Figure 
17-34). These are about 0.1 um wide and 5-10 um long, and each con- 
tains a loose bundle of 10-20 actin filaments (see Figure 17-29C), again 
oriented with their plus ends pointing outward. The advancing tip (growth 
cone) of a developing nerve cell axon extends even longer filopodia, up to 
50 um long, which help it to probe its environment and find the correct 
path to its target cell. Lamellipodia and filopodia are both exploratory, 
motile structures that form and retract with great speed, moving at 
around 1 um per second. These protrusions are thought to be generated 
by the rapid, local growth of actin filaments, which assemble close to the 
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plasma membrane and elongate by the addition of actin monomers at 
their plus ends. In this way, the filaments push out the membrane without 
tearing it. 


When the lamellipodia and filopodia touch down on a favorable surface, 
they stick: transmembrane proteins in their plasma membrane, known 
as integrins (discussed in Chapter 20), adhere to molecules either in the 
extracellular matrix or on the surface of a neighboring cell over which 
the moving cell is slithering. Meanwhile, on the intracellular face of the 
crawling cell’s plasma membrane, integrins capture actin filaments in the 
cortex, thereby creating a robust anchorage for the crawling cell (see 
Figures 17-33 and 20-14C). To use this anchorage to drag its body for- 
ward, the cell calls on the help of myosin motor proteins, which slide 
along actin filaments, as we discuss shortly. 
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Figure 17-33 Forces generated in the 
actin-filament-rich cortex help move a 
cell forward. Actin polymerization at the 
leading edge of the cell pushes the plasma 
membrane forward (protrusion) and forms 
new regions of actin cortex, shown here in 
red. New points of anchorage are made 
between the bottom of the cell and the 
surface (substratum) on which the cell is 
crawling (attachment). Contraction at the 
rear of the cell—mediated by myosin motor 
proteins moving along actin filaments—then 
draws the body of the cell forward. New 
anchorage points are established at the 
front, and old ones are released at the back, 
as the cell crawls forward. The same cycle is 
repeated over and over again, moving the 
cell forward in a stepwise fashion. 


QUESTION 17—6 


Suppose that the actin molecules 
in a cultured skin cell have been 
randomly labeled in such a way 
that 1 in 10,000 molecules carries 
a fluorescent marker. What would 
you expect to see if you examined 
the lamellipodium (leading edge) 
of this cell through a fluorescence 
microscope? Assume that your 
microscope is sensitive enough to 
detect single fluorescent molecules. 
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Figure 17-34 Actin filaments allow animal cells to migrate. (A) Schematic drawing of a fibroblast, showing flattened lamellipodia 
and fine filopodia projecting from its surface, especially in the regions of the leading edge. (B) Scanning electron micrograph showing 
lamellipodia and filopodia at the leading edge of a human fibroblast migrating in culture; the arrow shows the direction of cell 
movement. As the cell moves forward, the lamellipodia that fail to attach to the substratum are swept backward over the upper surface 


of the cell— a movement referred to as ruffling. (B, courtesy of Julian Heath.) 
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Figure 17-35 A web of polymerizing actin 
filaments pushes the leading edge of a 
lamellipodium forward. (A) A highly motile 
keratocyte from frog skin was fixed, dried, 
and shadowed with platinum, and examined 
in an electron microscope. Actin filaments 
form a dense network, with the fast-growing 
plus ends of the filaments terminating at the 
leading margin of the lamellipodium (top 

of figure; Movie 17.8). (B) Drawing showing 
how the nucleation of new actin filaments 
(pink) is mediated by ARP complexes (light 
green) attached to the sides of preexisting 
actin filaments. The resulting branching 
structure pushes the plasma membrane 
forward. The plus ends of the actin filaments 
become protected from depolymerizing 

by capping proteins (dark green), while the 
minus ends of actin filaments nearer the 
center of the cell continually disassemble 
with the help of depolymerizing proteins 
(not shown). Because of this directional 
growth and disassembly, individual actin 
monomers move through this branched 
web in a rearward direction, while the web 
of actin as a whole undergoes a continual 
forward movement. This actin network is 
drawn to a different scale than the network 
shown in (A). Some pathogenic bacteria 
polymerize tails of actin filaments to move 
inside the cells they invade (Movie 17.9). (A, 
courtesy of Tatyana Svitkina and Gary Borisy.) 
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Actin-binding Proteins Influence the Type of Protrusions 
Formed at the Leading Edge 


The formation and growth of actin filaments at the leading edge of a 
cell are assisted by various actin-binding proteins. The actin-related pro- 
teins—or ARPs—mentioned earlier promote the formation of a web of 
branched actin filaments in lamellipodia. ARPs form complexes that bind 
to the sides of existing actin filaments and nucleate the formation of new 
filaments, which grow out at an angle to produce side branches. With the 
aid of additional actin-binding proteins, this web undergoes continual 
assembly at the leading edge and disassembly further back, pushing the 
lamellipodium forward (Figure 17-35). 


The other kind of cell protrusion, the filopodium, depends on formin, 
a nucleating protein that attaches to the growing plus ends of actin 
filaments and promotes the addition of new monomers to form straight, un- 
branched filaments. Formins are also used elsewhere to assemble 
unbranched filaments, such as in the contractile ring that pinches a divid- 
ing animal cell in two. 


Extracellular Signals Can Alter the Arrangement of 
Actin Filaments 


Actin-binding proteins control the location, organization, and behavior 
of actin filaments. The activities of these proteins are, in turn, controlled 
by extracellular signal molecules, allowing the cell to rearrange its actin 
cytoskeleton in response to its environment. These extracellular signals 
act through a variety of cell-surface receptor proteins, which activate var- 
ious intracellular signaling pathways. Many of these pathways converge 
on a group of closely related monomeric GTPases that are part of the Rho 
protein family. As discussed in Chapter 16, monomeric GTPases behave 
as molecular switches that control intracellular processes by cycling 
between an active GTP-bound state and an inactive GDP-bound state 
(see Figure 16-11B). 


In the case of the actin cytoskeleton, different Rho family members alter 
the organization of actin filaments in different ways. For example, one 
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(D) Cdc42 ACTIVATION 


Ld 

20 um 
Rho family member triggers the bundling of actin filaments and activation 
of the formin proteins that promote the formation of filopodia. Another 
Rho GTPase might stimulate ARP complexes at the cell’s leading edge, 
promoting the formation of lamellipodia and membrane ruffling. Finally, 
activation of the founding member of the Rho family drives the bundling 
of actin filaments with myosin motor proteins and the clustering of cell- 
surface integrins, actions that promote cell crawling (see Figure 17-33). 
Examples of these dramatic, Rho-driven cytoskeletal rearrangements are 
shown in Figure 17-36. 


Actin Associates with Myosin to Form Contractile 
Structures 


Perhaps the most familiar of all the actin-binding proteins is myosin. 
Myosins belong to a family of motor proteins that bind to and hydrolyze 
ATP, which provides the energy for their movement along actin filaments 
toward the plus end. Myosin, like actin, was first discovered in skeletal 
muscle, and much of what we know about the interaction of these two 
proteins was learned from studies of muscle. There are numerous types of 
myosins in cells, of which the myosin-I and myosin-II subfamilies are the 
most abundant. 


Myosin-I molecules, which are present in all cell types, have a head 
domain and a tail (Figure 17-37A). The head domain binds to an actin 
filament and has the ATP-hydrolyzing motor activity that enables it to 
move along the filament in a repetitive cycle of binding, detachment, 
and rebinding (Movie 17.10). The tail varies among the different types 
of myosin-I and determines what type of cargo the myosin will carry. 
For example, the tail may bind to a particular type of vesicle and propel 
it through the cell along actin filament tracks (Figure 17-378), or it may 
bind to the plasma membrane and pull it into a different shape (Figure 
17-36C). 


Myosin-II is structurally and mechanistically more complex than myo- 
sin-I. Muscle cells make use of a specialized form of myosin-II to drive 
muscle contraction, as we discuss next. 
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Figure 17-36 Activation of Rho-family 
GTPases can have a dramatic effect on 
the organization of actin filaments in 
fibroblasts. In these micrographs, actin is 
stained with fluorescently labeled phalloidin 
(see Table 17-2, p. 594). (A) Unstimulated 
fibroblasts have actin filaments primarily in 
the cortex. (B) Microinjection of an activated 
form of Rho promotes the rapid assembly 
of bundles of long, unbranched actin 
filaments; because myosin is associated 
with these bundles, they are contractile. 

(C) Microinjection of an activated form 

of Rac, a GTP-binding protein similar to 
Rho, causes the formation of an enormous 
lamellipodium that extends from the entire 
circumference of the cell. (D) Microinjection 
of an activated form of Cdc42, another Rho 
family member, stimulates the protrusion 
of a forest of filopodia at the cell periphery. 
(Courtesy of Catherine Nobes.) 


QUESTION 17-7 


At the leading edge of a crawling 
cell, the plus ends of actin filaments 
are located close to the plasma 
membrane, and actin monomers 
are added at these ends, pushing 
the membrane outward to form 
lamellipodia or filopodia. What do 
you suppose holds the filaments at 
their other ends to prevent them 
from just being pushed into the 
cell's interior? 
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Figure 17-37 Myosin-l is the simplest (A) myosin-| 
myosin. (A) Myosin-I has a single globular 
head that attaches to an actin filament and han a 


a tail that attaches to another molecule or 
organelle in the cell. (B) This arrangement head tail 


allows the head domain to move a vesicle domain 

relative to an actin filament, which in this 

case is anchored to the plasma membrane. (B) 

(C) Myosin-I can also bind to an actin 

filament in the cell cortex, ultimately pulling plus end minus end 
the plasma membrane into a new shape. yy yyy ya i i PA yy ytd 


Note that the head group always walks 
toward the plus end of the actin filament. 
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MUSCLE CONTRACTION 


Muscle contraction is the best understood of animal cell movements. In 
vertebrates, running, walking, swimming, and flying all depend on the 
ability of skeletal muscle to contract strongly and move various bones. 
Involuntary movements such as heart pumping and gut peristalsis 
depend on cardiac muscle and smooth muscle, respectively, which are 
formed from muscle cells that differ in structure from skeletal muscle but 
use actin and myosin in a similar way to contract. Much of our under- 
standing of the mechanisms of cell movement originated from studies of 
muscle cell contraction. In this section, we discuss how actin and myosin 
interact to produce this contraction. 


Muscle Contraction Depends on Interacting Filaments 
of Actin and Myosin 


Muscle myosin belongs to the myosin-II subfamily of myosins. These 
proteins are dimers, with two globular ATPase heads at one end and a 
single coiled-coil tail at the other (Figure 17-38A). Clusters of myosin-II 
molecules bind to each other through their coiled-coil tails, forming a 
bipolar myosin filament from which the heads project (Figure 17-38B). 


Figure 17-38 Myosin-Il molecules 

can associate with one another 

to form myosin filaments. (A) A (A) myosin-Il molecule 
molecule of myosin-Il contains two 

identical heavy chains, each with 


a globular head and an extended a 
tail. (It also contains two light chains NS 
bound to each head, but these are 


not shown.) The tails of the two heavy | | 
chains form a single coiled-coil tail. 150 nm 

(B) The coiled-coil tails of myosin-Il 
molecules associate with one another 
to form a bipolar myosin filament in 
which the heads project outward from 
the middle in opposite directions. 
The bare region in the middle of the 
filament consists of tails only. 1 um 


R 7 bare region 
(B) myosin-ll filament (myosin tails only) myosin heads 


minus end 


The myosin filament is like a double-headed arrow, with the two sets of 
myosin heads pointing outward, away from the middle. One set binds to 
actin filaments in one orientation and moves the filaments one way; the 
other set binds to other actin filaments in the opposite orientation and 
moves the filaments in the opposite direction. As a result, a myosin fila- 
ment slides sets of oppositely oriented actin filaments past one another 
(Figure 17-39). Thus, if actin filaments and myosin filaments are organ- 
ized together in a bundle, the bundle can generate a strong contractile 
force. This is seen most clearly in muscle contraction, but it also occurs in 
the much smaller contractile bundles of actin filaments and myosin-II fila- 
ments (see Figure 17—29B) that assemble transiently in nonmuscle cells, 
and in the contractile ring that pinches a dividing cell in two by contract- 
ing and pulling inward on the plasma membrane (see Figure 17-29D). 


Actin Filaments Slide Against Myosin Filaments During 
Muscle Contraction 


In most animals, skeletal muscle fibers are huge, multinucleated indi- 
vidual cells formed by the fusion of many separate smaller cells. The 
nuclei of the contributing cells are retained in the muscle fiber and lie just 
beneath the plasma membrane. The bulk of the cytoplasm is made up of 
myofibrils, the contractile elements of the muscle cell. These cylindrical 
structures are 1-2 um in diameter and may be as long as the muscle cell 
itself (Figure 17-40A). 


A myofibril consists of a chain of identical tiny contractile units, or 
sarcomeres. Each sarcomere is about 2.5 um long, and the repeating 
pattern of sarcomeres gives the vertebrate myofibril a striped appear- 
ance (Figure 17-40B). Sarcomeres are highly organized assemblies of 
two types of filaments—actin filaments and myosin filaments composed 
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Figure 17-40 A skeletal muscle cell is packed with myofibrils. 
(A) In an adult human, these huge, multinucleated cells (also 

called muscle fibers) are typically 50 um in diameter, and they can 
be several centimeters long. They contain numerous myofibrils, 

in which actin filaments and myosin-ll filaments are arranged in 

a highly organized structure, giving each myofibril—and skeletal 
muscle cell—a striated or striped appearance; for this reason, 
skeletal muscle is also called striated muscle. (B) Low-magnification 
electron micrograph of a longitudinal section through a skeletal 
muscle cell of a rabbit, showing that each myofibril consists of 
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Figure 17-39 A small, bipolar myosin-Il 
filament can slide two actin filaments 
of opposite orientation past each 
other. Similar sliding movement mediates 
the contraction of interacting actin and 
myosin-ll filaments in both muscle and 
nonmuscle cells. As with myosin-l, a 
myosin-ll head group walks toward the 
plus end of the actin filament with which 
it interacts. Note that multiple myosin 
molecules are required to generate 
movement: when one myosin head releases 
the filament to reposition itself, other 
myosins must remain attached so the 
structure does not fall apart. 


QUESTION 17-8 


If both the actin and myosin 
filaments of muscle are made 

up of subunits held together by 
weak noncovalent bonds, how is it 
possible for a human being to lift 
heavy objects? 


a repeating chain of sarcomeres, the contractile units of the 
myofibrils. (B, courtesy of Roger Craig.) 
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overlap Figure 17-41 Sarcomeres are the contractile units of muscle. 
region (A) Detail of the electron micrograph from Figure 17—40 showing two 


A 


myofibrils; the length of one sarcomere and the region where the actin 
and myosin filaments overlap are indicated. (B) Schematic diagram 

of a single sarcomere showing the origin of the light and dark bands 
seen in the microscope. Z discs at either end of the sarcomere are 
attachment points for the plus ends of actin filaments. The centrally 
located thick filaments (green) are each composed of many myosin-ll 
molecules. The thin vertical line running down the center of the thick 
filament bundle in (A) corresponds to the bare regions of the myosin 
sarcomere ~2.5 um filaments, as seen in Figure 17—38B. (A, courtesy of Roger Craig.) 


a 
myofibrils 


thin filament thick filament 

at (myösin-ll) of a muscle-specific form of myosin-IIl. The myosin filaments (the thick 
filaments) are centrally positioned in each sarcomere, whereas the more 
slender actin filaments (the thin filaments) extend inward from each end 
of the sarcomere, where they are anchored by their plus ends to a struc- 
ae ix ture known as the Z disc. The minus ends of the actin filaments overlap 


(B) the ends of the myosin filaments (Figure 17-41). 


The contraction of a muscle cell is caused by a simultaneous shorten- 
ing of all the cell’s sarcomeres, which is caused by the actin filaments 
sliding past the myosin filaments, with no change in the length of either 
type of filament (Figure 17-42). The sliding motion is generated by myo- 
sin heads that project from the sides of the myosin filament and interact 
with adjacent actin filaments (see Figure 17-39). When a muscle is stimu- 
lated to contract, the myosin heads start to walk along the actin filament 
in repeated cycles of attachment and detachment. During each cycle, a 
myosin head binds and hydrolyzes one molecule of ATP. This causes a 
series of conformational changes that move the tip of the head by about 
5 nm along the actin filament toward the plus end. This movement, 
repeated with each round of ATP hydrolysis, propels the myosin molecule 
unidirectionally along the actin filament (Figure 17-43). In so doing, the 
myosin heads pull against the actin filament, causing it to slide against 
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Figure 17-42 Muscles contract by a sliding-filament mechanism. (A) The myosin and actin filaments of a sarcomere overlap with 
the same relative polarity on either side of the midline. Recall that actin filaments are anchored by their plus ends to the Z disc and 
that myosin filaments are bipolar. (B) During contraction, the actin and myosin filaments slide past each other. Although the filaments 
themselves remain the same length, the sarcomere to which they belong shortens. The sliding motion is driven by the myosin heads 
walking toward the plus ends of the adjacent actin filaments (Movie 17.11). 
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Muscle Contraction 


ATTACHED At the start of the cycle shown in this figure, 
a myosin head lacking a bound ATP or ADP is attached 
tightly to an actin filament in a rigor configuration (so 
named because it is responsible for rigor mortis, the 
rigidity of death). In an actively contracting muscle, this 
state is very short-lived, being rapidly terminated by the 
binding of a molecule of ATP to the myosin head. 


RELEASED A molecule of ATP binds to the large cleft on 
the “back” of the myosin head (that is, on the side 
furthest from the actin filament) and immediately causes 
a slight change in the conformation of the domains that 
make up the actin-binding site. This reduces the affinity 
of the head for actin and allows it to let go of the 
filament. (The space drawn here between the head and 
actin emphasizes this change, although in reality the 
head probably remains very close to the actin.) 


COCKED The cleft closes like a clam shell around the 
ATP molecule, triggering a large shape change that 
causes the head to be displaced along the actin filament 
by a distance of about 5 nm. Hydrolysis of ATP occurs, but 
the ADP and inorganic phosphate (P) produced remain 
tightly bound to the myosin head. Dotted lines show the 
position of myosin head prior to ATP hydrolysis. 


FORCE-GENERATING Weak binding of the myosin head 
to a new site on the actin filament causes release of the 
inorganic phosphate produced by ATP hydrolysis. This 
release triggers the power stroke—the force-generating 
change in shape during which the head regains its 
original conformation. In the course of the power stroke, 
the head loses its bound ADP, thereby returning to the 
start of a new cycle. 


ATTACHED At the end of the cycle, the myosin head is 
again bound tightly to the actin filament in a rigor 
configuration. Note that the head has moved to a new 
position on the actin filament, which has slid to the left 
along the myosin filament. 
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the myosin filament. The concerted action of many myosin heads pulling 
the actin and myosin filaments past each other causes the sarcomere to 
contract. After a contraction is completed, the myosin heads all lose con- 
tact with the actin filaments, and the muscle relaxes. 


A myosin filament has about 300 myosin heads. Each myosin head can 
attach and detach from actin about five times per second, allowing the 
myosin and actin filaments to slide past one another at speeds of up to 
15 um per second. This speed is sufficient to take a sarcomere from a fully 
extended state (3 um) to a fully contracted state (2 um) in less than one- 
tenth of a second. All of the sarcomeres of a muscle are coupled together 
and are triggered simultaneously by the signaling system we describe 
next, so the entire muscle contracts almost instantaneously. 


Figure 17-43 The head of a myosin-ll 
molecule walks along an actin filament 
through an ATP-dependent cycle of 
conformational changes. Two actin 
monomers are highlighted to make the 
movement of the actin filament easier to 
see. Movie 17.12 shows actin and myosin in 
action. (Based on l. Rayment et al., Science 
261:50-58, 1993.) 
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Figure 17-44 T tubules and the 
sarcoplasmic reticulum surround 

each myofibril. (A) Drawing of the two 
membrane systems that relay the signal 
to contract from the muscle cell plasma 
membrane to all of the myofibrils in the 
muscle cell (see Figure 17-40). (B) Electron 
micrograph showing a cross section of a 
T tubule and the adjacent sarcoplasmic 
reticulum compartments. (B, courtesy of 
Clara Franzini-Armstrong.) 


QUESTION 17-9 


Compare the structure of 
intermediate filaments with that of 
the myosin-ll filaments in skeletal 
muscle cells. What are the major 
similarities? What are the major 
differences? How do the differences 
in structure relate to their function? 
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Figure 17-45 Skeletal muscle 
contraction is triggered by 
the release of Ca2* from the 
sarcoplasmic reticulum into the 
cytosol. This schematic diagram 
shows how a Ca?*-release 
channel in the sarcoplasmic 
reticulum membrane is opened 
by a physical linkage to a 
voltage-gated Ca?* channel 

in the T-tubule membrane. 

The T-tubule membrane 

and sarcoplasmic reticulum 
membrane are drawn in the 
same orientation shown in the 
micrograph in Figure 17—44B. 
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Muscle Contraction Is Triggered by a Sudden Rise in 
Cytosolic Ca?+ 


The force-generating molecular interaction between myosin and actin 
filaments takes place only when the skeletal muscle receives a signal 
to contract from a motor neuron. The neurotransmitter released from 
the nerve terminal triggers an action potential in the muscle cell plasma 
membrane (as discussed in Chapter 12). This electrical excitation spreads 
in a matter of milliseconds into a series of membranous tubes, called 
transverse (or T) tubules, that extend inward from the plasma membrane 
around each myofibril. The electrical signal is then relayed to the sarco- 
plasmic reticulum, an adjacent sheath of interconnected flattened vesicles 
that surrounds each myofibril like a net stocking (Figure 17-44). 


The sarcoplasmic reticulum in muscle cells is a specialized region of the 
endoplasmic reticulum. It contains a very high concentration of Ca?*. In 
response to electrical excitation, which passes along the plasma mem- 
brane and to the T tubules, much of this Ca?* is released into the cytosol 
through a specialized set of ion channels that open in the sarcoplasmic 
reticulum membrane (Figure 17-45). As discussed in Chapter 16, Ca?* is 
widely used as an intracellular signal to relay a message from the exterior 
to the interior of cells. In muscle, the rise in cytosolic Ca** concentra- 
tion activates a molecular switch made of specialized accessory proteins 
closely associated with the actin filaments (Figure 17-46A). One of these 
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proteins is tropomyosin, a rigid, rod-shaped molecule that binds in the 
groove of the actin helix, where it prevents the myosin heads from asso- 
ciating with the actin filament. The other is troponin, a protein complex 
that includes a Ca?+-sensitive protein associated with the end of a tro- 
pomyosin molecule. When the concentration of Ca? in the cytosol rises, 
Ca?* binds to troponin and induces a change in its shape. This in turn 
causes the tropomyosin molecules to shift their positions slightly, allow- 
ing myosin heads to bind to the actin filaments, initiating contraction 
(Figure 17—46B). 


Because the signal from the plasma membrane is passed within millisec- 
onds (via the T tubules and sarcoplasmic reticulum) to every sarcomere 
in the cell, all the myofibrils in the cell contract at the same time. The 
increase in Ca?* in the cytosol is transient because, when the nerve signal 
terminates, the Ca?* is rapidly pumped back into the sarcoplasmic reticu- 
lum by abundant Ca?+ pumps in its membrane (discussed in Chapter 12). 
As soon as the Ca?+ concentration returns to the resting level, troponin 
and tropomyosin molecules move back to their original positions. This 
reconfiguration once again blocks myosin binding to actin filaments, 
thereby ending the contraction. 


Different Types of Muscle Cells Perform Different 
Functions 


The highly specialized contractile machinery in muscle cells is thought 
to have evolved from the simpler contractile bundles of myosin and actin 
filaments found in all eukaryotic cells. The myosin-II in nonmuscle cells 
is also activated by a rise in cytosolic Ca**, but the mechanism of activa- 
tion is different from that of the muscle-specific myosin-II. An increase in 
Ca?* leads to the phosphorylation of nonmuscle myosin-II, which alters 
the myosin conformation and enables it to interact with actin. A simi- 
lar activation mechanism operates in smooth muscle, which is present 
in the walls of the stomach, intestine, uterus, and arteries, and in many 
other structures that undergo slow and sustained involuntary contrac- 
tions. This mode of myosin activation is relatively slow, because time is 
needed for enzyme molecules to diffuse to the myosin heads and carry 
out the phosphorylation and subsequent dephosphorylation. However, 
this mechanism has the advantage that—unlike the mechanism used 
by skeletal muscle cells—it can be activated by a variety of extracellular 
signals: thus smooth muscle, for example, is triggered to contract by epi- 
nephrine, serotonin, prostaglandins, and several other signal molecules. 


In addition to skeletal and smooth muscle, other forms of muscle each 
perform a specific mechanical function. Heart—or cardiac—muscle, for 
instance, drives the circulation of blood. The heart contracts autono- 
mously for the entire life of the organism—some 3 billion (3 x 10%) times 
in an average human lifetime. Even subtle abnormalities in the actin or 
myosin of heart muscle can lead to serious disease. For example, muta- 
tions in the genes that encode cardiac myosin-II or other proteins in the 
sarcomere cause familial hypertrophic cardiomyopathy, a hereditary dis- 
order responsible for sudden death in young athletes. 
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Figure 17-46 Skeletal muscle contraction 
is controlled by tropomyosin and 
troponin complexes. (A) An actin filament 
in muscle showing the positions of 
tropomyosin and troponin complexes along 
the filament. Every tropomyosin molecule 
has seven evenly spaced regions with a 
similar amino acid sequence, each of which 
is thought to bind to an actin monomer in 
the filament. (B) This cross section of the 
muscle actin filament reveals how Ca?* 
binding to the troponin complex (not 
shown) leads to movement of tropomyosin 
away from the myosin-binding site. 


QUESTION 17-10 


A. Note that in Figure 17—46, 
troponin molecules are evenly 
spaced along an actin filament, 

with one troponin found every 
seventh actin molecule. How do you 
suppose troponin molecules can be 
positioned this regularly? What does 
this tell you about the binding of 
troponin to actin filaments? 

B. What do you suppose would 
happen if you mixed actin 

filaments with (i) troponin alone, 

(ii) tropomyosin alone, or 

(iii) troponin plus tropomyosin, and 
then added myosin? Would the 
effects be dependent on Ca?+? 
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CHAPTER 17 


Cytoskeleton 


The contraction of muscle cells represents a highly specialized use of the 
basic components of the eukaryotic cytoskeleton. In the following chap- 
ter, we discuss the crucial roles of the cytoskeleton in perhaps the most 
fundamental cell movements of all: the segregation of newly replicated 
chromosomes and the formation of two daughter cells during the process 
of cell division. 


ESSENTIAL CONCEPTS 


The cytoplasm of a eukaryotic cell is supported and organized by 
a cytoskeleton of intermediate filaments, microtubules, and actin 
filaments. 


Intermediate filaments are stable, ropelike polymers—built from 
fibrous protein subunits—that give cells mechanical strength. Some 
intermediate filaments form the nuclear lamina that supports and 
strengthens the nuclear envelope; others are distributed throughout 
the cytoplasm. 


Microtubules are stiff, hollow tubes formed by globular tubulin 
dimers. They are polarized structures, with a slow-growing minus 
end and a fast-growing plus end. 


Microtubules grow out from organizing centers such as the centro- 
some, in which the minus ends remain embedded. 


Many microtubules display dynamic instability, alternating rap- 
idly between growth and shrinkage. Shrinkage is promoted by the 
hydrolysis of the GTP that is tightly bound to tubulin dimers, reducing 
the affinity of the dimers for their neighbors and thereby promoting 
microtubule disassembly. 


Microtubules can be stabilized by localized proteins that capture the 
plus ends, thereby helping to position the microtubules and harness 
them for specific functions. 


Kinesins and dyneins are microtubule-associated motor proteins 
that use the energy of ATP hydrolysis to move unidirectionally along 
microtubules. They carry specific organelles, vesicles, and other 
types of cargo to particular locations in the cell. 


Eukaryotic cilia and flagella contain a bundle of stable microtubules. 
Their rhythmic beating is caused by bending of the microtubules, 
driven by the ciliary dynein motor protein. 


Actin filaments are helical polymers of globular actin monomers. 
They are more flexible than microtubules and are generally found in 
bundles or networks. 


Like microtubules, actin filaments are polarized, with a fast-growing 
plus end and a slow-growing minus end. Their assembly and disas- 
sembly are controlled by the hydrolysis of ATP tightly bound to each 
actin monomer and by various actin-binding proteins. 


The varied arrangements and functions of actin filaments in cells 
stem from the diversity of actin-binding proteins, which can control 
actin polymerization, cross-link actin filaments into loose networks 
or stiff bundles, attach actin filaments to membranes, or move two 
adjacent filaments relative to each other. 


A concentrated network of actin filaments underneath the plasma 
membrane forms the bulk of the cell cortex, which is responsible 
for the shape and movement of the cell surface, including the move- 
ments involved when a cell crawls along a surface. 


Myosins are motor proteins that use the energy of ATP hydrolysis to 
move along actin filaments. In nonmuscle cells, myosin-I can carry 
organelles or vesicles along actin-filament tracks, and myosin-II can 
cause adjacent actin filaments to slide past each other in contractile 
bundles. 
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e In skeletal muscle cells, repeating arrays of overlapping filaments of 
actin and myosin-II form highly ordered myofibrils, which contract as 


these filaments slide past each other. 


e Muscle contraction is initiated by a sudden rise in cytosolic Ca?*, 
which delivers a signal to the myofibrils via Ca**-binding proteins 


associated with the actin filaments. 


KEY TERMS 


actin-binding protein 
actin filament 


lamellipodium 
microtubule 


cell cortex 

centriole motor protein 
centrosome myofibril 
cilium myosin 
cytoskeleton myosin-l 
dynamic instability myosin-ll 


dynein myosin filament 
filopodium nuclear lamina 
flagellum polarity 
intermediate filament Rho protein family 
keratin filament sarcomere 

kinesin tubulin 


QUESTIONS 


QUESTION 17-11 


Which of the following statements are correct? Explain your 
answers. 


A. Kinesin moves endoplasmic reticulum (ER) membranes 
along microtubules so that the network of ER tubules 
becomes stretched throughout the cell. 


B. Without actin, cells can form a functional mitotic spindle 
and pull their chromosomes apart but cannot divide. 


C. Lamellipodia and filopodia are “feelers” that a cell 
extends to find anchor points on the substratum that it will 
then crawl over. 


D. GTP is hydrolyzed by tubulin to cause the bending of 
flagella. 


E. Cells having an intermediate-filament network that 
cannot be depolymerized would die. 


F. The plus ends of microtubules grow faster because they 
have a larger GTP cap. 


G. The transverse tubules in muscle cells are an extension 
of the plasma membrane, with which they are continuous; 
similarly, the sarcoplasmic reticulum is an extension of the 
endoplasmic reticulum. 


H. Activation of myosin movement on actin filaments is 
triggered by the phosphorylation of troponin in some 
situations and by Ca?* binding to troponin in others. 
QUESTION 17-12 


The average time taken for a molecule or an organelle to 


microtubule-associated protein 


diffuse a distance of x cm is given by the formula 

t = x2/2D 
where t is the time in seconds and D is a constant called the 
diffusion coefficient for the molecule or particle. Using the 
above formula, calculate the time it would take for a small 
molecule, a protein, and a membrane vesicle to diffuse from 
one side to another of a cell 10 um across. Typical diffusion 
coefficients in units of cm2/sec are: small molecule, 5 x 10%; 
protein molecule, 5 x 1077; vesicle, 5 x 10-8. How long 
would a membrane vesicle take to reach the end of an axon 
10 cm long by free diffusion? How long would it take if it 
was transported along microtubules at 1 m/sec? 


QUESTION 17-13 


Why do eukaryotic cells, and especially animal cells, have 
such large and complex cytoskeletons? List the differences 
between animal cells and bacteria that depend on the 
eukaryotic cytoskeleton. 


QUESTION 17-14 


Examine the structure of an intermediate filament shown in 
Figure 17-4. Does the filament have a unique polarity—that 
is, could you distinguish one end from the other by chemical 
or other means? Explain your answer. 


QUESTION 17-15 


There are no known motor proteins that move on 
intermediate filaments. Suggest an explanation for this. 
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QUESTION 17-16 


When cells enter mitosis, their existing array of cytoplasmic 
microtubules has to be rapidly broken down and replaced 
with the mitotic spindle that forms to pull the chromosomes 
into the daughter cells. The enzyme katanin, named after 
Japanese samurai swords, is activated during the onset of 
mitosis, and chops microtubules into short pieces. What 

do you suppose is the fate of the microtubule fragments 
created by katanin? Explain your answer. 


QUESTION 17-17 


The drug Taxol, extracted from the bark of yew trees, has 
an opposite effect to the drug colchicine, an alkaloid from 
autumn crocus. Taxol binds tightly to microtubules and 
stabilizes them; when added to cells, it causes much of the 
free tubulin to assemble into microtubules. In contrast, 
colchicine prevents microtubule formation. Taxol is just 

as pernicious to dividing cells as colchicine, and both are 
used as anticancer drugs. Based on your knowledge of 
microtubule dynamics, suggest why both drugs are toxic to 
dividing cells despite their opposite actions. 


QUESTION 17-18 


A useful technique for studying microtubule motors is to 
attach them by their tails to a glass cover slip (which can 

be accomplished quite easily because the tails stick avidly 
to a clean glass surface) and then allow them to settle. 
Microtubules may then be viewed in a light microscope as 
they are propelled over the surface of the cover slip by the 
heads of the motor proteins. Because the motor proteins 
attach at random orientations to the cover slip, however, 
how can they generate coordinated movement of individual 
microtubules rather than engaging in a tug-of-war? In which 
direction will microtubules crawl on a “bed” of kinesin 
molecules (i.e., will they move plus-end first or minus-end 
first)? 


QUESTION 17-19 


A typical time course of polymerization of purified tubulin to 
form microtubules is shown in Figure Q17-19. 


A. Explain the different parts of the curve (labeled A, B, 
and C). Draw a diagram that shows the behavior of tubulin 
subunits in each of the three phases. 


B. How would the curve in the figure change if centrosomes 
were added at the outset? 
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(a) 
Figure Q17-20 


QUESTION 17-20 


The electron micrographs shown in Figure Q17-20A 

were obtained from a population of microtubules that 

were growing rapidly. Figure Q17-20B was obtained 

from microtubules undergoing “catastrophic” shrinking. 
Comment on any differences between A and B, and suggest 
likely explanations for the differences that you observe. 


QUESTION 17-21 


The locomotion of fibroblasts in culture is immediately 
halted by the drug cytochalasin, whereas colchicine causes 
fibroblasts to cease to move directionally and to begin 
extending lamellipodia in seemingly random directions. 
Injection of fibroblasts with antibodies to the intermediate 
filament protein vimentin has no discernible effect on their 
migration. What do these observations suggest to you 
about the involvement of the three different cytoskeletal 
filaments in fibroblast locomotion? 


QUESTION 17-22 


Complete the following sentence accurately, explaining your 
reason for accepting or rejecting each of the four phrases 
(more than one can be correct). The role of calcium in 
muscle contraction is: 


A. to detach myosin heads from actin. 


B. to spread the action potential from the plasma 
membrane to the contractile machinery. 


C. to bind to troponin, cause it to move tropomyosin, and 
thereby expose actin filaments to myosin heads. 


D. to maintain the structure of the myosin filament. 


QUESTION 17-23 

Which of the following changes takes place when a skeletal 
muscle contracts? 

A. Z discs move farther apart. 

B. Actin filaments contract. 

C. Myosin filaments contract. 


D. Sarcomeres become shorter. 
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The Cell-Division Cycle 


“Where a cell arises, there must be a previous cell, just as animals can 
only arise from animals and plants from plants.” This statement, which 
appears in a book written by German pathologist Rudolf Virchow in 1858, 
carries with it a profound message for the continuity of life. If every cell 
comes from a previous cell, then all living organisms—from a unicellular 
bacterium to a multicellular mammal—are products of repeated rounds 
of cell growth and division, stretching back to the beginnings of life more 
than 3 billion years ago. 


A cell reproduces by carrying out an orderly sequence of events in which 
it duplicates its contents and then divides in two. This cycle of duplica- 
tion and division, known as the cell cycle, is the essential mechanism 
by which all living things reproduce. The details of the cell cycle vary 
from organism to organism and at different times in an individual organ- 
ism’s life. In unicellular organisms, such as bacteria and yeasts, each cell 
division produces a complete new organism, whereas many rounds of 
cell division are required to make a new multicellular organism from a 
fertilized egg. Certain features of the cell cycle, however, are universal, 
as they allow every cell to perform the fundamental task of copying and 
passing on its genetic information to the next generation of cells. 


To explain how cells reproduce, we have to consider three major 
questions: (1) How do cells duplicate their contents—including the chro- 
mosomes, which carry the genetic information? (2) How do they partition 
the duplicated contents and split in two? (3) How do they coordinate all 
the steps and machinery required for these two processes? The first ques- 
tion is considered elsewhere in this book: in Chapter 6, we discuss how 
DNA is replicated, and in Chapters 7, 11, 15, and 17, we describe how 
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the eukaryotic cell manufactures its numerous other components, such 
as proteins, membranes, organelles, and cytoskeletal filaments. In this 
chapter, we tackle the second and third questions: how a eukaryotic cell 
distributes—or segregates—its duplicated contents to produce two geneti- 
cally identical daughter cells, and how it coordinates the various steps of 
this reproductive cycle. 


We begin with an overview of the events that take place during a typical 
QUESTION 18-1 cell cycle. We then describe the complex system of regulatory proteins 

called the cell-cycle control system, which orders and coordinates these 
Consider the following statement: events to ensure that they occur in the correct sequence. We next discuss 
"All present-day cells have arisen in detail the major stages of the cell cycle, in which the chromosomes are 
by an uninterrupted series of cell duplicated and then segregated into the two daughter cells. At the end of 
divisions extending back in time to the chapter, we consider how animals use extracellular signals to control 


the first cell division.” Is this strictly 


a the survival, growth, and division of their cells. These signaling systems 
rues 


allow an animal to regulate the size and number of its cells—and, ulti- 
mately, the size and form of the organism itself. 


OVERVIEW OF THE CELL CYCLE 


The most basic function of the cell cycle is to duplicate accurately the 
vast amount of DNA in the chromosomes and then to segregate the DNA 
into genetically identical daughter cells such that each cell receives a 
complete copy of the entire genome (Figure 18-1). In most cases, a cell 
also duplicates its other macromolecules and organelles and doubles in 
size before it reproduces; otherwise, each time a cell divided, it would get 
smaller and smaller. Thus, to maintain their size, proliferating cells coor- 
dinate their growth with their division. We return to the topic of cell-size 
control later in the chapter; here, we focus on cell division. 


The duration of the cell cycle varies greatly from one cell type to another. 
In an early frog embryo, cells divide every 30 minutes, whereas a mam- 
malian fibroblast in culture divides about once a day (Table 18-1). In this 
section, we describe briefly the sequence of events that occur in prolifer- 
ating mammalian cells. We then introduce the cell-cycle control system 
that ensures that the various events of the cycle take place in the correct 
sequence and at the correct time. 


©}... cells 
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Figure 18-1 Cells reproduce by CELL 1 CELL GROWTH 
duplicating their contents and dividing in CYCLE AND CHROMOSOME 
two in a process called the cell cycle. For DUPLICATION 


simplicity, we use a hypothetical eukaryotic 
cell—which has only one copy each of two 
different chromosomes—to illustrate how 


each cell cycle produces two genetically 

identical daughter cells. Each daughter cell 2 CHROMOSOME 
can divide again by going through another SEGREGATION 
cell cycle, and so on for generation after 

generation. 


TABLE 18-1 SOME EUKARYOTIC CELL-CYCLE DURATIONS 
Cell Type Duration of Cell Cycle 


Early fly embryo cells 8 minutes 
Early frog embryo cells 30 minutes 
Mammalian intestinal epithelial cells ~12 hours 
Mammalian fibroblasts in culture ~20 hours 


The Eukaryotic Cell Cycle Usually Includes Four Phases 


Seen in a microscope, the two most dramatic events in the cell cycle are 
when the nucleus divides, a process called mitosis, and when the cell 
itself then splits in two, a process called cytokinesis. These two processes 
together constitute the M phase of the cycle. In a typical mammalian cell, 
the whole of M phase takes about an hour, which is only a small fraction 
of the total cell-cycle time (see Table 18-1). 


The period between one M phase and the next is called interphase. 
Viewed with a microscope, it appears, deceptively, as an uneventful 
interlude during which the cell simply increases in size. Interphase, how- 
ever, is a very busy time for a proliferating cell, and it encompasses the 
remaining three phases of the cell cycle. During S phase (S = synthesis), 
the cell replicates its DNA. S phase is flanked by two “gap” phases—called 
G, phase and Gz phase—during which the cell continues to grow (Figure 
18-2). During these gap phases, the cell monitors both its internal state 
and external environment. This monitoring ensures that conditions are 
suitable for reproduction and that preparations are complete before the 
cell commits to the major upheavals of S phase (which follows G;) and 
mitosis (following G2). At particular points in G; and Gz, the cell decides 
whether to proceed to the next phase or pause to allow more time to 
prepare. 


During all of interphase, a cell generally continues to transcribe genes, 
synthesize proteins, and grow in mass. Together with S phase, G; and 
G2 provide the time needed for the cell to enlarge and to duplicate its 
cytoplasmic organelles. If interphase lasted only long enough for DNA 
replication, the cell would not have time to double its mass before it 
divided and would consequently shrink with each division. Indeed, in 
some special circumstances that is exactly what happens. In an early frog 
embryo, for example, the first cell divisions after fertilization (called cleav- 
age divisions) serve to subdivide the giant egg cell into many smaller cells 
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Figure 18-2 The eukaryotic cell cycle 
usually occurs in four phases. The cell 
grows continuously during interphase, which 
consists of three phases: Gi, S, and G2. DNA 
replication is confined to S phase. G is the 
gap between M phase and S phase, and 

G2 is the gap between S phase and 

M phase. During M phase, the nucleus 
divides in a process called mitosis; then 

the cytoplasm divides, in a process 

called cytokinesis. In this figure—and in 
subsequent figures in the chapter—the 
lengths of the various phases are not drawn 
to scale: M phase, for example, is typically 
much shorter and G; much longer than 
shown. 
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QUESTION 18-2 


A population of proliferating cells 
is stained with a dye that becomes 
fluorescent when it binds to DNA, so 
that the amount of fluorescence is 
directly proportional to the amount 
of DNA in each cell. To measure 
the amount of DNA in each cell, 
the cells are then passed through 

a flow cytometer, an instrument 
that measures the amount of 
fluorescence in individual cells. The 
number of cells with a given DNA 
content is plotted on the graph 
below. 


number of cells 


0 1 2 
relative amount of 
DNA per cell 


Indicate on the graph where you 
would expect to find cells that are in 
G1, S, G2, and mitosis. Which is the 
longest phase of the cell cycle in this 
population of cells? 


Figure 18-3 The cell-cycle control system 
ensures that key processes in the cycle 
occur in the proper sequence. The cell- 
cycle control system is shown as a controller 
arm that rotates clockwise, triggering 
essential processes when it reaches 
particular transition points on the outer dial. 
These processes include DNA replication in 
S phase and the segregation of duplicated 
chromosomes in mitosis. The control system 
can transiently halt the cycle at specific 
transition points—in G1, G2, and M phase— 
if extracellular or intracellular conditions are 
unfavorable. 


The Cell-Division Cycle 


as quickly as possible (see Table 18-1). In such embryonic cell cycles, the 
Gı and Gz phases are drastically shortened, and the cells do not grow 
before they divide. 


A Cell-Cycle Control System Triggers the Major 
Processes of the Cell Cycle 


To ensure that they replicate all their DNA and organelles, and divide 
in an orderly manner, eukaryotic cells possess a complex network of 
regulatory proteins known as the cell-cycle control system. This system 
guarantees that the events of the cell cycle—DNA replication, mitosis, 
and so on—occur in a set sequence and that each process has been com- 
pleted before the next one begins. To accomplish this organizational feat, 
the control system is itself regulated at certain critical points of the cycle 
by feedback from the process currently being performed. Without such 
feedback, an interruption or a delay in any of the processes could be dis- 
astrous. All of the nuclear DNA, for example, must be replicated before 
the nucleus begins to divide, which means that a complete S phase must 
precede M phase. If DNA synthesis is slowed down or stalled, mitosis 
and cell division must also be delayed. Similarly, if DNA is damaged, the 
cycle must be put on hold in Gj, S, or Gz so that the cell can repair the 
damage, either before DNA replication is started or completed or before 
the cell enters M phase. The cell-cycle control system achieves all of this 
by employing a set of molecular brakes, sometimes called checkpoints, to 
pause the cycle at certain transition points. In this way, the control sys- 
tem does not trigger the next step in the cycle unless the cell is properly 
prepared. 


The cell-cycle control system regulates progression through the cell cycle 
at three main transition points (Figure 18-3). At the transition from G to 
S phase, the control system confirms that the environment is favorable 
for proliferation before committing to DNA replication. Cell proliferation 
in animals requires both sufficient nutrients and specific signal molecules 
in the extracellular environment; if these extracellular conditions are 
unfavorable, cells can delay progress through G; and may even enter 
a specialized resting state known as Go (G zero). At the transition from 
G2 to M phase, the control system confirms that the DNA is undamaged 
and fully replicated, ensuring that the cell does not enter mitosis unless 
its DNA is intact. Finally, during mitosis, the cell-cycle control machinery 


Are all chromosomes properly 


Is all DNA replicated? WEES [AO] oN 
attached to the mitotic spindle? 


Is all DNA damage repaired? 


CONTROLLER 


Is environment favorable? 


ensures that the duplicated chromosomes are properly attached to a 
cytoskeletal machine, called the mitotic spindle, before the spindle pulls 
the chromosomes apart and segregates them into the two daughter cells. 


In animals, the transition from G to S phase is especially important as a 
point in the cell cycle where the control system is regulated. Signals from 
other cells stimulate cell proliferation when more cells are needed—and 
block it when they are not. The cell-cycle control system therefore plays 
a central part in the regulation of cell numbers in the tissues of the body; 
if the control system malfunctions such that cell division is excessive, 
cancer can result. We discuss later how extracellular signals influence the 
decisions made at the G,-to-S transition. 


Cell-Cycle Control Is Similar in All Eukaryotes 


Some features of the cell cycle, including the time required to complete 
certain events, vary greatly from one cell type to another, even within the 
same organism. The basic organization of the cycle, however, is essen- 
tially the same in all eukaryotic cells, and all eukaryotes appear to use 
similar machinery and control mechanisms to drive and regulate cell- 
cycle events. The proteins of the cell-cycle control system first appeared 
more than a billion years ago, and they have been so well conserved 
over the course of evolution that many of them function perfectly when 
transferred from a human cell to a yeast (see How We Know, pp. 30-31). 


Because of this similarity, biologists can study the cell cycle and its regu- 
lation in a variety of organisms and use the findings from all of them 
to assemble a unified picture of how the cycle works. Many discoveries 
about the cell cycle have come from a systematic search for mutations 
that inactivate essential components of the cell-cycle control system 
in yeasts. Likewise, studies of both cultured mammalian cells and the 
embryos of frogs and sea urchins have been critical for examining the 
molecular mechanisms that underlie the cycle and its control in multicel- 
lular organisms like ourselves. 


THE CELL-CYCLE CONTROL SYSTEM 


Two types of machinery are involved in cell division: one manufactures 
the new components of the growing cell, and another hauls the compo- 
nents into their correct places and partitions them appropriately when 
the cell divides in two. The cell-cycle control system switches all this 
machinery on and off at the correct times, thereby coordinating the vari- 
ous steps of the cycle. The core of the cell-cycle control system is a series 
of molecular switches that operate in a defined sequence and orchestrate 
the main events of the cycle, including DNA replication and the segrega- 
tion of duplicated chromosomes. In this section, we review the protein 
components of the control system and discuss how they work together to 
trigger the different phases of the cycle. 


The Cell-Cycle Control System Depends on Cyclically 
Activated Protein Kinases Called Cdks 


The cell-cycle control system governs the cell-cycle machinery by 
cyclically activating and then inactivating the key proteins and protein 
complexes that initiate or regulate DNA replication, mitosis, and cytoki- 
nesis. This regulation is carried out largely through the phosphorylation 
and dephosphorylation of proteins involved in these essential processes. 


As discussed in Chapter 4, phosphorylation followed by dephosphoryla- 
tion is one of the most common ways by which cells switch the activity of 
a protein on and off (see Figure 4-46), and the cell-cycle control system 


The Cell-Cycle Control System 
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cyclin-dependent 
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Figure 18-5 The accumulation of cyclins 
helps regulate the activity of Cdks. The 
formation of active cyclin-Cdk complexes 
drives various cell-cycle events, including 
entry into S phase or M phase. The figure 
shows the changes in cyclin concentration 
and Cdk protein kinase activity responsible 
for controlling entry into M phase. 
Increasing concentration of the relevant 
cyclin (called M cyclin) helps direct the 
formation of the active cyclin-Cdk complex 
(M-Cdk) that drives entry into M phase. 
Although the enzymatic activity of each 
type of cyclin-Cdk complex rises and falls 
during the course of the cell cycle, the 
concentration of the Cdk component does 
not (not shown). 


Figure 18-4 Progression through the cell cycle depends on cyclin- 
dependent protein kinases (Cdks). A Cdk must bind a regulatory 
protein called a cyclin before it can become enzymatically active. This 
activation also requires an activating phosphorylation of the Cdk (not 
shown, but see Movie 18.1). Once activated, a cyclin-Cdk complex 
phosphorylates key proteins in the cell that are required to initiate 
particular steps in the cell cycle. The cyclin also helps direct the Cdk to 
the target proteins that the Cdk phosphorylates. 


uses this mechanism extensively and repeatedly. The phosphorylation 
reactions that control the cell cycle are carried out by a specific set of 
protein kinases, while dephosphorylation is performed by a set of protein 
phosphatases. 


The protein kinases at the core of the cell-cycle control system are pre- 
sent in proliferating cells throughout the cell cycle. They are activated, 
however, only at appropriate times in the cycle, after which they are 
quickly inactivated. Thus, the activity of each of these kinases rises and 
falls in a cyclical fashion. Some of these protein kinases, for example, 
become active toward the end of G; phase and are responsible for driving 
the cell into S phase; another kinase becomes active just before M phase 
and drives the cell into mitosis. 


Switching these kinases on and off at the appropriate times is partly 
the responsibility of another set of proteins in the control system—the 
cyclins. Cyclins have no enzymatic activity themselves, but they must 
bind to the cell-cycle kinases before the kinases can become enzymati- 
cally active. The kinases of the cell-cycle control system are therefore 
known as cyclin-dependent protein kinases, or Cdks (Figure 18-4). 
Cyclins are so-named because, unlike the Cdks, their concentrations 
vary in a cyclical fashion during the cell cycle. The cyclical changes in 
cyclin concentrations help drive the cyclic assembly and activation of the 
cyclin-Cdk complexes. Once activated, cyclin-Cdk complexes help trig- 
ger various cell-cycle events, such as entry into S phase or M phase 
(Figure 18-5). We discuss how the Cdks and cyclins were discovered in 
How We Know, pp. 615-616. 


Different Cyclin-Cdk Complexes Trigger Different Steps in 
the Cell Cycle 


There are several types of cyclins and, in most eukaryotes, several types 
of Cdks involved in cell-cycle control. Different cyclin-Cdk complexes 
trigger different steps of the cell cycle. As shown in Figure 18-5, the cyc- 
lin that acts in G2 to trigger entry into M phase is called M cyclin, and the 
active complex it forms with its Cdk is called M-Cdk. Other cyclins, called 
S cyclins and G;/S cyclins, bind to a distinct Cdk protein late in G; to 
form S-Cdk and G;/S-Cdk, respectively; these cyclin-Cdk complexes help 
launch S phase. The rise and fall of S cyclin and M cyclin concentrations 
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HOW WE KNOW 
DISCOVERY OF CYCLINS AND Cdks 


For many years, cell biologists watched the “puppet 
show” of DNA synthesis, mitosis, and cytokinesis but 
had no idea what was behind the curtain, control- 
ling these events. The cell-cycle control system was 
simply a “black box” inside the cell. It was not even 
clear whether there was a separate control system, or 
whether the cell-cycle machinery somehow controlled 
itself. A breakthrough came with the identification of 
the key proteins of the control system and the realiza- 
tion that they are distinct from the components of the 
cell-cycle machinery—the enzymes and other proteins 
that perform the essential processes of DNA replication, 
chromosome segregation, and so on. 


The first components of the cell-cycle control system 
to be discovered were the cyclins and cyclin-dependent 
protein kinases (Cdks) that drive cells into M phase. 
They were found in studies of cell division conducted 
on animal eggs. 


Back to the egg 


The fertilized eggs of many animals are especially suit- 
able for biochemical studies of the cell cycle because 
they are exceptionally large and divide rapidly. An egg 
of the frog Xenopus, for example, is just over 1 mm in 
diameter (Figure 18-6). After fertilization, it divides rap- 
idly to partition the egg into many smaller cells. These 
rapid cell cycles consist mainly of repeated S and M 
phases, with very short or no G; or G2 phases between 
them. There is no new gene transcription: all of the 
mRNAs and most of the proteins required for this early 
stage of embryonic development are already packed 
into the very large egg during its development as an 
oocyte in the ovary of the mother. In these early division 
cycles (cleavage divisions), no cell growth occurs, and all 
the cells of the embryo divide synchronously, growing 
smaller and smaller with each division (Movie 18.2). 
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Because of the synchrony, it is possible to prepare an 
extract from frog eggs that is representative of the cell- 
cycle stage at which the extract is made. The biological 
activity of such an extract can then be tested by inject- 
ing it into a Xenopus oocyte (the immature precursor of 
the unfertilized egg) and observing, microscopically, its 
effects on cell-cycle behavior. The Xenopus oocyte is an 
especially convenient test system for detecting an activ- 
ity that drives cells into M phase, because of its large 
size, and because it has completed DNA replication and 
is suspended at a stage in the meiotic cell cycle (dis- 
cussed in Chapter 19) that is equivalent to the Gz phase 
of a mitotic cell cycle. 


Give us an M 


In such experiments, Kazuo Matsui and colleagues 
found that an extract from an M-phase egg instantly 
drives the oocyte into M phase, whereas cytoplasm 
from a cleaving egg at other phases of the cycle does 
not. When they first made this discovery, they did not 
know the molecules or the mechanism responsible, so 
they referred to the unidentified agent as maturation 
promoting factor, or MPF (Figure 18-7). By testing cyto- 
plasm from different stages of the cell cycle, Matsui and 
colleagues found that MPF activity oscillates dramati- 
cally during the course of each cell cycle: it increased 
rapidly just before the start of mitosis and fell rapidly to 
zero toward the end of mitosis (see Figure 18-5). This 
oscillation made MPF a strong candidate for a compo- 
nent involved in cell-cycle control. 


When MPF was finally purified, it was found to contain a 
protein kinase that was required for its activity. But the 
kinase portion of MPF did not act alone. It had to have a 
specific protein (now known to be M cyclin) bound to it 
in order to function. M cyclin was discovered in a differ- 
ent type of experiment, involving clam eggs. 


Figure 18-6 A mature Xenopus egg 
provides a convenient system for 
studying the cell cycle. (Courtesy of 
Tony Mills.) 
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Figure 18-7 MPF activity was discovered by injecting Xenopus egg cytoplasm 
into Xenopus oocytes. (A) A Xenopus oocyte is injected with cytoplasm taken from 
a Xenopus egg in M phase. The cell extract drives the oocyte into M phase of the 
first meiotic division (a process called maturation), causing the large nucleus to break 
down and a spindle to form. (B) When the cytoplasm is instead taken from a cleaving 
egg in interphase, it does not cause the oocyte to enter M phase. Thus, the extract in 
(A) must contain some activity—a maturation promoting factor (MPF)—that triggers 


entry into M phase. 


Fishing in clams 


M cyclin was initially identified by Tim Hunt as a protein 
whose concentration rose gradually during interphase 
and then fell rapidly to zero as cleaving clam eggs 
went through M phase (see Figure 18-5). The protein 
repeated this performance in each cell cycle. Its role in 
cell-cycle control, however, was initially obscure. The 
breakthrough occurred when cyclin was found to be a 
component of MPF and to be required for MPF activity. 
Thus, MPF, which we now call M-Cdk, is a protein com- 
plex containing two subunits—a regulatory subunit, M 
cyclin, and a catalytic subunit, the mitotic Cdk. After the 
components of M-Cdk were identified, other types of 
cyclins and Cdks were isolated, whose concentrations 
or activities, respectively, rose and fell at other stages in 
the cell cycle. 


All in the family 


While biochemists were identifying the proteins that 
regulate the cell cycles of frog and clam embryos, 
yeast geneticists—led by Lee Hartwell, studying bak- 
er’s yeast (Saccharomyces cerevisiae), and Paul Nurse, 


studying fission yeast (S. pombe)—were taking a genetic 
approach to dissecting the cell-cycle control system. By 
studying mutants that get stuck or misbehave at spe- 
cific points in the cell cycle, these researchers were able 
to identify many genes responsible for cell-cycle con- 
trol. Some of these genes turned out to encode cyclin 
or Cdk proteins, which were unmistakably similar—in 
both amino acid sequence and function—to their coun- 
terparts in frogs and clams. Similar genes were soon 
identified in human cells. 


Many of the cell-cycle control genes have changed so 
little during evolution that the human version of the 
gene will function perfectly well in a yeast cell. For 
example, Nurse and colleagues were the first to show 
that a yeast with a defective copy of the gene encod- 
ing its only Cdk fails to divide, but it divides normally 
if a copy of the appropriate human gene is artificially 
introduced into the defective cell. Surely, even Darwin 
would have been astonished at such clear evidence that 
humans and yeasts are cousins. Despite a billion years 
of divergent evolution, all eukaryotic cells—whether 
yeast, animal, or plant—use essentially the same mol- 
ecules to control the events of their cell cycle. 
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S cyclin M cyclin Figure 18-8 Distinct Cdks associate with 
different cyclins to trigger the different 
events of the cell cycle. For simplicity, 
only two types of cyclin-Cdk complexes are 
shown: one that triggers S phase and one 
that triggers M phase. 
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are shown in Figure 18-8. Another group of cyclins, called G, cyclins, 
act earlier in G; and bind to other Cdk proteins to form G1-Cdks, which 
help drive the cell through G; toward S phase. We see later that the for- 
mation of these G;-Cdks in animal cells usually depends on extracellular 
signal molecules that stimulate cells to divide. The names of the main 
cyclins and their Cdks are listed in Table 18-2. 


Each of these cyclin-Cdk complexes phosphorylates a different set of tar- 
get proteins in the cell. G;/S-Cdks, for example, phosphorylate regulatory 
proteins that activate transcription of genes required for DNA replication. 
By activating different sets of target proteins, each type of complex trig- 
gers a different transition step in the cell cycle. 


Cyclin Concentrations Are Regulated by Transcription 
and by Proteolysis 


As discussed in Chapter 7, the concentration of a given protein in the cell 
is determined by the rate at which the protein is synthesized and the rate 
at which it is degraded. Over the course of the cell cycle, the concentra- 
tion of each type of cyclin rises gradually and then falls abruptly (see 
Figure 18-8). The gradual increase in cyclin concentration stems from 
continued transcription of cyclin genes and synthesis of cyclin proteins, 
whereas the rapid fall in cyclin concentration is precipitated by a full- 
scale targeted destruction of the protein. 


The abrupt degradation of M and S cyclins partway through M phase 
depends on a large enzyme called—for reasons that will become clear 
later—the anaphase-promoting complex or cyclosome (APC/C). This 
complex tags these cyclins with a chain of ubiquitin. As discussed in 
Chapter 7, proteins marked in this way are directed to proteasomes where 
they are rapidly degraded (see Figure 7-43). The ubiquitylation and deg- 
radation of the cyclin returns its Cdk to an inactive state (Figure 18-9). 


TABLE 18-2 THE MAJOR CYCLINS AND CDKS OF VERTEBRATES 


Cyclin-Cdk Complex Cdk Partner 


G1-Cdk cyclin D* Cdk4, Cdk6 
G1/S-Cdk cyclin E Cdk2 
S-Cdk cyclin A Cdk2 
M-Cdk cyclin B Cdk1 


*There are three forms of cyclin D in mammals (cyclins D1, D2, and D3). 
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Figure 18-9 The activity of some Cdks 

is regulated by cyclin degradation. 
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Like cyclin accumulation, cyclin destruction can also help drive the 
transition from one phase of the cell cycle to the next. For example, M 
cyclin degradation—and the resulting inactivation of M-Cdk—leads to the 
molecular events that take the cell out of mitosis. 


The Activity of Cyclin-Cdk Complexes Depends on 
Phosphorylation and Dephosphorylation 


The appearance and disappearance of cyclin proteins play an important 
part in regulating Cdk activity during the cell cycle, but there is more to 
the story: although cyclin concentrations increase gradually, the activ- 
ity of the associated cyclin-Cdk complexes tends to switch on abruptly 
at the appropriate time in the cell cycle (see Figure 18-5). What triggers 
the abrupt activation of these complexes? It turns out that the cyclin- 
Cdk complex contains inhibitory phosphates, and to become active, the 
Cdk must be dephosphorylated by a specific protein phosphatase (Figure 
18-10). Thus protein kinases and phosphatases act together to regulate 
the activity of specific cyclin-Cdk complexes and help control progression 
through the cell cycle. 


Cdk Activity Can Be Blocked by Cdk Inhibitor Proteins 


In addition to phosphorylation and dephosphorylation, the activity of 
Cdks can also be modulated by the binding of Cdk inhibitor proteins. 
The cell-cycle control system uses these inhibitors to block the assembly 
or activity of certain cyclin-Cdk complexes. Some Cdk inhibitor proteins, 
for example, help maintain Cdks in an inactive state during the G; phase 
of the cycle, thus delaying progression into S phase (Figure 18-11). 
Pausing at this transition point in G; gives the cell more time to grow, or 
allows it to wait until extracellular conditions are favorable for division. 


The Cell-Cycle Control System Can Pause the Cycle in 
Various Ways 


As mentioned earlier, the cell-cycle control system can transiently delay 
progress through the cycle at various transition points to ensure that the 
major events of the cycle occur only when the cell is fully prepared (see 
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Figure 18-3). At these transitions, the control system monitors the cell’s 
internal state and the conditions in its environment, before allowing the 
cell to continue through the cycle. For example, it allows entry into S 
phase only if environmental conditions are appropriate; it triggers mito- 
sis only after the DNA has been completely replicated; and it initiates 
chromosome segregation only after the duplicated chromosomes are 
correctly aligned on the mitotic spindle. 


To accomplish these feats, the control system uses a combination of the 
mechanisms we have described. At the G,-to-S transition, it uses Cdk 
inhibitors to keep cells from entering S phase and replicating their DNA 
(see Figure 18-11). At the G2-to-M transition, it suppresses the activation 
of M-Cdk by inhibiting the phosphatase required to activate the Cdk (see 
Figure 18-10). And it can delay the exit from mitosis by inhibiting the 
activation of APC/C, thus preventing the degradation of M cyclin (see 
Figure 18-9). 


These mechanisms, summarized in Figure 18-12, allow the cell to make 
“decisions” about whether to progress through the cell cycle or to arrest 
in the current phase and await more favorable conditions. In the next sec- 
tion, we take a closer look at how the cell-cycle control system decides 
whether a cell in G; should commit to divide. 


chromosomes not 
properly attached 
to spindle 


| | 


DNA replication DNA 
not complete damage 


CONTROLLER 


t 


environment not 
favorable 


The Cell-Cycle Control System 619 


Figure 18—11 The activity of a Cdk can be 
blocked by the binding of a Cdk inhibitor. 
In this instance, the inhibitor protein (called 
p27) binds to an activated cyclin-Cdk 
complex. Its attachment prevents the 

Cdk from phosphorylating target proteins 
required for progress through G4 into S 
phase. 


Figure 18-12 The cell-cycle control 
system uses various mechanisms to pause 
the cycle at specific transition points. 
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Proceed to S phase? 
Pause? 

Withdraw to Gy? 
Withdraw permanently 
and terminally differentiate? 


differentiation +— 
Figure 18-13 The transition from G4 to 
S phase offers the cell a crossroad. The 
cell can commit to completing another cell 
cycle, pause temporarily until conditions 
are right, or withdraw from the cell cycle 
altogether—either temporarily in Go, or 


permanently in the case of terminally 
differentiated cells. 


QUESTION 18-3 


Why do you suppose cells have 
evolved a special Go phase to exit 
from the cell cycle, rather than just 
stopping in G1 and not moving on to 
S phase? 


The Cell-Division Cycle 


Gı PHASE 


In addition to being a bustling period of metabolic activity, cell growth, 
and repair, G; serves as an important time of decision-making for the cell. 
Based on intracellular signals that provide information about the size of 
the cell and extracellular signals reflecting conditions in the environment, 
the cell-cycle control machinery can either hold the cell transiently in G, 
(or in a more prolonged nonproliferative state, Go), or allow it to prepare 
for entry into the S phase of another cell cycle. Once past this critical 
G,-to-S transition, a cell usually continues all the way through the rest 
of the cell cycle. In yeasts, the Gj-to-S transition is therefore sometimes 
called Start, because passing it represents a commitment to complete a 
full cell cycle (Figure 18-13). 


In this section, we consider how the cell-cycle control system decides 
whether to proceed to S phase and commit to another cell cycle—and 
what happens once the decision is made. The molecular mechanisms 
involved are especially important, as defects in them can lead to unre- 
strained cell proliferation and cancer. 


Cdks Are Stably Inactivated in G; 


During early M phase, when mitosis begins, the cell is awash with active 
cyclin-Cdk complexes. Those S-Cdks and M-Cdks must be disabled 
by the end of M phase to allow the cell to complete division and to pre- 
vent it from initiating another round of division without spending any 
time in G}. 


To usher a cell from the upheaval of M phase to the relative tranquility 
of Gj, the cell-cycle control machinery must inactivate its inventory of 
S-Cdk and M-Cdk. It does so in several ways: by eliminating all of the 
existing cyclins, by blocking the synthesis of new ones, and by deploying 
Cdk inhibitor proteins to muffle the activity of any remaining cyclin-Cdk 
complexes. The use of multiple mechanisms makes this system of sup- 
pression robust, ensuring that essentially all Cdk activity is shut down. 
This wholesale inactivation resets the cell-cycle control system and gen- 
erates a stable G; phase, during which the cell can grow and monitor its 
environment before committing to a new round of division. 


Mitogens Promote the Production of the Cyclins That 
Stimulate Cell Division 


As a general rule, mammalian cells will multiply only if they are stimu- 
lated to do so by extracellular signals, called mitogens, produced by other 
cells. If deprived of such signals, the cell cycle arrests in Gj; if the cell is 
deprived of mitogens for long enough, it will withdraw from the cell cycle 
and enter a nonproliferating state, in which the cell can remain for days 
or weeks, months, or even for the lifetime of the organism, as we discuss 
shortly. 


Escape from cell-cycle arrest—or from certain nonproliferating states— 
requires the accumulation of cyclins. Mitogens act by switching on cell 
signaling pathways that stimulate the synthesis of G, cyclins, G;/S cyc- 
lins, and other proteins involved in DNA synthesis and chromosome 
duplication. The buildup of these cyclins triggers a wave of G,/S-Cdk 
activity, which ultimately relieves the negative controls that otherwise 
block progression from G to S phase. 


One crucial negative control is provided by the Retinoblastoma (Rb) pro- 
tein. Rb was initially identified from studies of a rare childhood eye tumor 
called retinoblastoma, in which the Rb protein is missing or defective. 
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mitogen Figure 18—14 One way in which mitogens 
stimulate cell proliferation is by inhibiting 
the Rb protein. In the absence of mitogens, 
dephosphorylated Rb protein holds specific 
transcription regulators in an inactive state. 
Mitogens binding to cell-surface receptors 
activate intracellular signaling pathways 
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Rb is abundant in the nuclei of all vertebrate cells, where it binds to par- 
ticular transcription regulators and prevents them from turning on the 
genes required for cell proliferation. Mitogens release the Rb brake by 
triggering the activation of G;-Cdks and G)/S-Cdks. These complexes 
phosphorylate the Rb protein, altering its conformation so that it releases 
its bound transcription regulators, which are then free to activate the 
genes required for entry into S phase (Figure 18-14). 


DNA Damage Can Temporarily Halt Progression 
Through G; 


The cell-cycle control system uses several distinct mechanisms to halt 
progress through the cell cycle if DNA is damaged, and it can do so at 
various transition points. The mechanism that operates at the Gj-to-S 
transition, which prevents the cell from replicating damaged DNA, is 
especially well understood. DNA damage in G; causes an increase in 
both the concentration and activity of a protein called p53, which is a 
transcription regulator that activates the gene encoding a Cdk inhibitor 
protein called p21. The p21 protein binds to G;/S-Cdk and S-Cdk, pre- 
venting them from driving the cell into S phase (Figure 18-15). The arrest 
of the cell cycle in G; gives the cell time to repair the damaged DNA 
before replicating it. If the DNA damage is too severe to be repaired, p53 
can induce the cell to kill itself through apoptosis, a form of programmed 
cell death we discuss later. If p53 is missing or defective, the unrestrained 
replication of damaged DNA leads to a high rate of mutation and the 
generation of cells that tend to become cancerous. In fact, mutations in 
the p53 gene are found in about half of all human cancers (Movie 18.3). 


Cells Can Delay Division for Prolonged Periods by 
Entering Specialized Nondividing States 


As mentioned earlier, cells can delay progress through the cell cycle 
at specific transition points, to wait for suitable conditions or to repair 
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damaged DNA. They can also withdraw from the cell cycle for prolonged 
periods—either temporarily or permanently. 


The most radical decision that the cell-cycle control system can make 
is to withdraw the cell from the cell cycle permanently. This decision 
has a special importance in multicellular organisms. Many cells in the 
human body permanently stop dividing when they differentiate. In such 
terminally differentiated cells, such as nerve or muscle cells, the cell-cycle 


control system is dismantled completely and genes encoding the relevant 
QUESTION 18-4 cyclins and Cdks are irreversibly shut down. 

In the absence of appropriate signals, other cell types withdraw from 
What might be the consequences the cell cycle only temporarily, entering an arrested state called Go. They 
if a cell replicated damaged DNA retain the ability to reassemble the cell-cycle control system quickly and 
perorare paningi to divide again. Most liver cells, for example, are in Go, but they can be 


stimulated to proliferate if the liver is damaged. 


Much of the diversity in cell-division rates in the adult body lies in the vari- 
ation in the time that cells spend in Go or in G;. Some cell types, including 
liver cells, normally divide only once every year or two, whereas certain 
epithelial cells in the gut divide more than twice a day to renew the lining 
of the gut continually. Many of our cells fall somewhere in between: they 
can divide if the need arises but normally do so infrequently. 


S PHASE 


Before a cell divides, it must replicate its DNA. As we discuss in Chapter 6, 
this replication must occur with extreme accuracy to minimize the risk of 
mutations in the next cell generation. Of equal importance, every nucleo- 
tide in the genome must be copied once—and only once—to prevent the 
damaging effects of gene amplification. In this section, we consider the 
elegant molecular mechanisms by which the cell-cycle control system 
initiates DNA replication and, at the same time, prevents replication from 
happening more than once per cell cycle. 


S-Cdk Initiates DNA Replication and Blocks Re-Replication 


Like any monumental task, configuring chromosomes for replica- 
tion requires a certain amount of preparation. For eukaryotic cells, this 
preparation begins early in Gj, when DNA is made replication-ready by 
the recruitment of proteins to the sites along each chromosome where 
replication will begin. These nucleotide sequences, called origins of repli- 
cation, serve as landing pads for the proteins and protein complexes that 
control and carry out DNA synthesis, as discussed in Chapter 6. 


One of these protein complexes, called the origin recognition complex 
(ORC), remains perched on the replication origins throughout the cell 
cycle. To prepare the DNA for replication, the ORC recruits a protein 
called Cdc6, whose concentration rises early in G;. Together, these pro- 
teins load the DNA helicases that will ultimately open up the double helix 
at the origin of replication. Once this prereplicative complex is in place, the 
replication origin is loaded and ready to “fire.” 


The signal to commence replication comes from S-Cdk, the cyclin-Cdk 
complex that triggers S phase. S-Cdk is assembled and activated at the 
end of G;. During S phase, S-Cdk activates the DNA helicases in the pre- 
replicative complex and promotes the assembly of the rest of the proteins 
that form the replication fork (see Figure 6-20). In doing so, S-Cdk essen- 
tially “pulls the trigger” that initiates DNA replication (Figure 18-16). 


In addition to triggering the initiation of DNA synthesis at a replication 
origin, S-Cdk also helps prevent re-replication. It does so by phosphor- 
ylating both Cdcé and the ORC. Phosphorylation inactivates these proteins 
and helps prevent the reassembly of the prereplicative complex. These 
safeguards help ensure that DNA replication cannot be reinitiated later in 
the same cell cycle. When Cdks are inactivated in the next G; phase, the 
ORC and Cdcé are reactivated, thereby allowing origins to be prepared 
for the following S phase. 


Incomplete Replication Can Arrest the Cell Cycle in G2 


Earlier, we described how DNA damage can signal the cell-cycle control 
system to delay progress through the G-to-S transition, preventing the 
cell from replicating damaged DNA. But what if errors occur during DNA 
replication—or if replication is delayed? How does the cell keep from 
dividing with DNA that is incorrectly or incompletely replicated? 


To address these issues, the cell-cycle control system uses a mechanism 
that can delay entry into M phase. As we saw in Figure 18-10, the activity 
of M-Cdk is inhibited by phosphorylation at particular sites. For the cell 
to progress into mitosis, these inhibitory phosphates must be removed by 
an activating protein phosphatase called Cdc25. If DNA replication stalls, 
the appearance of single-stranded DNA at the replication fork triggers 
a DNA damage response. Part of this response includes the inhibition 
of the phosphatase Cdc25, which prevents the removal of the inhibitory 
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phosphates from M-Cdk. As a result, M-Cdk remains inactive and M 
phase is delayed until DNA replication is complete and any DNA damage 
is repaired. 


Once a cell has successfully replicated its DNA in S phase, and pro- 
gressed through Gg, it is ready to enter M phase. During this relatively 
brief period, the cell will accomplish a remarkable reconfiguration, divid- 
ing its nucleus (mitosis) and then its cytoplasm (cytokinesis; see Figure 
18-2). In the next three sections, we describe the events that occur dur- 
ing M phase. We first present a brief overview of M phase as a whole and 
then discuss, in sequence, the mechanics of mitosis and of cytokinesis, 
with a focus on animal cells. 


M PHASE 


Although M phase (which includes mitosis plus cytokinesis) takes place 
over a relatively short amount of time—about one hour in a mammalian 
cell—it is by far the most dramatic phase of the cell cycle. During this brief 
period, the cell reorganizes virtually all of its components and distributes 
them equally into the two daughter cells. The earlier phases of the cell 
cycle, in effect, set the stage for the drama of M phase. 


The central problem for a cell in M phase is to accurately segregate the 
chromosomes that were duplicated in the preceding S phase, so that 
each new daughter cell receives an identical copy of the genome. With 
minor variations, all eukaryotes solve this problem in a similar way: they 
assemble two specialized cytoskeletal machines—one that pulls the 
duplicated chromosomes apart (during mitosis) and another that divides 
the cytoplasm into two halves (during cytokinesis). We begin our discus- 
sion of M phase with an overview of how the cell sets the processes of 
M phase in motion. 


Figure 18-17 Activated M-Cdk indirectly activates more 

M-Cdk, creating a positive feedback loop. Once activated, 

M-Cdk phosphorylates, and thereby activates, more Cdk-activating 
phosphatase (Cdc25). This phosphatase can now activate more M-Cdk 
by removing the inhibitory phosphate groups from the Cdk subunit. 


M-Cdk Drives Entry into Mitosis 


One of the most remarkable features of the cell-cycle control system is 
that a single protein complex, M-Cdk, brings about all the diverse and 
intricate rearrangements that occur in the early stages of mitosis. Among 
its many duties, M-Cdk helps prepare the duplicated chromosomes for 
segregation and induces the assembly of the mitotic spindle—the machin- 
ery that will pull the duplicated chromosomes apart. 


M-Cdk complexes accumulate throughout G2. But this stockpile is not 
switched on until the end of G2, when the activating phosphatase Cdc25 
removes the inhibitory phosphates holding M-Cdk activity in check. This 
act of activation is self-reinforcing: once activated, each M-Cdk complex 
can indirectly turn on additional M-Cdk complexes—by phosphorylating 
and activating more Cdc25 (Figure 18-17). Activated M-Cdk also shuts 
down the inhibitory kinase Weel (see Figure 18-10), further promoting 
the production of activated M-Cdk. The overall consequence is that, once 
M-Cdk activation begins, it ignites an explosive increase in M-Cdk activ- 
ity that drives the cell abruptly—and irreversibly—from Gz into M phase. 


The same M-Cdk complexes that drive entry into mitosis also help set the 
stage for its exit. Activated M-Cdk turns on APC/C, which—after a period 
of delay—directs the destruction of M cyclin and, ultimately, the inactiva- 
tion of M-Cdk. 


Cohesins and Condensins Help Configure Duplicated 
Chromosomes for Separation 


To ensure that duplicated chromosomes will be properly separated dur- 
ing mitosis, two related protein complexes help cells manage and keep 
track of the replicated DNA. The first complexes come into play during 
S phase. When a chromosome is duplicated, the two copies remain tightly 
bound together. These identical copies—called sister chromatids—each 
contain a single, double-stranded molecule of DNA, along with its associ- 
ated proteins. The sisters are held together by protein complexes called 
cohesins, which assemble along the length of each chromatid as the 
DNA is replicated. This cohesion between sister chromatids is crucial 
for proper chromosome segregation, and it is broken completely only in 
late mitosis to allow the sisters to be pulled apart by the mitotic spindle. 
Defects in sister-chromatid cohesion lead to major errors in chromosome 
segregation. In humans, such mis-segregation can lead to abnormal 
numbers of chromosomes, resulting in genetic imbalances that are usu- 
ally deleterious or even lethal. 


When the cell enters M phase, the duplicated chromosomes condense, 
becoming visible under the microscope. Protein complexes called 
condensins help carry out this chromosome condensation, which 
reduces mitotic chromosomes to compact bodies that can be more easily 
segregated within the crowded confines of the dividing cell. The assembly 
of condensin complexes onto the DNA is triggered by the phosphoryla- 
tion of condensins by M-Cdk. 


Cohesins and condensins are structurally related, and both are thought 
to form ring structures around chromosomal DNA. However, whereas 
cohesins encircle the two sister chromatids, tying them together (Figure 
18-18A), condensins assemble along each individual sister chromatid, 


M Phase 625 
inactive 
phosphatase 


POSITIVE 
FEEDBACK 


active 
Cdc25 
phosphatase 


inhibitory 
phosphates 


+. 
g è 


inactive active 
M-Cdk M-Cdk 


QUESTION 18-5 


A small amount of cytoplasm 
isolated from a mitotic cell is 
injected into an unfertilized frog 
oocyte, causing the oocyte to 

enter M phase (see Figure 18—7A). 
A sample of the injected oocyte’s 
cytoplasm is then taken and injected 
into a second oocyte, causing this 
cell also to enter M phase. The 
process is repeated many times 
until, essentially, none of the original 
protein sample remains, and yet, 
cytoplasm taken from the last in the 
series of injected oocytes is still able 
to trigger entry into M phase with 
undiminished efficiency. Explain this 
remarkable observation. 
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Figure 18—18 Cohesins and condensins help to configure 
duplicated chromosomes for segregation. (A) Cohesins tie together 
the two adjacent sister chromatids in each duplicated chromosome. 
They are thought to form large protein rings that surround the sister 
chromatids, preventing them from coming apart, until the rings are 
broken late in mitosis. (B) Condensins help coil each sister chromatid 
(in other words, each DNA double helix) into a smaller, more compact 
structure that can be more easily segregated during mitosis. These 
cartoons illustrate one way that condensins might package chromatids; 
the exact mechanism is not known. (C) A scanning electron micrograph 
of a condensed, duplicated mitotic chromosome, consisting of two 
sister chromatids joined along their length. The constricted region 
(arrow) is the centromere, where each chromatid will attach to the 
mitotic spindle, which pulls the sister chromatids apart toward the 
end of mitosis. (D) An electron micrograph of a duplicated mitotic 
chromosome in which condensin is labelled with antibodies attached 
to tiny gold particles (dark dots), showing that condensins are found 
mainly in the central core of the chromosome. This centralized location 
of condensins is also represented in the cartoon model shown in (B). 

p a (C, courtesy of Terry D. Allen; D, adapted from N. Kireeva et al., J. Cell 
(D) Biol. 166:775-785, 2004. With permission from Rockefeller University Press.) 


helping each of these double helices to coil up into a more compact form 
(Figure 18-18B-D). Together, these proteins help configure replicated 
chromosomes for mitosis. 


Different Cytoskeletal Assemblies Carry Out Mitosis and 
Cytokinesis 


After the duplicated chromosomes have condensed, a pair of com- 
plex cytoskeletal machines assemble in sequence to carry out the two 
mechanical processes that occur in M phase. The mitotic spindle carries 
out nuclear division (mitosis), and, in animal cells and many unicellular 
eukaryotes, the contractile ring carries out cytoplasmic division (cytokine- 
sis) (Figure 18-19). Both structures disassemble rapidly after they have 
performed their tasks. 


The mitotic spindle is composed of microtubules and the various proteins 
that interact with them, including microtubule-associated motor proteins 
(discussed in Chapter 17). In all eukaryotic cells, the mitotic spindle is 
Figure 18-19 Two transient cytoskeletal 
structures mediate M phase in animal chromosomes 
cells. The mitotic spindle assembles first 
to separate the duplicated chromosomes. 
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Then, the contractile ring assembles to PROGRESSION 
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myosin. Plant cells use a very different 
mechanism to divide the cytoplasm microtubules of the actin and myosin filaments of the 


as we discuss later. mitotic spindle “contractile ring 


responsible for separating the duplicated chromosomes and allocating 
one copy of each chromosome to each daughter cell. 


The contractile ring consists mainly of actin and myosin filaments 
arranged in a ring around the equator of the cell (see Chapter 17). It starts 
to assemble just beneath the plasma membrane toward the end of mito- 
sis. As the ring contracts, it pulls the membrane inward, thereby dividing 
the cell in two (see Figure 18-19). We discuss later how plant cells, which 
have a cell wall to contend with, divide their cytoplasm by a very different 
mechanism. 


M Phase Occurs in Stages 


Although M phase proceeds as a continuous sequence of events, it is 
traditionally divided into a series of stages. The first five stages of M 
phase—prophase, prometaphase, metaphase, anaphase, and telo- 
phase—constitute mitosis, which was originally defined as the period 
in which the chromosomes are visible in the microscope (because they 
have become condensed). Cytokinesis, which constitutes the final stage 
of M phase, begins before mitosis ends. The stages of M phase are sum- 
marized in Panel 18-1 (pp. 628-629). Together, they form a dynamic 
sequence in which several independent cycles—involving the chromo- 
somes, cytoskeleton, and centrosomes—are coordinated to produce two 
genetically identical daughter cells (Movie 18.4 and Movie 18.5). 


MITOSIS 


Before nuclear division, or mitosis, begins, each chromosome has been 
duplicated and consists of two identical sister chromatids, held together 
along their length by cohesin proteins (see Figure 18—18A). During mitosis, 
the cohesin proteins are removed, the sister chromatids split apart, and 
the chromosomes are pulled to opposite poles of the cell by the mitotic 
spindle (Figure 18-20). In this section, we examine how the mitotic spin- 
dle assembles and functions. We discuss how the dynamic instability of 
microtubules and the activity of microtubule-associated motor proteins 
contribute to both the assembly of the spindle and its ability to segre- 
gate the duplicated chromosomes. We then consider the mechanism that 
operates during mitosis to ensure the synchronous separation of these 
chromosomes. Finally, we discuss how the daughter nuclei form. 


Centrosomes Duplicate to Help Form the Two Poles of 
the Mitotic Spindle 


Before M phase begins, two critical events must be completed: DNA must 
be fully replicated, and, in animal cells, the centrosome must be dupli- 
cated. The centrosome is the principal microtubule-organizing center in 
animal cells (see Figure 17-13). Duplication is necessary for the centro- 
some to be able to help form the two poles of the mitotic spindle and so 
that each daughter cell will receive its own centrosome. 
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Figure 18-20 Sister chromatids separate 
at the beginning of anaphase. The mitotic 
spindle then pulls the separated sisters to 
opposite poles of the cell. 
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centrosome is duplicated. 


In the light micrographs of dividing animal cells shown in 
this panel, chromosomes are stained orange and 
microtubules are green. 

(Courtesy of Julie Canman and Ted Salmon.) 


The division of a cell into two daughters occurs in the 

M phase of the cell cycle. M phase consists of nuclear division, 
or mitosis, and cytoplasmic division, or cytokinesis. In this 
figure, M phase has been greatly expanded for clarity. Mitosis is 
itself divided into five stages, and these, together with 
cytokinesis, are described in this panel. 
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Outside the nucleus, the 
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between the two 
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simplicity, only three 
chromosomes are drawn. 
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At anaphase, the sister 
chromatids synchronously 
separate and are pulled 
slowly toward the spindle 
pole to which they are 
attached. The kinetochore 
microtubules get shorter, 
and the spindle poles 

also move apart, both 
contributing to chromosome 
segregation. 


During telophase, the two 
sets of chromosomes arrive 
at the poles of the spindle. 
A new nuclear envelope 
reassembles around each 
set, completing the 
formation of two nuclei 
and marking the end of 
mitosis. The division of the 
cytoplasm begins with the 
assembly of the contractile 
ring. 


During cytokinesis of an 
animal cell, the cytoplasm 
is divided in two by a 
contractile ring of actin 
and myosin filaments, 
which pinches the cell into 
two daughters, each with 
one nucleus. 


629 


630 CHAPTER 18 The Cell-Division Cycle 


centrioles 


centrosome 


nucleus 


| duplicated 
centrosome 


aster 


forming 
mitotic | 


duplicated nuclear 

chromosome envelope 
metaphase spindle 
spindle pole 


Figure 18-21 The centrosome in an interphase cell duplicates to 
form the two poles of a mitotic spindle. Most animal cells contain 

a single centrosome, which consists of a pair of centrioles (gray) 
embedded in a matrix of proteins (light green). The volume of the 
centrosome matrix is exaggerated in this diagram for clarity. Although 
the centrioles are made of a cylindrical array of short microtubules, 
they do not participate in the nucleation of microtubules from the 
centrosome (see Figure 17-13). Centrosome duplication begins at 
the start of S phase and is complete by the end of Gz. Initially, the two 
centrosomes remain together, but, in early M phase, they separate, 
and each nucleates its own aster of microtubules. The centrosomes 
then move apart, and the microtubules that interact between the 

two asters elongate preferentially to form a bipolar mitotic spindle, 
with an aster at each pole. When the nuclear envelope breaks down, 
the spindle microtubules are able to interact with the duplicated 
chromosomes. 


Centrosome duplication begins at the same time as DNA replication and 
the process is triggered by the same Cdks—Gj;/S-Cdk and S-Cdk—that 
initiate DNA replication. Initially, when the centrosome duplicates, both 
copies remain together as a single complex on one side of the nucleus. 
As mitosis begins, however, the two centrosomes separate, and each 
nucleates a radial array of microtubules called an aster. The two asters 
move to opposite sides of the nucleus to form the two poles of the mitotic 
spindle (Figure 18-21). The process of centrosome duplication and sepa- 
ration is known as the centrosome cycle. 


The Mitotic Spindle Starts to Assemble in Prophase 


The mitotic spindle begins to form in prophase. The assembly of this 
highly dynamic structure depends on the remarkable properties of 
microtubules. As discussed in Chapter 17, microtubules continuously 
polymerize and depolymerize by the addition and loss of their tubulin 
subunits, and individual filaments alternate between growing and shrink- 
ing—a process called dynamic instability (see Figure 17-14). At the start of 
mitosis, dynamic stability rises—in part because M-Cdk phosphorylates 
microtubule-associated proteins that influence microtubule stability. As 
a result, during prophase, rapidly growing and shrinking microtubules 
extend in all directions from the two centrosomes, exploring the interior 
of the cell. 


Some of the microtubules growing from one centrosome interact with 
the microtubules from the other centrosome (see Figure 18-21). This 
interaction stabilizes the microtubules, preventing them from depolym- 
erizing, and it joins the two sets of microtubules together to form the 
basic framework of the mitotic spindle, with its characteristic bipolar 
shape (Movie 18.6). The two centrosomes that give rise to these micro- 
tubules are now called spindle poles, and the interacting microtubules 
are called interpolar microtubules (Figure 18-22). The assembly of the 
spindle is driven, in part, by motor proteins associated with the interpo- 
lar microtubules that help to cross-link the two sets of microtubules and 
push the two centrosomes apart. 


Chromosomes Attach to the Mitotic Spindle at 
Prometaphase 


Prometaphase starts abruptly with the disassembly of the nuclear enve- 
lope, which breaks up into small membrane vesicles. This process is 
triggered by the phosphorylation and consequent disassembly of nuclear 
pore proteins and the intermediate filament proteins of the nuclear lamina, 


Figure 18-22 A bipolar mitotic spindle is formed by the selective 
stabilization of interacting microtubules. New microtubules grow 
out in random directions from the two centrosomes. The two ends 

of a microtubule (by convention, called the plus and the minus ends) 
have different properties, and it is the minus end that is anchored 

in the centrosome (discussed in Chapter 17). The free plus ends 

are dynamically unstable and switch suddenly from uniform growth 
(outward-pointing red arrows) to rapid shrinkage (inward-pointing blue 
arrows). When two microtubules from opposite centrosomes interact 
in an overlap zone, motor proteins and other microtubule-associated 
proteins cross-link the microtubules together (black dots) in a way 
that stabilizes the plus ends by decreasing the probability of their 
depolymerization. 


the network of fibrous proteins that underlies and stabilizes the nuclear 
envelope (see Figure 17-7). The spindle microtubules, which have been 
lying in wait outside the nucleus, now gain access to the duplicated chro- 
mosomes and capture each and everyone (see Panel 18-1, pp. 628-629). 


Spindle microtubules attach to the chromosomes at their kinetochores, 
protein complexes that assemble on the centromere of each condensed 
chromosome during late prophase (Figure 18-23). Kinetochores recog- 
nize the special DNA sequence that forms a chromosome'’s centromere: if 
this sequence is altered, kinetochores fail to assemble and, consequently, 
the chromosomes fail to segregate properly during mitosis. 


Once the nuclear envelope has broken down, a randomly probing micro- 
tubule encountering a kinetochore will bind to it, thereby capturing that 
chromosome and linking it to a spindle pole (see Panel 18-1, pp. 628- 
629). Of course, each duplicated chromosome has two kinetochores—one 
on each sister chromatid. Because these sister kinetochores face in oppo- 
site directions, they tend to attach to microtubules from opposite poles of 
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Figure 18-23 Kinetochores attach chromosomes to the mitotic spindle. (A) A fluorescence micrograph of a duplicated mitotic 
chromosome. The kinetochores are stained red with fluorescent antibodies that recognize kinetochore proteins. These antibodies 


come from patients with scleroderma (a disease that causes progressive overproduction of connective tissue in skin and other organs), 
who, for unknown reasons, produce antibodies against their own kinetochore proteins. (B) Schematic drawing of a mitotic chromosome 


showing its two sister chromatids attached to kinetochore microtubules, which bind to the kinetochore at their plus ends. Each 


kinetochore forms a plaque on the surface of the centromere. (C) Each microtubule is attached to the kinetochore via interactions with 
multiple copies of an elongated connecting protein complex (blue). These complexes bind to the sides of the microtubule near its plus 
end, allowing the microtubule to grow or shrink while remaining attached to the kinetochore. (A, from R.P. Zinkowski et al., J. Cell Biol. 


113:1091-1110, 1991. With permission from The Rockefeller University Press.) 
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Figure 18-24 Three classes of microtubules make up the mitotic spindle. (A) Schematic drawing of a spindle 
with chromosomes attached, showing the three types of spindle microtubules: astral microtubules, kinetochore 
microtubules, and interpolar microtubules. In reality, the chromosomes are much larger than shown, and usually 
multiple microtubules are attached to each kinetochore. (B) Fluorescence micrograph of duplicated chromosomes 
aligned at the center of the mitotic spindle. In this image, kinetochores are red dots, microtubules are green, and 
chromosomes are blue. (B, from A. Desai, Curr. Biol. 10:R508, 2000. With permission from Elsevier.) 
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Figure 18-25 Motor proteins and 
chromosomes can direct the assembly 

of a functional bipolar spindle in the 
absence of centrosomes. In these 
fluorescence micrographs of embryos of 
the insect Sciara, the microtubules are 
stained green and the chromosomes red. 
The top micrograph shows a normal spindle 
formed by centrosomes in a fertilized 
embryo. The bottom micrograph shows a 
spindle formed without centrosomes in an 
embryo that initiated development without 
fertilization and thus lacks the centrosome 
normally provided by the sperm when it 
fertilizes the egg. Note that the spindle 
with centrosomes has an aster at each 
pole, whereas the spindle formed without 
centrosomes does not. As shown, both 
types of spindles are able to segregate 
chromosomes. (From B. de Saint Phalle 
and W. Sullivan, J. Cell Biol. 141:1383-1391, 
1998. With permission from The Rockefeller 
University Press.) 


the spindle; thus, each duplicated chromosome becomes linked to both 
spindle poles. The attachment to opposite poles, called bi-orientation, 
generates tension on the kinetochores, which are being pulled in oppo- 
site directions. This tension signals to the sister kinetochores that they 
are attached correctly and are ready to be separated (Movie 18.7). The 
cell-cycle control system monitors this tension to ensure correct chro- 
mosome attachment (see Figure 18-3), a safeguard we discuss in detail 
shortly. 


The number of microtubules attached to each kinetochore varies among 
species: each human kinetochore binds 20-40 microtubules, for example, 
whereas a yeast kinetochore binds just one. The three classes of microtu- 
bules that form the mitotic spindle are highlighted in Figure 18-24. 


Chromosomes Assist in the Assembly of the 
Mitotic Spindle 


Chromosomes are more than passive passengers in the process of spin- 
dle assembly: they themselves can stabilize and organize microtubules 
into functional mitotic spindles. In cells without centrosomes—including 
some animal cell types and all plant cells—the chromosomes nucleate 
microtubule assembly, and motor proteins then move and arrange the 
microtubules and chromosomes into a bipolar spindle. Even in animal 
cells that normally have centrosomes, a bipolar spindle can still be formed 
in this way if the centrosomes are removed (Figure 18-25). In cells with 
centrosomes, the chromosomes, motor proteins, and centrosomes work 
together to form the mitotic spindle. 


Chromosomes Line Up at the Spindle Equator 
at Metaphase 


During prometaphase, the duplicated chromosomes, now attached to 
the mitotic spindle, begin to move about, as if jerked first this way and 
then that. Eventually, they align at the equator of the spindle, halfway 
between the two spindle poles, thereby forming the metaphase plate. 
This event defines the beginning of metaphase (see Figure 18-24B and 


Figure 18-26). Although the forces that act to bring the chromosomes 
to the equator are not completely understood, both the continual growth 
and shrinkage of the microtubules and the action of microtubule motor 
proteins are required. A continuous balanced addition and loss of tubulin 
subunits is also required to maintain the metaphase spindle: when tubu- 
lin addition to the ends of microtubules is blocked by the drug colchicine, 
tubulin loss continues until the metaphase spindle disappears. 


The chromosomes gathered at the equator of the metaphase spindle 
oscillate back and forth, continually adjusting their positions, indicat- 
ing that the tug-of-war between the microtubules attached to opposite 
poles of the spindle continues to operate after the chromosomes are all 
aligned. If the kinetochore attachments on one side of a duplicated chro- 
mosome are artificially severed with a laser beam during metaphase, 
the entire chromosome immediately moves toward the pole to which it 
remains attached. Similarly, if the attachment between sister chromatids 
is cut, the two chromosomes separate and move toward opposite poles. 
These experiments show that the duplicated chromosomes are not sim- 
ply deposited at the metaphase plate. They are suspended there under 
tension. In anaphase, that tension will pull the sister chromatids apart. 


Proteolysis Triggers Sister-Chromatid Separation at 
Anaphase 


Anaphase begins abruptly with the breakage of the cohesin linkages 
that hold together the sister chromatids in a duplicated chromosome 
(see Figure 18-18A). This release allows each chromosome to be pulled 
toward the spindle pole to which it is attached (Figure 18-27). The move- 
ment segregates the two identical sets of chromosomes to opposite ends 
of the spindle (see Panel 18-1, pp. 628-629). 


The cohesin linkage is destroyed by a protease called separase. Before 
anaphase begins, this protease is held in an inactive state by an inhibitory 
protein called securin. At the beginning of anaphase, securin is targeted 
for destruction by APC/C—the same protein complex, discussed earlier, 
that marks M cyclin for degradation. Once securin has been removed, 
separase is then free to sever the cohesin linkages (Figure 18-28). 
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Figure 18-26 During metaphase, 
duplicated chromosomes gather halfway 
between the two spindle poles. This 
fluorescence micrograph shows multiple 
mitotic spindles at metaphase in a fruit fly 
(Drosophila) embryo. The microtubules are 
stained green, and the chromosomes are 
stained blue. At this stage of Drosophila 
development, there are multiple nuclei in 
one large cytoplasmic compartment, and 
all of the nuclei divide synchronously, which 
is why all of the nuclei shown here are at 
the same metaphase stage of the cell cycle 
(Movie 18.8). Metaphase spindles are 
usually pictured in two dimensions, as they 
are here; when viewed in three dimensions, 
however, the chromosomes are seen to be 
gathered at a platelike region at the equator 
of the spindle—the so-called metaphase 
plate. (Courtesy of William Sullivan.) 


Figure 18-27 Sister chromatids separate at anaphase. In the transition from (A) metaphase to (B) anaphase, the sister chromatids of 
duplicated chromosomes (stained blue) suddenly separate, allowing the chromosomes to move toward opposite poles, as seen in these 
plant cells stained with gold-labeled antibodies to label the microtubules (red). Plant cells generally do not have centrosomes and 
therefore have less sharply defined spindle poles than do animal cells (see also Figure 18-35); nonetheless, spindle poles are present 
here at the top and bottom of each micrograph, although they cannot be seen. (Courtesy of Andrew Bajer.) 
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Figure 18—28 APC/C triggers the 
separation of sister chromatids by 
promoting the destruction of cohesins. 
APC/C indirectly triggers the cleavage of 
the cohesins that hold sister chromatids 
together. It catalyzes the ubiquitylation and 
destruction of an inhibitory protein called 
securin, which blocks the activation of a 
proteolytic enzyme called separase. When 
freed from securin, separase cleaves the 
cohesin complexes, allowing the mitotic 
spindle to pull the sister chromatids apart. 


QUESTION 18-6 


If fine glass needles are used to 
manipulate a chromosome inside a 
living cell during early M phase, it 
is possible to trick the kinetochores 
on the two sister chromatids into 
attaching to the same spindle 

pole. This arrangement is normally 
unstable, but the attachments can 
be stabilized if the needle is used 
to gently pull the chromosome so 
that the microtubules attached to 
both kinetochores (via the same 
spindle pole) are under tension. 
What does this suggest to you 
about the mechanism by which 
kinetochores normally become 
attached and stay attached to 
microtubules from opposite spindle 
poles? Is the finding consistent with 
the possibility that a kinetochore 

is programmed to attach to 
microtubules from a particular 
spindle pole? Explain your answers. 
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Chromosomes Segregate During Anaphase 


Once the sister chromatids separate, they all move toward the spindle 
poles at the same speed, which is typically about 1 um per minute. The 
movement is the consequence of two independent and overlapping pro- 
cesses that rely on different parts of the mitotic spindle. In anaphase A, 
the kinetochore microtubules shorten and the attached chromosomes 
move poleward. In anaphase B, the spindle poles themselves move apart, 
further segregating the two sets of chromosomes (Figure 18-29). 


The driving force for the movements of anaphase A is thought to be 
provided mainly by the loss of tubulin subunits from both ends of the 
kinetochore microtubules. The driving forces in anaphase B are thought 
to be provided by two sets of motor proteins—members of the kinesin 
and dynein families—operating on different types of spindle microtubules 
(see Figure 17—19A). Kinesin proteins act on the long, overlapping inter- 
polar microtubules, sliding the microtubules from opposite poles past 
one another at the equator of the spindle and pushing the spindle poles 
apart. Dynein proteins, anchored to the plasma membrane, move along 
astral microtubules to pull the poles apart (see Figure 18—29B). 


An Unattached Chromosome Will Prevent Sister- 
Chromatid Separation 


If a dividing cell were to begin to segregate its chromosomes before all 
the chromosomes were properly attached to the spindle, one daughter 
cell would receive an incomplete set of chromosomes, while the other 
would receive a surplus. Both situations could be lethal. Thus, a dividing 
cell must ensure that every last chromosome is attached properly to the 
spindle before it completes mitosis. To monitor chromosome attachment, 
the cell makes use of a negative signal: the kinetochores of unattached 
chromosomes send a “stop” signal to the cell-cycle control system. This 
signal inhibits further progress through mitosis by blocking the activation 
of APC/C (see Figure 18-28). Without active APC/C, the sister chromatids 
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Figure 18-29 Two processes segregate chromosomes at anaphase. the poles apart 
(A) In anaphase A, the sister chromatids are pulled toward opposite 
poles as the kinetochore microtubules depolymerize. The force driving 
this movement is generated mainly at the kinetochore. (B) In anaphase 
B, the two spindle poles move apart as the result of two separate 
forces: (1) the elongation and sliding of the interpolar microtubules 
past one another pushes the two poles apart, and (2) forces exerted on 
the outward-pointing astral microtubules at each spindle pole pull the 
poles away from each other, toward the cell cortex. Both forces 

are thought to depend on the action of motor proteins associated 

with the microtubules. 


remain glued together. Thus, none of the duplicated chromosomes can 
be pulled apart until every chromosome has been positioned correctly on 
the mitotic spindle. The absence of APC/C also prevents the destruction 
of cyclins (see Figure 18-9), so that Cdks remain active—thus prolong- 
ing mitosis. This spindle assembly checkpoint thereby controls the onset 
of anaphase, as well as the exit from mitosis, as mentioned earlier (see 
Figure 18-12). 


The Nuclear Envelope Re-forms at Telophase 


By the end of anaphase, the chromosomes have separated into two equal 
groups, one at each pole of the spindle. During telophase, the final stage 
of mitosis, the mitotic spindle disassembles, and a nuclear envelope 
reassembles around each group of chromosomes to form the two daugh- 
ter nuclei (Movie 18.9 and Movie 18.10). Vesicles of nuclear membrane 
associate with the clustered chromosomes and then fuse to re-form the 
nuclear envelope (see Panel 18-1, pp. 628-629). During this process, 
the nuclear pore proteins and nuclear lamins that were phosphorylated 
during prometaphase are now dephosphorylated, which allows them to 
reassemble and rebuild the nuclear envelope and lamina (Figure 18-30). 
Once the nuclear envelope has been re-established, the pores restore the 
localization of cytosolic and nuclear proteins and the condensed chro- 
mosomes decondense into their interphase state. A new nucleus has 
been created, and mitosis is complete. All that remains is for the cell to 
complete its division into two separate daughter cells. 
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Figure 18-30 The nuclear envelope 
breaks down and re-forms during 
mitosis. The phosphorylation of nuclear 
pore proteins and lamins helps trigger the 
disassembly of the nuclear envelope at 
prometaphase. Dephosphorylation of these 
proteins at telophase helps reverse the 
process. 


QUESTION 18-7 


Consider the events that lead to 
the formation of the new nucleus 
at telophase. How do nuclear and 
cytosolic proteins become properly 
re-sorted so that the new nucleus 
contains nuclear proteins but not 
cytosolic proteins? 


Figure 18-31 The cleavage furrow is 
formed by the action of the contractile 
ring underneath the plasma membrane. 
In these scanning electron micrographs of 
a dividing fertilized frog egg, the cleavage 
furrow is unusually well defined. (A) Low- 
magnification view of the egg surface. (B) A 
higher-magnification view of the cleavage 
furrow. (From H.W. Beams and R.G. Kessel, 
Am. Sci. 64:279-290, 1976. With permission 
of Sigma Xi.) 
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CYTOKINESIS 


Cytokinesis, the process by which the cytoplasm is cleaved in two, com- 
pletes M phase. It usually begins in anaphase but is not completed until 
after the two daughter nuclei have re-formed in telophase. Whereas 
mitosis depends on a transient microtubule-based structure, the mitotic 
spindle, cytokinesis in animal cells depends on a transient structure based 
on actin and myosin filaments, the contractile ring (see Figure 18-19). 
Both the plane of cleavage and the timing of cytokinesis, however, are 
determined by the mitotic spindle. 


The Mitotic Spindle Determines the Plane of 
Cytoplasmic Cleavage 


The first visible sign of cytokinesis in animal cells is a puckering and fur- 
rowing of the plasma membrane that occurs during anaphase (Figure 
18-31). The furrowing invariably occurs along a plane that runs perpen- 
dicular to the long axis of the mitotic spindle. This positioning ensures 
that the cleavage furrow cuts between the two groups of segregated chro- 
mosomes, so that each daughter cell receives an identical and complete 
set of chromosomes. If the mitotic spindle is deliberately displaced (using 


a fine glass needle) as soon as the furrow appears, the furrow will dis- 
appear and a new one will develop at a site corresponding to the new 
spindle location and orientation. Once the furrowing process is well 
under way, however, cleavage proceeds even if the mitotic spindle is arti- 
ficially sucked out of the cell or depolymerized using the drug colchicine. 


How does the mitotic spindle dictate the position of the cleavage furrow? 
The mechanism is still uncertain, but it appears that, during anaphase, the 
overlapping interpolar microtubules that form the central spindle recruit 
and activate proteins that signal to the cell cortex to initiate the assembly 
of the contractile ring at a position midway between the spindle poles 
(Figure 18-32). Because these signals originate during anaphase, this 
mechanism also contributes to the timing of cytokinesis in late mitosis. 


When the mitotic spindle is located centrally in the cell—the usual 
situation in most dividing cells—the two daughter cells will be of equal 
size. During embryonic development, however, there are some instances 
in which the dividing cell moves its mitotic spindle to an asymmetrical 
position, and, consequently, the furrow creates two daughter cells that 
differ in size. In most of these asymmetric divisions, the daughters also 
differ in the molecules they inherit, and they usually develop into differ- 
ent cell types. 


The Contractile Ring of Animal Cells Is Made of Actin 
and Myosin Filaments 


The contractile ring is composed mainly of an overlapping array of actin 
filaments and myosin filaments (Figure 18-33). It assembles at anaphase 
and is attached to membrane-associated proteins on the cytosolic face of 
the plasma membrane. Once assembled, the contractile ring is capable 
of exerting a force strong enough to bend a fine glass needle inserted into 
the cell before cytokinesis. Much of this force is generated by the slid- 
ing of the actin filaments against the myosin filaments. Unlike the stable 
association of actin and myosin filaments in muscle fibers, however, the 
contractile ring is a dynamic and transient structure: it assembles to carry 
out cytokinesis, gradually becomes smaller as cytokinesis progresses, 
and disassembles completely once the cell has been cleaved in two. 


Cell division in many animal cells is accompanied by large changes in 
cell shape and a decrease in the adherence of the cell to its neighbors, to 
the extracellular matrix, or to both. These changes result, in part, from 
the reorganization of actin and myosin filaments in the cell cortex, only 
one aspect of which is the assembly of the contractile ring. Mammalian 
fibroblasts in culture, for example, spread out flat during interphase, as a 
result of the strong adhesive contacts they make with the surface they are 
growing on—called the substratum. As the cells enter M phase, however, 
they round up. This change in shape takes place, in part, because some 
of the plasma membrane proteins responsible for attaching the cells to 
the substratum—the integrins (discussed in Chapter 20)—become phos- 
phor-ylated and thus weaken their grip. Once cytokinesis is complete, the 
daughter cells reestablish their strong contacts with the substratum and 


Figure 18-33 The contractile ring divides the cell in two. 

(A) Schematic diagram of the midregion of a dividing cell showing the 
contractile ring beneath the plasma membrane and the remains of 
the two sets of interpolar microtubules. (B) In this dividing sea urchin 
embryo, the contractile ring is revealed by staining with a fluorescently 
labeled antibody that binds to myosin. (B, courtesy of George von 
Dassow.) 
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Figure 18-32 Position of the cleavage 
furrow is dictated by the central spindle. 
In this model, the interpolar microtubules 
recruit proteins that generate a signal (red 
arrows) that activates a protein called RhoA 
in the cell cortex. RhoA, a member of the 
Rho family of GTPases discussed in Chapter 
17, controls the assembly and contraction 
of the contractile ring midway between the 
spindle poles. 
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Figure 18-34 Animal cells change shape during M phase. In these micrographs of a mouse fibroblast dividing in culture, the 
same cell was photographed at successive times. Note how the cell becomes smaller and rounded as it enters mitosis; the two 
daughter cells then flatten out again after cytokinesis is complete. (Courtesy of Guenter Albrecht-Buehler.) 


QUESTION 18-8 


Draw a detailed view of the 
formation of the new cell wall 
that separates the two daughter 
cells when a plant cell divides (see 
Figure 18-35). In particular, show 
where the membrane proteins of 
the Golgi-derived vesicles end 

up, indicating what happens to 
the part of a protein in the Golgi 
vesicle membrane that is exposed 
to the interior of the Golgi vesicle. 
(Refer to Chapter 11 if you need a 
reminder of membrane structure.) 


flatten out again (Figure 18-34). When cells divide in an animal tissue, 
this cycle of attachment and detachment presumably allows the cells to 
rearrange their contacts with neighboring cells and with the extracellular 
matrix, so that the new cells produced by cell division can be accommo- 
dated within the tissue. 


Cytokinesis in Plant Cells Involves the Formation of a 
New Cell Wall 


The mechanism of cytokinesis in higher plants is entirely different from 
that in animal cells, presumably because plant cells are surrounded by 
a tough cell wall (discussed in Chapter 20). The two daughter cells are 
separated not by the action of a contractile ring at the cell surface but 
instead by the construction of a new wall that forms inside the dividing 
cell. The positioning of this new wall precisely determines the position of 
the two daughter cells relative to neighboring cells. Thus, the planes of 
cell division, together with cell enlargement, determine the final form of 
the plant. 


The new cell wall starts to assemble in the cytoplasm between the two 
sets of segregated chromosomes at the start of telophase. The assem- 
bly process is guided by a structure called the phragmoplast, which 
is formed by the remains of the interpolar microtubules at the equator 
of the old mitotic spindle. Small membrane-enclosed vesicles, largely 
derived from the Golgi apparatus and filled with polysaccharides and gly- 
coproteins required for the cell wall matrix, are transported along the 
microtubules to the phragmoplast. Here, they fuse to form a disclike, 
membrane-enclosed structure, which expands outward by further vesi- 
cle fusion until it reaches the plasma membrane and original cell wall, 
thereby dividing the cell in two (Figure 18-35). Later, cellulose microfi- 
brils are laid down within the matrix to complete the construction of the 
new cell wall. 


Membrane-enclosed Organelles Must Be Distributed to 
Daughter Cells When a Cell Divides 


Organelles such as mitochondria and chloroplasts cannot assemble 
spontaneously from their individual components; they arise only from the 
growth and division of the preexisting organelles. Likewise, endoplasmic 
reticulum (ER) and Golgi apparatus also derive from preexisting organelle 
fragments. How, then, are these various membrane-enclosed organelles 
segregated when the cell divides so that each daughter gets its share? 
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Mitochondria and chloroplasts are usually present in large numbers and 
will be safely inherited if, on average, their numbers simply double once 
each cell cycle. The ER in interphase cells is continuous with the nuclear 
membrane and is organized by the microtubule cytoskeleton (see Figure 
17-21). Upon entry into M phase, the reorganization of the microtubules 
releases the ER; in most cells, the released ER remains intact during mito- 
sis and is cut in two during cytokinesis. The Golgi apparatus fragments 
during mitosis; the fragments associate with the spindle microtubules 
via motor proteins, thereby hitching a ride into the daughter cells as the 
spindle elongates in anaphase. Other components of the cell—including 
the other membrane-enclosed organelles, ribosomes, and all of the solu- 
ble proteins—are inherited randomly when the cell divides. 


Having discussed how cells divide, we now turn to the general problem 
of how the size of an animal or an organ is determined, which leads us to 
consider how cell number and cell size are controlled. 


CONTROL OF CELL NUMBERS AND CELL SIZE 


A fertilized mouse egg and a fertilized human egg are similar in size— 
about 100 um in diameter. Yet an adult mouse is much smaller than an 
adult human. What are the differences between the control of cell behav- 
ior in humans and mice that generate such big differences in size? The 
same fundamental question can be asked about each organ and tissue 
in an individual's body. What adjustment of cell behavior explains the 
length of an elephant’s trunk or the size of its brain or its liver? These 
questions are largely unanswered, but it is at least possible to say what 
the ingredients of an answer must be. Three fundamental processes 
largely determine organ and body size: cell growth, cell division, and cell 
death. Each of these processes, in turn, depends on programs intrinsic to 
the individual cell, regulated by signals from other cells in the body. 


In this section, we first consider how organisms eliminate unwanted cells 
by a form of programmed cell death called apoptosis. We then discuss 
how extracellular signals balance cell death, cell growth, and cell divi- 
sion—thereby helping control the size of an animal and its organs. We 
conclude the section with a brief discussion of the extracellular signals 
that control these three processes. 
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Figure 18-35 Cytokinesis in a plant cell 

is guided by a specialized microtubule- 
based structure called the phragmoplast. 
(A) At the beginning of telophase, after 

the chromosomes have segregated, a new 
cell wall starts to assemble inside the cell at 
the equator of the old spindle. 

(B) The interpolar microtubules of the 
mitotic spindle remaining at telophase 

form the phragmoplast and guide vesicles, 
derived from the Golgi apparatus, toward 
the equator of the spindle. The vesicles, 
which are filled with cell wall material, 

fuse to form the growing new cell wall 

that grows outward to reach the plasma 
membrane and original cell wall. (C) The 
preexisting plasma membrane and the 
membrane surrounding the new cell wall 
then fuse, completely separating the two 
daughter cells. (D) A light micrograph of a 
plant cell in telophase is shown at a stage 
corresponding to (A). The cell has been 
stained to show both the microtubules and 
the two sets of chromosomes segregated at 
the two poles of the spindle. The location of 
the growing new cell wall is indicated by the 
arrowheads. (D, courtesy of Andrew Bajer.) 


QUESTION 18-9 


The Golgi apparatus is thought to 
be partitioned into the daughter 
cells at cell division by a random 
distribution of fragments that 

are created at mitosis. Explain 
why random partitioning of 
chromosomes would not work. 
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Figure 18-36 Apoptosis in the 
developing mouse paw sculpts the digits. 
(A) The paw in this mouse embryo has been 
stained with a dye that specifically labels 
cells that have undergone apoptosis. The 
apoptotic cells appear as bright green dots 
between the developing digits. (B) This cell 
death eliminates the tissue between the 
developing digits, as seen in the paw shown 
one day later. Here, few, if any, apoptotic 
cells can be seen—demonstrating how 
quickly apoptotic cells can be cleared from a 
tissue. (From W. Wood et al., Development 
127:5245-5252, 2000. With permission from 
The Company of Biologists Ltd.) 


Figure 18-37 As a tadpole changes into 
a frog, the cells in its tail are induced to 
undergo apoptosis. All of the changes that 
occur during metamorphosis, including the 
induction of apoptosis in the tadpole tail, 
are stimulated by an increase in thyroid 
hormone in the blood. 


The Cell-Division Cycle 


Apoptosis Helps Regulate Animal Cell Numbers 


The cells of a multicellular organism are members of a highly organized 
community. The number of cells in this community is tightly regulated— 
not simply by controlling the rate of cell division, but also by controlling 
the rate of cell death. If cells are no longer needed, they can remove 
themselves by activating an intracellular suicide program—a process 
called programmed cell death. In animals, the most common form of 
programmed cell death is called apoptosis (from a Greek word meaning 
“falling off,” as leaves fall from a tree). 


The amount of apoptosis that occurs in both developing and adult animal 
tissues can be astonishing. In the developing vertebrate nervous system, 
for example, more than half of some types of nerve cells normally die 
soon after they are formed. In a healthy adult human, billions of cells in 
the bone marrow and intestine perish every hour. It seems remarkably 
wasteful for so many cells to die, especially as the vast majority are per- 
fectly healthy at the time they kill themselves. What purposes does this 
massive cell suicide serve? 


In some cases, the answers are clear. Mouse paws—and our own hands 
and feet—are sculpted by apoptosis during embryonic development: they 
start out as spadelike structures, and the individual fingers and toes sepa- 
rate because the cells between them die (Figure 18-36). In other cases, 
cells die when the structure they form is no longer needed. When a tad- 
pole changes into a frog at metamorphosis, the cells in its tail die, and the 
tail, which is not needed in the adult frog, disappears (Figure 18-37). In 
these cases, the unneeded cells die largely through apoptosis. 


In adult tissues, cell death usually balances cell division, unless the tis- 
sue is growing or shrinking. If part of the liver is removed in an adult 
rat, for example, liver cells proliferate to make up the loss. Conversely, 
if a rat is treated with the drug phenobarbital, which stimulates liver cell 
division, the liver enlarges. However, when the phenobarbital treatment 
is stopped, apoptosis in the liver greatly increases until the organ has 
returned to its original size, usually within a week or so. Thus, the liver 
is kept at a constant size through regulation of both the rate of cell death 
and the rate of cell birth. 


Apoptosis Is Mediated by an Intracellular Proteolytic 
Cascade 


Cells that die as a result of acute injury typically swell and burst, spew- 
ing their contents across their neighbors, a process called cell necrosis 
(Figure 18-38A). This eruption triggers a potentially damaging inflam- 
matory response. By contrast, a cell that undergoes apoptosis dies neatly, 
without damaging its neighbors. A cell in the throes of apoptosis may 
develop irregular bulges—or blebs—on its surface; but it then shrinks 
and condenses (Figure 18—38B). The cytoskeleton collapses, the nuclear 
envelope disassembles, and the nuclear DNA breaks up into fragments 
(Movie 18.11). Most importantly, the cell surface is altered in such a 
manner that it immediately attracts phagocytic cells, usually specialized 
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phagocytic cells called macrophages (see Figure 15-32B). These cells 
engulf the apoptotic cell before its contents can leak out (Figure 18-380C). 
This rapid removal of the dying cell avoids the damaging consequences of 
cell necrosis, and it also allows the organic components of the apoptotic 
cell to be recycled by the cell that ingests it. 


The molecular machinery responsible for apoptosis, which seems to 
be similar in most animal cells, involves a family of proteases called 
caspases. These enzymes are made as inactive precursors, called 
procaspases, which are activated in response to signals that induce apop- 
tosis. Two types of caspases work together to take a cell apart. Initiator 
caspases cleave, and thereby activate, downstream executioner caspases, 
which dismember numerous key proteins in the cell (Figure 18-39). One 
executioner caspase, for example, targets the lamin proteins that form 
the nuclear lamina underlying the nuclear envelope (see Figure 18-30); 
this cleavage causes the irreversible breakdown of the nuclear lamina, 
which allows nucleases to enter the nucleus and break down the DNA. 
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Figure 18-38 Cells undergoing 
apoptosis die quickly and cleanly. 
Electron micrographs showing cells that 
have died (A) by necrosis or (B and C) 
by apoptosis. The cells in (A) and (B) 
died in a culture dish, whereas the cell 
in (C) died in a developing tissue and 
has been engulfed by a phagocytic cell. 
Note that the cell in (A) seems to have 
exploded, whereas those in (B) and (C) 
have condensed but seem relatively 
intact. The large vacuoles seen in 

the cytoplasm of the cell in (B) are a 
variable feature of apoptosis. (Courtesy 
of Julia Burne.) 


QUESTION 18-10 


Why do you think apoptosis 

occurs by a different mechanism 
from the cell death that occurs in 
cell necrosis? What might be the 
consequences if apoptosis were not 
achieved in so neat and orderly a 
fashion, whereby the cell destroys 
itself from within and avoids leakage 
of its contents into the extracellular 
space? 


Figure 18-39 Apoptosis is mediated 

by an intracellular proteolytic cascade. 
An initiator caspase is first made as an 
inactive monomer called a procaspase. 

An apoptotic signal triggers the assembly 
of adaptor proteins that bring together a 
pair of initiator caspases, which are thereby 
activated, leading to cleavage of a specific 
site in their protease domains. Executioner 
caspases are initially formed as inactive 
dimers. Upon cleavage by an initiator 
caspase, the executioner caspase dimer 
undergoes an activating conformational 
change. The executioner caspases then 
cleave a variety of key proteins, leading to 
apoptosis. 
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In this way, the cell dismantles itself quickly and cleanly, and its corpse is 
rapidly taken up and digested by another cell. 


Activation of the apoptotic program, like entry into a new stage of the cell 
cycle, is usually triggered in an all-or-none fashion: once a cell reaches a 
critical point along the path to destruction, it cannot turn back. 


The Intrinsic Apoptotic Death Program Is Regulated by 
the Bcl2 Family of Intracellular Proteins 


All nucleated animal cells contain the seeds of their own destruction: in 
these cells, inactive procaspases lie waiting for a signal to destroy the 
cell. It is therefore not surprising that caspase activity is tightly regulated 
to ensure that the death program is held in check until it is needed— 
for example, to eliminate cells that are superfluous, mislocated, or badly 
damaged. 


The main proteins that regulate the activation of caspases are members 
of the Bcl2 family of intracellular proteins. Some members of this pro- 
tein family promote caspase activation and cell death, whereas others 
inhibit these processes. Two of the most important death-inducing family 
members are proteins called Bax and Bak. These proteins—which are acti- 
vated in response to DNA damage or other insults—promote cell death 
by inducing the release of the electron-transport protein cytochrome c 
from mitochondria into the cytosol. Other members of the Bcl2 family 
(including Bcl2 itself) inhibit apoptosis by preventing Bax and Bak from 
releasing cytochrome c. The balance between the activities of pro-apop- 
totic and anti-apoptotic members of the Bcl2 family largely determines 
whether a cell lives or dies by apoptosis. 


The cytochrome c molecules released from mitochondria activate initia- 
tor procaspases—and induce cell death—by promoting the assembly of 
a large, seven-armed, pinwheel-like protein complex called an apopto- 
some. The apoptosome then recruits and activates a particular initiator 
procaspase, which then triggers a caspase cascade that leads to apopto- 
sis (Figure 18—40). 


Apoptotic Signals Can Also Come from Other Cells 


Sometimes the signal to commit suicide is not generated internally, but 
instead comes from a neighboring cell. Some of these extracellular sig- 
nals activate the cell death program by altering the activity of members 
of the Bcl2 family of proteins. Others stimulate apoptosis more directly by 
activating a set of cell-surface receptor proteins known as death receptors. 


One particularly well-understood death receptor, called Fas, is present 
on the surface of a variety of mammalian cell types. Fas is activated by 
a membrane-bound protein, called Fas ligand, present on the surface of 
specialized immune cells called killer lymphocytes. These killer cells help 
regulate immune responses by inducing apoptosis in other immune cells 
that are unwanted or are no longer needed—and activating Fas is one 
way they do so. The binding of Fas ligand to its receptor triggers the 
assembly of a death-inducing signaling complex, which includes specific 
initiator procaspases that, when activated, launch a caspase cascade that 
leads to cell death. 


Animal Cells Require Extracellular Signals to Survive, 
Grow, and Divide 


In a multicellular organism, the fate of individual cells is controlled by 
signals from other cells. Such communication ensures that a cell survives 
only when it is needed and divides only when another cell is required, 
either to allow tissue growth or to replace cell loss. 
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Most of the extracellular signal molecules that influence cell survival, cell 
growth, and cell division are either soluble proteins secreted by other 
cells or proteins that are bound to the surface of other cells or to the 
extracellular matrix. Although most act positively to stimulate one or 
more of these cell processes, some act negatively to inhibit a particular 
process. The positively acting signal proteins can be classified, on the 
basis of their function, into three major categories: 


1. Survival factors promote cell survival, largely by suppressing 
apoptosis. 

2. Mitogens stimulate cell division, primarily by overcoming the 
intracellular braking mechanisms that block entry into the cell cycle 
in late G1. 

3. Growth factors stimulate cell growth (an increase in cell size 
and mass) by promoting the synthesis and inhibiting the degradation 
of proteins and other macromolecules. 


These categories are not mutually exclusive, as many signal molecules 
have more than one of these functions. Unfortunately, the term “growth 
factor” is often used as a catch-all phrase to describe a protein with any 
of these functions. Indeed, the phrase “cell growth” is frequently used 
inappropriately to mean an increase in cell number, which is more cor- 
rectly termed “cell proliferation.” 


In the following three sections, we examine each of these types of signal 
molecules in turn. 


Survival Factors Suppress Apoptosis 


Animal cells need signals from other cells just to survive. If deprived of 
such survival factors, cells activate a caspase-dependent intracellular 
suicide program and die by apoptosis. This requirement for signals from 
other cells helps ensure that cells survive only when and where they are 
needed. Many types of nerve cells, for example, are produced in excess 
in the developing nervous system and then compete for limited amounts 
of survival factors that are secreted by the target cells they contact. Those 
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Figure 18—40 Bax and Bak can trigger 
apoptosis by releasing cytochrome c 
from mitochondria. When Bak or Bax 
proteins are activated by an apoptotic 
stimulus, they aggregate in the outer 
mitochondrial membrane, leading to 

the release of cytochrome c into the 
cytosol by an unknown mechanism. 
Additional proteins in the mitochondrial 
intermembrane space are released at the 
same time—not shown. Cytochrome c then 
binds to an adaptor protein, causing it to 
assemble into a seven-armed complex 
called the apoptosome. This complex 
then recruits seven molecules of a specific 
initiator procaspase (procaspase-9). The 
procaspase-9 proteins become activated 
within the apoptosome and then go on to 
activate executioner procaspases in the 
cytosol (as shown in Figure 18-39), leading 
to a caspase cascade and apoptosis. 
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Figure 18-41 Cell death can help adjust 
the number of developing nerve cells to 
the number of target cells they contact. If 
more nerve cells are produced than can be 
supported by the limited amount of survival 
factor released by the target cells, some 
cells will receive insufficient amounts of 
survival factor to keep their suicide program 
suppressed and will undergo apoptosis. 
This strategy of overproduction followed 

by culling can help ensure that all target 
cells are contacted by nerve cells and that 
the “extra” nerve cells are automatically 
eliminated. 
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Figure 18-42 Survival factors often 
suppress apoptosis by regulating Bcl2 
family members. In this case, the survival 
factor binds to cell-surface receptors that 
activate an intracellular signaling pathway, 
which in turn activates a transcription 
regulator in the cytosol. This protein moves 
to the nucleus, where it activates the gene 
encoding Bcl2, a protein that inhibits 
apoptosis (see also Figure 16-33). 


apoptotic 
nerve cells 


FOR Oa 


nerve cells 


nerve 
cell 
body 

CELL DEATH 
nerve HELPS MATCH 
cell NUMBER OF 
axon gate are Ti NERVE CELLS 


TARGET CELLS 


‘eo TO NUMBER OF 


survival factor 


target cells released by target cells 


nerve cells that receive enough survival factor live, while the others die 
by apoptosis. In this way, the number of surviving nerve cells is auto- 
matically adjusted to match the number of cells with which they connect 
(Figure 18-41). A similar dependence on survival signals from neighbor- 
ing cells is thought to help control cell numbers in other tissues, both 
during development and in adulthood. 


Survival factors usually act through cell-surface receptors. Once acti- 
vated, the receptors turn on intracellular signaling pathways that keep 
the apoptotic death program suppressed, usually by regulating mem- 
bers of the Bcl2 family of proteins. Some survival factors, for example, 
increase the production of Bcl2, a protein that suppresses apoptosis 
(Figure 18—42). 


Mitogens Stimulate Cell Division by Promoting Entry into 
S Phase 


Most mitogens are secreted signal proteins that bind to cell-surface 
receptors. When activated by mitogen binding, these receptors initiate 
various intracellular signaling pathways (discussed in Chapter 16) that 
stimulate cell division. As we saw earlier, these signaling pathways act 
mainly by releasing the molecular brakes that block the transition from 
the G, phase of the cell cycle into S phase (see Figure 18-14). 


Most mitogens have been identified and characterized by their effects 
on cells in culture. One of the first mitogens identified in this way was 
platelet-derived growth factor, or PDGF, the effects of which are typical 
of many others discovered since. When blood clots form (in a wound, 
for example), blood platelets incorporated in the clots are stimulated to 
release PDGF. PDGF then binds to receptor tyrosine kinases (discussed in 
Chapter 16) in surviving cells at the wound site, stimulating these cells to 
proliferate and help heal the wound. In a similar way, if part of the liver is 
lost through surgery or acute injury, a mitogen called hepatocyte growth 
factor helps stimulate the surviving liver cells to proliferate. 


Growth Factors Stimulate Cells to Grow 


The growth of an organ—or an entire organism—depends as much on 
cell growth as it does on cell division. If cells divided without growing, 
they would get progressively smaller, and there would be no increase 
in total cell mass. In single-celled organisms such as yeasts, both cell 
growth and cell division require only nutrients. In animals, by contrast, 
both cell growth and cell division depend on signals from other cells. 
Cell growth, unlike cell division, does not depend on the cell-cycle con- 
trol system. Indeed, many animal cells, including nerve cells and most 
muscle cells, do most of their growing after they have terminally differ- 
entiated and permanently stopped dividing. 


Control of Cell Numbers and Cell Size 


Figure 18-43 Extracellular growth factors increase the synthesis 
and decrease the degradation of macromolecules. Binding of a 
growth factor to a receptor tyrosine kinase (RTK, a class of cell-surface 
receptor described in Chapter 16) initiates an intracellular signaling 
pathway that leads to activation of a protein kinase called Tor, which 
acts through multiple targets to stimulate protein synthesis and inhibit 
protein degradation (see also Figure 16-34). This action leads to a net 
increase in macromolecules and thereby cell growth. 


Like most survival factors and mitogens, most extracellular growth fac- 
tors bind to cell-surface receptors that activate intracellular signaling 
pathways. These pathways lead to the accumulation of proteins and other 
macromolecules. Growth factors both increase the rate of synthesis of 
these molecules and decrease their rate of degradation (Figure 18-43). 


Some extracellular signal proteins, including PDGF, can act as both 
growth factors and mitogens, stimulating both cell growth and progres- 
sion through the cell cycle. Such proteins help ensure that cells maintain 
their appropriate size as they proliferate. 


Compared to cell division, there has been surprisingly little study of how 
cell size is controlled in animals. As a result, it remains a mystery how 
different cell types in the same animal grow to be so different in size 
(Figure 18—44). 


Some Extracellular Signal Proteins Inhibit Cell Survival, 
Division, or Growth 


The extracellular signal proteins that promote survival, growth, and 
cell division act positively to increase the size of organs and organisms. 
Some extracellular signal proteins, however, act to oppose these positive 
regulators and thereby inhibit tissue growth. Myostatin, for example, is a 
secreted signal protein that normally inhibits the growth and proliferation 
of the precursor cells (myoblasts) that fuse to form skeletal muscle cells 
during mammalian development. When the gene that encodes myostatin 
is deleted in mice, their muscles grow to be several times larger than nor- 
mal, because both the number and the size of muscle cells is increased. 
Remarkably, two breeds of cattle that were bred for large muscles turned 
out to have mutations in the gene encoding myostatin (Figure 18-45). 


Cancers are similarly the products of mutations that set cells free from 
the normal “social” controls operating on cell survival, growth, and pro- 
liferation. Because cancer cells are generally less dependent than normal 
cells on signals from other cells, they can out-survive, outgrow, and out- 
divide their normal neighbors, producing tumors that can kill their host 
(see Chapter 20). 


In our discussions of cell division, we have focused entirely on the ordi- 
nary divisions that produce two daughter cells, each with a full and 
identical complement of the parent cell’s genetic material. There is, how- 
ever, a different and highly specialized type of cell division called meiosis, 
which is required for sexual reproduction in eukaryotes. In the next chap- 
ter, we describe the special features of meiosis and how they underlie the 
genetic principles that define the laws of inheritance. 


Figure 18-44 The cells in an animal can differ greatly in size. 

The neuron and liver cell shown here are drawn at the same scale. A 
neuron grows progressively larger after it has terminally differentiated 
and permanently stopped dividing. (Neuron adapted from S. Ramón y 
Cajal, Histologie du Systeme Nerveux de l'Homme et de Vertébrés, 
1909-1911. Paris: Maloine; reprinted, Madrid: C.S.I.C., 1972.) 
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(B) 


Figure 18—45 Mutation of the myostatin gene leads to a dramatic increase in muscle mass. (A) This Belgian 
Blue was produced by cattle breeders and was only later found to have a mutation in the myostatin gene. (B) Mice 
purposely made deficient in the same gene also have remarkably big muscles. A normal mouse is shown at the top 
for comparison with the muscular mutant shown at the bottom. (A, Yann Arthus-Bertrand/Getty Images; B, from 


S.-J. Lee, PLoS One 2:e789, 2007.) 


ESSENTIAL CONCEPTS 


The eukaryotic cell cycle consists of several distinct phases. In inter- 
phase, the cell grows and the nuclear DNA is replicated; in M phase, 
the nucleus divides (mitosis) followed by the cytoplasm (cytokinesis). 


In most cells, interphase consists of an S phase when DNA is dupli- 
cated plus two gap phases: G; and G2. These gap phases give 
proliferating cells more time to grow and prepare for S phase and 
M phase. 


The cell-cycle control system coordinates events of the cell cycle by 
sequentially and cyclically switching on and off the appropriate parts 
of the cell-cycle machinery. 


The cell-cycle control system depends on cyclin-dependent protein 
kinases (Cdks), which are cyclically activated by the binding of cyclin 
proteins and by phosphorylation and dephosphorylation; when acti- 
vated, Cdks phosphorylate key proteins in the cell. 


Different cyclin-Cdk complexes trigger different steps of the cell 
cycle: G;-Cdk drives the cell through G1; G;/S-Cdk and S-Cdk drive it 
into S phase; and M-Cdk drives it into mitosis. 


The control system also uses protein complexes, such as APC/C, to 
trigger the destruction of specific cell-cycle regulators at particular 
stages of the cycle. 


The cell-cycle control system can halt the cycle at specific transition 
points to ensure that intracellular and extracellular conditions are 
favorable and that each step is completed before the next is started. 
Some of these control mechanisms rely on Cdk inhibitors that block 
the activity of one or more cyclin-Cdk complexes. 


S-Cdk initiates DNA replication during S phase and helps ensure that 
the genome is copied only once. The cell-cycle control system can 
delay cell-cycle progression during G; or S phase to prevent cells 
from replicating damaged DNA. It can also delay the start of M phase 
to ensure that DNA replication is complete. 


Centrosomes duplicate during S phase and separate during G2. Some 
of the microtubules that grow out of the duplicated centrosomes 
interact to form the mitotic spindle. 
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e When the nuclear envelope breaks down, the spindle microtubules 
capture the duplicated chromosomes and pull them in opposite direc- 
tions, positioning the chromosomes at the equator of the metaphase 
spindle. 


e The sudden separation of sister chromatids at anaphase allows the 
chromosomes to be pulled to opposite poles; this movement is driven 
by the depolymerization of spindle microtubules and by microtubule- 
associated motor proteins. 


e A nuclear envelope re-forms around the two sets of segregated chro- 
mosomes to form two new nuclei, thereby completing mitosis. 


e In animal cells, cytokinesis is mediated by a contractile ring of actin 
filaments and myosin filaments, which assembles midway between 
the spindle poles; in plant cells, by contrast, a new cell wall forms 
inside the parent cell to divide the cytoplasm in two. 


e In animals, extracellular signals regulate cell numbers by controlling 
cell survival, cell growth, and cell proliferation. 


e Most animal cells require survival signals from other cells to avoid 
apoptosis—a form of cell suicide mediated by a proteolytic caspase 
cascade; this strategy helps ensure that cells survive only when and 
where they are needed. 


e Animal cells proliferate only if stimulated by extracellular mitogens 
produced by other cells; mitogens release the normal intracellular 
brakes that block progression from G or Go into S phase. 


e For an organism or an organ to grow, cells must grow as well as 
divide; animal cell growth depends on extracellular growth factors 
that stimulate protein synthesis and inhibit protein degradation. 


e Some extracellular signal molecules inhibit rather than promote cell 
survival, cell growth, or cell division. 


e Cancer cells fail to obey these normal “social” controls on cell behav- 
ior and therefore outgrow, out-divide, and out-survive their normal 
neighbors. 


KEY TERMS 


anaphase condensin metaphase 

anaphase-promoting contractile mitogen 
complex (APC/C) ring mitosis 

apoptosis cyclin mitotic spindle 

aster cytokinesis p53 

Bcl2 family G1-Cdk phragmoplast 

bi-orientation G41 cyclin programmed cell 

caspase G1 phase death 

Cdk (cyclin-dependent G2 phase prometaphase 

protein kinase) G,/S-Cdk prophase 

Cdk inhibitor protein G1/S cyclin S-Cdk 

cell cycle growth factor S cyclin 

cell-cycle control system interphase S phase 

centrosome kinetochore sister chromatid 

centrosome cycle M-Cdk spindle pole 

chromosome condensation M cyclin survival factor 


cohesin M phase telophase 


648 CHAPTER 18 


The Cell-Division Cycle 


QUESTIONS 


QUESTION 18-11 


Roughly, how long would it take a single fertilized human 
egg to make a cluster of cells weighing 70 kg by repeated 
divisions, if each cell weighs 1 nanogram just after cell 
division and each cell cycle takes 24 hours? Why does it take 
very much longer than this to make a 70 kg adult human? 


QUESTION 18-12 


The shortest eukaryotic cell cycles of all—shorter even 

than those of many bacteria—occur in many early animal 
embryos. These so-called cleavage divisions take place 
without any significant increase in the weight of the embryo. 
How can this be? Which phase of the cell cycle would you 
expect to be most reduced? 


QUESTION 18-13 


One important biological effect of a large dose of ionizing 
radiation is to halt cell division. 


A. How does this occur? 


B. What happens if a cell has a mutation that prevents it 
from halting cell division after being irradiated? 


C. What might be the effects of such a mutation if the cell 
is not irradiated? 


D. An adult human who has reached maturity will die within 
a few days of receiving a radiation dose large enough to 
stop cell division. What does that tell you (other than that 
one should avoid large doses of radiation)? 


QUESTION 18-14 


If cells are grown in a culture medium containing 
radioactive thymidine, the thymidine will be covalently 
incorporated into the cell's DNA during S phase. The 
radioactive DNA can be detected in the nuclei of individual 
cells by autoradiography: radioactive cells will activate 

a photographic emulsion and be labeled by black dots 
when looked at under a microscope. Consider a simple 
experiment in which cells are radioactively labeled by 
this method for only a short period (about 30 minutes). 
The radioactive thymidine medium is then replaced with 
one containing unlabeled thymidine, and the cells are 
grown for some additional time. At different time points 
after replacement of the medium, cells are examined in a 
microscope. The fraction of cells in mitosis (which can be 
easily recognized because the cells have rounded up and 
their chromosomes are condensed) that have radioactive 
DNA in their nuclei is then determined and plotted as 

a function of time after the labeling with radioactive 
thymidine (Figure Q18-14). 


A. Would all cells (including cells at all phases of the cell 
cycle) be expected to contain radioactive DNA after the 
labeling procedure? 


B. Initially, there are no mitotic cells that contain radioactive 
DNA (see Figure 018-14). Why is this? 


C. Explain the rise and fall and then rise again of the curve. 


D. Estimate the length of the G2 phase from this graph. 


QUESTION 18-15 


One of the functions of M-Cdk is to cause a precipitous 
drop in M cyclin concentration halfway through M phase. 
Describe the consequences of this sudden decrease and 
suggest possible mechanisms by which it might occur. 


QUESTION 18-16 


Figure 18-5 shows the rise of M cyclin concentration 

and the rise of M-Cdk activity in cells as they progress 
through the cell cycle. It is remarkable that the M cyclin 
concentration rises slowly and steadily, whereas M-Cdk 
activity increases suddenly. How do you think this difference 
arises? 


QUESTION 18-17 


What is the order in which the following events occur during 
cell division? 


. anaphase 


. metaphase 


. telophase 


A 

B 

C. prometaphase 
D 

E. mitosis 

F. 


prophase 


Where does cytokinesis fit in? 


QUESTION 18-18 


The lifetime of a microtubule in a mammalian cell, between 
its formation by polymerization and its spontaneous 
disappearance by depolymerization, varies with the stage of 
the cell cycle. For an actively proliferating cell, the average 
lifetime is 5 minutes in interphase and 15 seconds in mitosis. 
If the average length of a microtubule in interphase is 

20 um, how long will it be during mitosis, assuming that 

the rates of microtubule elongation due to the addition of 
tubulin subunits in the two phases are the same? 


QUESTION 18-19 


The balance between plus-end directed and minus-end 
directed motor proteins that bind to interpolar microtubules 
in the overlap region of the mitotic spindle is thought to 
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help determine the length of the spindle. How might each 
type of motor protein contribute to the determination of 
spindle length? 


QUESTION 18-20 


Sketch the principal stages of mitosis, using Panel 18-1 
(pp. 628-629) as a guide. Color one sister chromatid and 
follow it through mitosis and cytokinesis. What event 
commits this chromatid to a particular daughter cell? Once 
initially committed, can its fate be reversed? What may 
influence this commitment? 


QUESTION 18-21 


The polar movement of chromosomes during anaphase 

A is associated with microtubule shortening. In particular, 
microtubules depolymerize at the ends at which they are 
attached to the kinetochores. Sketch a model that explains 
how a microtubule can shorten and generate force yet 
remain firmly attached to the chromosome. 


QUESTION 18-22 


Rarely, both sister chromatids of a replicated chromosome 
end up in one daughter cell. How might this happen? What 
could be the consequences of such a mitotic error? 


QUESTION 18-23 


Which of the following statements are correct? Explain your 
answers. 


A. Centrosomes are replicated before M phase begins. 


B. Two sister chromatids arise by replication of the DNA of 
the same chromosome and remain paired as they line up on 
the metaphase plate. 


C. Interpolar microtubules attach end-to-end and are 
therefore continuous from one spindle pole to the other. 


D. Microtubule polymerization and depolymerization 
and microtubule motor proteins are all required for DNA 
replication. 


E. Microtubules nucleate at the centromeres and then 
connect to the kinetochores, which are structures at the 
centrosome regions of chromosomes. 


QUESTION 18-24 


An antibody that binds to myosin prevents the movement 
of myosin molecules along actin filaments (the interaction 
between actin and myosin is described in Chapter 17). 
How do you suppose the antibody exerts this effect? What 
might be the result of injecting this antibody into cells 

(A) on the movement of chromosomes at anaphase or 

(B) on cytokinesis? Explain your answers. 


QUESTION 18-25 


Look carefully at the electron micrographs in Figure 
18-38. Describe the differences between the cell that 
died by necrosis and those that died by apoptosis. How 
do the pictures confirm the differences between the two 
processes? Explain your answer. 
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QUESTION 18-26 


Which of the following statements are correct? Explain your 
answers. 


A. Cells do not pass from G4 into M phase of the cell cycle 
unless there are sufficient nutrients to complete an entire 
cell cycle. 


B. Apoptosis is mediated by special intracellular proteases, 
one of which cleaves nuclear lamins. 


C. Developing neurons compete for limited amounts of 
survival factors. 


D. Some vertebrate cell-cycle control proteins function 
when expressed in yeast cells. 


E. The enzymatic activity of a Cdk protein is determined 
both by the presence of a bound cyclin and by the 
phosphorylation state of the Cdk. 

QUESTION 18-27 


Compare the rules of cell behavior in an animal with the 
rules that govern human behavior in society. What would 
happen to an animal if its cells behaved as people normally 
behave in our society? Could the rules that govern cell 
behavior be applied to human societies? 

QUESTION 18-28 


In his highly classified research laboratory, Dr. Lawrence M. 
is charged with the task of developing a strain of dog-sized 
rats to be deployed behind enemy lines. In your opinion, 
which of the following strategies should Dr. M. pursue to 
increase the size of rats? 


A. Block all apoptosis. 
B. Block p53 function. 


C. Overproduce growth factors, mitogens, or survival 
factors. 


Explain the likely consequences of each option. 


QUESTION 18-29 

PDGF is encoded by a gene that can cause cancer when 
expressed inappropriately. Why do cancers not arise at 
wounds in which PDGF is released from platelets? 
QUESTION 18-30 

What do you suppose happens in mutant cells that 

A. cannot degrade M cyclin? 

B. always express high levels of p21? 


C. cannot phosphorylate Rb? 


QUESTION 18-31 


Liver cells proliferate excessively both in patients with 
chronic alcoholism and in patients with liver cancer. What 
are the differences in the mechanisms by which cell 
proliferation is induced in these diseases? 


Sexual Reproduction 
and Genetics 


Individual cells reproduce by replicating their DNA and dividing in two. 
This basic process of cell proliferation occurs in all living species—in the 
cells of multicellular organisms and in free-living cells such as bacteria 
and yeasts—and it allows each cell to pass on its genetic information to 
future generations. 


Yet reproduction in a multicellular organism—in a fish or a fly, a person 
or a plant—is a much more complicated affair. It entails elaborate devel- 
opmental cycles, in which all of the organism’s cells, tissues, and organs 
must be generated afresh from a single cell. This starter cell is no ordi- 
nary cell. It has a very peculiar origin: for most animal and plant species, 
the single cell from which an organism arises is produced by the union 
of a pair of cells that hail from two completely separate individuals—a 
mother and a father. As a result of this cell fusion—a central event in 
sexual reproduction—two genomes merge to form the genome of a new 
individual. The mechanisms that govern genetic inheritance in sexually 
reproducing organisms are therefore different, and more complex, than 
those that operate in organisms that pass on their genetic information 
asexually—by a straightforward cell division or by budding off a brand 
new individual. 


In this chapter, we explore the cell biology of sexual reproduction. We 
discuss what organisms gain from sex, and we describe how they do it. 
We examine the reproductive cells produced by males and females, and 
we explore the specialized form of division, called meiosis, that generates 
them. We discuss how Gregor Mendel, a nineteenth-century Austrian 
monk, deduced the basic logic of genetic inheritance by studying the 
progeny of pea plants. Finally, we describe how scientists can exploit 
the genetics of sexual reproduction to gain insights into human biology, 
human origins, and the molecular underpinnings of human disease. 


THE BENEFITS OF SEX 


MEIOSIS AND FERTILIZATION 


MENDEL AND THE LAWS OF 
INHERITANCE 


GENETICS AS AN 
EXPERIMENTAL TOOL 


EXPLORING HUMAN GENETICS 
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Figure 19-1 A hydra reproduces by 
budding. This form of asexual reproduction 
involves the production of buds (arrows), 
which eventually pinch off to form progeny 
that are genetically identical to their parent. 
(Courtesy of Amata Hornbruch.) 


THE BENEFITS OF SEX 


Most of the creatures we see around us reproduce sexually. However, 
many organisms, especially those invisible to the naked eye, can produce 
offspring without resorting to sex. Most bacteria and other single-celled 
organisms multiply by simple cell division. Many plants also reproduce 
asexually, forming multicellular offshoots that later detach from the par- 
ent to make new independent plants. Even in the animal kingdom, there 
are species that can procreate without sex. Hydra produce young by bud- 
ding (Figure 19-1). Certain worms, when split in two, can regenerate the 
“missing halves” to form two complete individuals. And in some species 
of insects, lizards, and even birds, the females can lay eggs that develop 
parthenogenetically—without the help of males, sperm, or fertilization— 
into healthy daughters that can also reproduce the same way. 


But while such forms of asexual reproduction are simple and direct, 
they give rise to offspring that are genetically identical to the parent 
organism. Sexual reproduction, on the other hand, involves the mixing 
of DNA from two individuals to produce offspring that are genetically 
distinct from one another and from both their parents. In this section, we 
outline briefly the cellular mechanisms that make this mode of reproduc- 
tion possible. Sexual reproduction apparently has great advantages, as 
the vast majority of plants and animals on Earth have adopted it. 


Sexual Reproduction Involves both Diploid and 
Haploid Cells 


Organisms that reproduce sexually are generally diploid: their cells con- 
tain two sets of chromosomes—one inherited from each parent. The 
maternal chromosome set and the paternal chromosome set are very 
similar, except for their sex chromosomes, which, in many species, distin- 
guish males from females. In humans, for example, the Y chromosome 
carries genes that specify the development of a male. Males inherit a Y 
chromosome from their father and an X chromosome from their mother; 
females inherit one X chromosome from each parent. Aside from these 
sex chromosomes, the maternal and paternal versions of every chro- 
mosome—called the maternal and paternal homologs, or homologous 
chromosomes—carry the same set of genes. Each diploid cell, therefore, 
carries two copies of every gene (except for those found on the sex chro- 
mosomes, which may be present in only one copy). 


Unlike the majority of cells in a diploid organism, the specialized cells 
that carry out the central process in sexual reproduction—the gametes— 
are haploid: they each contain only one set of chromosomes. For most 
organisms, the males and females produce different types of gametes. In 
animals, one is large and nonmotile and is referred to as the egg; the other 
is small and motile and is referred to as the sperm (Figure 19-2). These 
two dissimilar haploid gametes join together to regenerate a diploid cell, 
called the fertilized egg, or zygote, which has homologous chromosomes 
from both the mother and father. The zygote thus produced develops into 
a new individual with a diploid set of chromosomes that is distinct from 
that of either parent (Figure 19-3). 


For almost all multicellular animals, including vertebrates, most of the 
life cycle is spent in the diploid state. The haploid cells exist only briefly 
and are highly specialized for their function as genetic ambassadors. 
These haploid gametes are generated from diploid precursor cells by a 
specialized form of reductive division called meiosis. This precursor cell 
lineage is called the germ line. The cells forming the rest of the ani- 
mal’s body—the somatic cells—ultimately leave no progeny of their own 
(Figure 19-4 and see Figure 9-3). They exist, in effect, only to help the 
cells of the germ line survive and propagate. 


a 
25 um 


The sexual reproductive cycle thus involves an alternation of haploid 
cells, each carrying a single set of chromosomes, with generations of 
diploid cells, each carrying two sets of chromosomes. One benefit of this 
arrangement is that it allows sexually reproducing organisms to produce 
offspring that are genetically diverse, as we discuss next. 


Sexual Reproduction Generates Genetic Diversity 


Sexual reproduction produces novel chromosome combinations. During 
meiosis, the maternal and paternal chromosome sets in the diploid germ- 
line cells are partitioned into the single chromosome sets of the gametes. 
Each gamete will receive a mixture of maternal homologs and paternal 
homologs; when the genomes of two gametes combine during fertiliza- 
tion, they produce a zygote with a unique chromosomal complement. 


If the maternal and paternal homologs carry the same genes, why should 
such chromosomal reassortment matter? One answer is that although 
the set of genes on each homolog is the same, the paternal and mater- 
nal version of each gene is not. Genes occur in variant versions, called 
alleles, with slightly different DNA sequences. For any given gene, many 
different alleles may be present in the “gene pool” of a species. The exist- 
ence of these variant alleles means that the two copies of any given gene 
in a particular individual are likely to be somewhat different from each 
other—and from those carried by other individuals. What makes indi- 
viduals within a species genetically unique is the inheritance of different 
combinations of alleles. And with its cycles of diploidy, meiosis, haploidy, 
and cell fusion, sex breaks up old combinations of alleles and generates 
new ones. 


Sexual reproduction also generates genetic diversity through a second 
mechanism—homologous recombination. We discuss this process, which 
scrambles the genetic information on each chromosome during meiosis, 
a bit later. 


Figure 19-3 Sexual reproduction involves both haploid and diploid cells. 
Sperm and egg are produced by meiosis of diploid germ-line cells. During 
fertilization, a haploid egg and a haploid sperm fuse to form a diploid zygote. 
For simplicity, only one chromosome is shown for each gamete, and the sperm 
cell has been greatly enlarged. Human gametes have 23 chromosomes, and 
the egg is much larger than the sperm (see, for example, Figure 19-2). 


The Benefits of Sex 


Figure 19-2 Despite their tremendous 
difference in size, sperm and egg 
contribute equally to the genetic 
character of the offspring. This difference 
in size between male and female gametes 
(in which eggs contain a large quantity of 
cytoplasm, whereas sperm contain almost 
none) is consistent with the fact that the 
cytoplasm is not the basis of inheritance. 
If it were, the female’s contribution to the 
makeup of the offspring would be much 
greater than the male’s. Shown here is a 
scanning electron micrograph of an egg 
with human sperm bound to its surface. 
Although many sperm are bound to the 
egg, only one will fertilize it. (David M. 
Philips/Science Source.) 
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Figure 19-4 Germ-line cells and somatic 
cells carry out fundamentally different 
functions. In sexually reproducing animals, 
diploid germ-line cells, which are specified 
early in development, give rise to haploid 
gametes by meiosis. The gametes 
propagate genetic information into the 
next generation. Somatic cells (gray) form 
the body of the organism and are therefore 
necessary to support sexual reproduction, 
but they themselves leave no progeny. 
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Sexual Reproduction Gives Organisms a Competitive 
Advantage in a Changing Environment 


The processes that generate genetic diversity during meiosis operate at 
random, and so the collection of alleles an individual receives from each 
parent is just as likely to represent a combination that is inferior as it is 
an improvement. Why, then, should the ability to try out new genetic 
combinations give organisms that reproduce sexually an evolutionary 
advantage over those that “breed true” through an asexual process? This 
question continues to perplex evolutionary geneticists, but one advan- 
tage seems to be that reshuffling genetic information through sexual 
reproduction can help a species survive in an unpredictably variable 
environment. If two parents produce many offspring with a wide variety 
of gene combinations, they increase the odds that at least one of their 
progeny will have a combination of features necessary for survival in a 
variety of environmental conditions. They are more likely, for example, to 
survive infections by bacteria, viruses, and parasites, which themselves 
continually change in a never-ending evolutionary battle. This genetic 
gamble may explain why even unicellular organisms, such as yeasts, 
intermittently indulge in a simple form of sexual reproduction. Typically, 
they switch on this behavior as an alternative to ordinary cell division 
when times are hard and starvation looms. Yeasts with a genetic defect 
that makes them unable to reproduce sexually show a reduced ability to 
adapt when they are subjected to harsh conditions. 


Sexual reproduction may also be advantageous for another reason. In 
any population, new mutations continually occur, giving rise to new 
alleles—and many of these new mutations may be harmful. Sexual repro- 
duction can speed up the elimination of these deleterious alleles and help 
to prevent them from accumulating in the population. By mating with 
only the fittest males, females select for good combinations of alleles and 
allow bad combinations to be lost from the population more efficiently 
than they would otherwise be. 


Whatever its advantages, sex has clearly been favored by evolution. In 
the following section, we review the central features of this popular form 
of reproduction, beginning with meiosis, the process by which gametes 
are formed. 


MEIOSIS AND FERTILIZATION 


Our modern understanding of the fundamental cycle of events involved 
in sexual reproduction grew out of discoveries made in the late 1800s, 
when biologists noted that the fertilized eggs of a parasitic roundworm 
contain four chromosomes, whereas the worm’s gametes (sperm and 
eggs) contain only two. Gametes must be therefore produced by a special 
kind of “reductive” division in which the number of chromosomes is pre- 
cisely halved (see Figure 19-3). The term meiosis was coined to describe 
this form of cell division; it comes from a Greek word meaning “diminu- 
tion,” or “lessening.” 


From these early experiments on roundworms and other species, it 
became clear that the behavior of the chromosomes, which at that time 
were simply microscopic bodies of unknown function, matched the pat- 
tern of inheritance, in which the two parents make equal contributions 
to the character of the progeny despite the enormous difference in size 
between egg and sperm (see Figure 19-2). These observations were 
among the first clues that chromosomes contain the material of hered- 
ity. The study of sexual reproduction and meiosis therefore has a central 
place in the history of cell biology. 


In this section, we describe the cell biology of sexual reproduction from a 
modern point of view, focusing on the elaborate dance of chromosomes 
that occurs when a cell undertakes meiosis. We take a close look at how 
homologous chromosomes pair, recombine, and are segregated during 
meiosis, thereby shuffling the maternal and paternal genes into novel 
combinations. We also discuss what happens when meiosis goes awry. 
Finally, we consider briefly the process of fertilization, through which 
gametes come together to form a new, genetically distinct individual. 


Meiosis Involves One Round of DNA Replication 
Followed by Two Rounds of Nuclear Division 


Before a diploid cell divides by mitosis, it duplicates all of its chromo- 
somes. This duplication allows a full set of chromosomes—including a 
complete maternal set plus a complete paternal set—to be transmitted to 
each daughter cell (discussed in Chapter 18). Although meiosis ultimately 
halves this diploid chromosome complement—producing haploid gam- 
etes that carry only a single set of chromosomes—it, too, begins with a 
round of chromosome duplication. The subsequent reduction in chromo- 
some number occurs because this single round of duplication is followed 
by two successive rounds of nuclear division, without further DNA repli- 
cation (Figure 19-5). 


It seems like it would be simpler and more direct if meiosis instead took 
place by a modified form of mitotic cell division in which DNA replication 
(S phase) were omitted completely; a single round of division could then, 
in theory, produce two haploid cells directly. But, for reasons that are still 
unclear, this is not the way meiosis works. 


Meiosis begins in specialized germ-line cells that reside in the ovaries 
or testes. Like somatic cells, these germ-line cells are diploid; each con- 
tains two copies of every chromosome—a paternal homolog, inherited 
from the individual's father, and a maternal homolog, inherited from the 
mother. In the first step of meiosis, all of these chromosomes are dupli- 
cated, and the resulting copies remain closely attached to each other, as 
they would during an ordinary mitosis (see Chapter 18). 


The next phase of the process, however, is unique to meiosis. During this 
phase, called meiotic prophase, each duplicated paternal chromosome 
locates and then attaches itself along its entire length to the corresponding 
duplicated maternal homolog. This process, called pairing, is of funda- 
mental importance in meiosis, as it allows the segregation of homologous 
chromosome pairs during the first meiotic division (meiosis I). 


The two duplicated chromosomes within each homolog are then sep- 
arated during the second meiotic division (meiosis II), producing four 
haploid nuclei. Because chromosome segregation during meiosis I and II 
is random, each haploid gamete will receive a different mixture of mater- 
nal and paternal chromosomes. 


Thus, meiosis produces four nuclei that are genetically dissimilar and 
that contain exactly half as many chromosomes as the original parent 
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Figure 19-5 Mitosis and meiosis both 
begin with a round of chromosome 
duplication. In mitosis, chromosome 
duplication is followed by a single round 
of cell division to yield two diploid nuclei. 
In meiosis, chromosome duplication in a 
diploid germ-line cell is followed by two 
rounds of division, without further DNA 
replication, to produce four haploid nuclei. 
N represents the number of chromosomes 
in the haploid nucleus. 
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germ-line cell. Mitosis, in contrast, produces two genetically identical 
diploid nuclei. Figure 19-6 summarizes the molecular events that distin- 
guish these two types of division—differences we now discuss in greater 
detail, beginning with the meiosis-specific pairing of maternal and pater- 
nal chromosomes. 
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Figure 19—6 Meiosis generates four nonidentical haploid nuclei, whereas mitosis produces two identical diploid nuclei. As in Figure 
19-3, only one pair of homologous chromosomes is shown. (A) In meiosis, chromosome duplication is followed by two meiotic divisions 
to produce haploid nuclei. Each diploid nucleus that enters meiosis therefore produces four haploid nuclei, which are then packaged into 
haploid gametes by specialized forms of cytokinesis. (B) In mitosis, each diploid nucleus produces two diploid nuclei, which are packaged 
by cytokinesis into two diploid cells. Although mitosis and meiosis Il are similar processes that are usually accomplished within hours, 
meiosis | can last days, months, or even years, because of the time required for homolog pairing before meiosis |. 
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Duplicated Homologous Chromosomes Pair During 
Meiotic Prophase 


Before a eukaryotic cell divides—by either meiosis or mitosis—it first 
duplicates all of its chromosomes. The twin copies of each duplicated 
chromosome, called sister chromatids, are tightly linked along their 
length. The way these duplicated chromosomes are handled subse- 
quently, however, differs between meiosis and mitosis. In mitosis, as we 
discuss in Chapter 18, the duplicated chromosomes line up, single file, at 
the metaphase plate (Figure 19-7A). They are then segregated into the 
two daughter nuclei. 


In meiosis, however, the need to halve the number of chromosomes intro- 
duces an extra demand on the cell-division machinery. The germ-line 
cell must keep track of the maternal and paternal homologs—to ensure 
that each of the four haploid cells produced by meiosis will receive a 
single sister chromatid from each chromosome set. Meiosis therefore 
begins with a complex and time-consuming process called pairing, in 
which duplicated homologs are brought together during a stage called 
meiotic prophase (or prophase 1). It is these pairs of duplicated homologs 
that line up at the metaphase plate in meiosis I (Figure 19-7B). Each 
pairing forms a structure called a bivalent, in which all four sister chro- 
matids stick together until the cell is ready to divide (Figure 19-8). 
The maternal and paternal homologs will separate during meiosis I, 
and the individual sister chromatids will separate during meiosis II. 


How the homologs (and the two sex chromosomes) recognize each other 
during pairing is still not fully understood. In many organisms, the ini- 
tial association depends on an interaction between matching maternal 
and paternal DNA sequences at numerous sites that are widely dispersed 
along the homologous chromosomes. Once formed, bivalents are very 
stable: they remain associated throughout meiotic prophase, a stage that 
in some organisms can last for years. 


Crossing-Over Occurs Between the Duplicated Maternal 
and Paternal Chromosomes in Each Bivalent 
The picture of meiosis we have just painted is greatly simplified, in that 


it leaves out a crucial feature. In sexually reproducing organisms, the 
pairing of the maternal and paternal chromosomes is accompanied by 
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Figure 19-7 During meiosis I, duplicated 
homologous chromosomes pair before 
lining up on the meiotic spindle. (A) In 
mitosis, the individual duplicated maternal 
(M) and paternal (P) chromosomes line up 
independently at the metaphase plate; each 
consists of a pair of sister chromatids, which 
will separate just before the cell divides. 

(B) By contrast, in meiosis, duplicated 
maternal and paternal homologs pair before 
lining up at the metaphase plate. The 
maternal and paternal homologs separate 
during the first meiotic division, and the 
sister chromatids separate during meiosis Il. 
The mitotic and meiotic spindles are shown 
in green. 
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Figure 19-8 Duplicated maternal and 
paternal chromosomes pair during 
meiosis | to form bivalents. Each bivalent 
contains four sister chromatids. 
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FOUR CHROMATIDS IN A BIVALENT 
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Figure 19-9 During meiosis I, non-sister chromatids in each 
bivalent swap segments of DNA. The process begins when protein 
complexes that carry out homologous recombination (not shown) 
produce a double-strand break in the DNA of one of the chromatids. 
(Here, the maternal chromatid has been broken, but the paternal 
chromatid is equally vulnerable.) These proteins then promote the 
formation of a cross-strand exchange with the undamaged chromatid. 
When this exchange is resolved, each chromatid contains a segment 
of DNA from the other. Many of the steps that produce chromosome 
crossovers during meiosis resemble those that guide the repair of 
DNA double-strand breaks in somatic cells (see Figure 6—31). 


homologous recombination, a process in which two identical or very 
similar nucleotide sequences exchange genetic information. In Chapter 
6, we discussed how homologous recombination is used to mend dam- 
aged chromosomes from which genetic information has been lost. This 
type of repair uses information from an intact DNA double helix to restore 
the correct nucleotide sequence to a damaged, newly duplicated sister 
chromatid (see Figure 6-31). 


A similar process takes place when homologous chromosomes pair dur- 
ing the long prophase of meiosis I. In this case, the recombination occurs 
between the non-sister chromatids in each bivalent, rather than between 
the identical sister chromatids within each duplicated chromosome. In 
the process, the maternal and paternal homologs can physically swap 
homologous chromosomal segments, an event called crossing-over 
(Figure 19-9). 


Crossing-over is a complex, multistep process that is facilitated by the 
formation of a synaptonemal complex. As the duplicated homologs pair, 
this elaborate protein complex helps to hold the bivalent together and 
align the homologs so that strand exchange can readily occur between 
the non-sister chromatids (Figure 19-10). Each of the chromatids in a 
duplicated homolog (that is, each of these very long DNA double helices) 
can form a crossover with either (or both) of the chromatids from the 
other chromosome in the bivalent. 
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Figure 19-10 The synaptonemal complex helps to align the duplicated homolog 
pairs. The sister chromatids in the maternal (red) and paternal (blue) homologs 

are held together by a protein complex called the axial core (gray), which interacts 
with the cohesins (green) that link the sisters together (see Figure 18-18). When the 
duplicated homologs pair, the axial cores associated with each are pulled closely 
together in a zipperlike fashion by a set of rod-shaped transverse filaments (yellow), 
forming the synaptonemal complex. 
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By the time meiotic prophase ends, the synaptonemal complex has disas- 
sembled, allowing the homologs to separate along most of their length. 
But each bivalent remains held together by at least one chiasma (plu- 
ral chiasmata), a structure named after the Greek letter chi, X, which is 
shaped like a cross. Each chiasma corresponds to a crossover between 
two non-sister chromatids (Figure 19—11A). Most bivalents contain more 
than one chiasma, indicating that multiple crossovers occur between 
homologous chromosomes (Figure 19—11B and C). In human oocytes— 
the cells that give rise to the egg—an average of two to three crossover 
events occur within each bivalent (Figure 19-12). 


Crossovers that take place during meiosis are a major source of genetic 
variation in sexually reproducing species. By scrambling the genetic con- 
stitution of each of the chromosomes in the gamete, crossing-over helps 
to produce individuals with novel assortments of alleles. But crossing- 
over also has a second important role in meiosis: it helps ensure that the 
maternal and paternal homologs will segregate from one another cor- 
rectly at the first meiotic division, as we discuss next. 


Chromosome Pairing and Crossing-Over Ensure the 
Proper Segregation of Homologs 


In most organisms, crossing-over during meiosis is required for the cor- 
rect segregation of the two duplicated homologs into separate daughter 
nuclei. The chiasmata created by crossover events keep the maternal 
and paternal homologs bundled together until the spindle separates 
them during meiotic anaphase I. Before anaphase I, the two poles of the 
spindle pull on the duplicated homologs in opposite directions, and the 
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Figure 19—11 Crossover events create 
chiasmata between non-sister chromatids 
in each bivalent. (A) Schematic set of 
paired homologs in which one crossover 
event has occurred, creating a single 
chiasma. (B) Micrograph of a grasshopper 
bivalent with three chiasmata. (C) Schematic 
of the three crossovers between the paired 
homologs in (B). Each sister chromatid is 
numbered. (B, courtesy of Bernard John.) 


Figure 19—12 Multiple crossovers 

can occur between the duplicated 
homologous chromosomes in a bivalent. 
Fluorescence micrograph shows a spread 
of chromosomes from a human oocyte 
(egg-cell precursor) at the stage where 

both maternal and paternal homologs are 
still tightly associated: each single long 
thread (stained red) is a bivalent, containing 
four sister chromatids. Sites of crossing- 
over between the chromatids within each 
bivalent are marked by the presence of 

a protein (stained green) that is a key 
component of the meiotic recombination 
machinery. Blue staining marks the positions 
of centromeres (see Figure 19-8). (From 

C. Tease et al., Am. J. Hum. Genet. 70: 
1469-1479, 2002.) 
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Figure 19-13 Chiasmata help ensure 
proper segregation of duplicated 
homologs during the first meiotic 
division. (A) In metaphase of meiosis I, 
chiasmata created by crossing-over hold the 
maternal and paternal homologs together. 
At this stage, cohesin proteins keep the 
sister chromatids glued together along 
their entire length (see Figure 19-10). The 
kinetochores of sister chromatids function as 
a single unit in meiosis |, and microtubules 
that attach to them point toward the same 
spindle pole. (B) At anaphase of meiosis |, 
the cohesins holding the arms of the 

sister chromatids together are suddenly 
degraded, allowing the homologs to be 
separated. Cohesins at the centromere 
continue to hold the sister chromatids 
together as the homologs are pulled apart. 
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chiasmata resist this pulling (Figure 19-13A). In so doing, the chiasmata 
help to position and stabilize bivalents at the metaphase plate. 


In addition to the chiasmata, which hold the maternal and paternal 
homologs together, cohesin proteins (described in Chapter 18) keep the 
sister chromatids glued together along their entire length at meiosis I 
(see Figure 19-10). At the start of anaphase I, the cohesin proteins that 
hold the chromosome arms together are suddenly degraded. This release 
allows the arms to separate and the recombined homologs to be pulled 
apart (Figure 19—13B). If the arms were not released in this way, the 
duplicated maternal and paternal homologs would remain tethered to 
one another by the homologous DNA segments they had exchanged and 
would not separate during anaphase I. 


The Second Meiotic Division Produces Haploid 
Daughter Nuclei 


To separate the sister chromatids and produce cells with a haploid 
amount of DNA, a second round of division, meiosis II, follows soon after 
the first—without further DNA replication or any significant interphase 
period. A meiotic spindle forms, and the kinetochores on each pair of 
sister chromatids now attach to kinetochore microtubules that point in 
opposite directions, as they would in an ordinary mitotic division. At ana- 
phase of meiosis II, the remaining, meiosis-specific cohesins—located at 
the centromere—are degraded, and the sister chromatids are pulled apart 
(Figure 19—14). The entire process is shown in Movie 19.1. 


Haploid Gametes Contain Reassorted Genetic 
Information 


Even though they share the same parents, no two siblings are geneti- 
cally the same (unless they are identical twins). These genetic differences 
are initiated long before sperm meets egg, when meiosis I produces two 
kinds of randomizing genetic reassortment. 


First, as we have seen, the maternal and paternal chromosomes are shuf- 
fled and dealt out randomly during meiosis I. Although the chromosomes 
are carefully distributed so that each gamete receives one and only one 
copy of each chromosome, the choice between the maternal or paternal 
homolog is made by chance, like the flip of a coin. Thus, each gamete 
contains the maternal versions of some chromosomes and the paternal 
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versions of others (Figure 19-15A). This random assortment depends 
solely on the way each bivalent happens to be positioned when it lines 
up on the spindle during metaphase of meiosis I. Whether the maternal 
or paternal homolog is captured by the microtubules from one pole or the 
other depends on which way the bivalent is facing when the microtubules 
connect to its kinetochore (see Figure 19-13). Because the orientation of 
each bivalent at the moment of capture is completely random, the assort- 
ment of maternal and paternal chromosomes is random as well. 


Thanks to this random reassortment of maternal and paternal homologs, 
an individual could in principle produce 2" genetically different gametes, 
where n is the haploid number of chromosomes. With 23 chromosomes to 
choose from, each human, for example, could in theory produce 27°—or 
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Figure 19—14 In meiosis Il, as in mitosis, 
the kinetochores on each sister chromatid 
function independently, allowing the 
two sister chromatids to be pulled to 
opposite poles. (A) In metaphase of 
meiosis Il, the kinetochores of the sister 
chromatids point in opposite directions. 
(B) At anaphase of meiosis Il, the cohesins 
holding the sister chromatids together at 
the centromere are degraded, allowing 
kinetochore microtubules to pull the sister 
chromatids to opposite poles. 


QUESTION 19-1 


Why do you think that organisms do 
not use the first steps of meiosis (up 
to and including meiotic division l) 
for the ordinary mitotic division of 
somatic cells? 


Figure 19-15 Two kinds of genetic 
reassortment generate new chromosome 
combinations during meiosis. (A) The 
independent assortment of the maternal 
and paternal homologs during meiosis 
produces 2" different haploid gametes 

for an organism with n chromosomes. 

Here n = 3, and there are 23, or 8, 

different possible gametes. For simplicity, 
chromosome crossing-over is not shown 
here. (B) Crossing-over during meiotic 
prophase exchanges segments of DNA 
between homologous chromosomes and 
thereby reassorts genes on each individual 
chromosome. For simplicity, only a single 
pair of homologous chromosomes is shown. 
Both independent chromosome assortment 
and crossing-over occur during every 
meiosis. 
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QUESTION 19-2 


Ignoring the effects of chromosome 
crossovers, an individual human can 
in principle produce 223 = 8.4 x 10° 
genetically different gametes. How 
many of these possibilities can be 
“sampled” in the average life of 

(A) a female and (B) a male, given 
that women produce one egg a 
month during their fertile years, 
whereas men can make hundreds 
of millions of sperm each day? 


Figure 19-16 Errors in chromosome 
segregation during meiosis can result 

in gametes with incorrect numbers 

of chromosomes. In this example, the 
duplicated maternal and paternal copies 
of Chromosome 21 fail to separate 
normally during the first meiotic division. 
As a result, two of the gametes receive 

no copy of the chromosome, while the 
other two gametes receive two copies. For 
simplicity, only Chromosome 21 is shown. 
Gametes that receive an incorrect number 
of chromosomes are called aneuploid 
gametes. If one of them participates in the 
fertilization process, the resulting zygote 
will also have an abnormal number of 
chromosomes. A child that receives three 
copies of Chromosome 21 will have Down 
syndrome. 


8.4 x 10°—genetically distinct gametes. The actual number of different 
gametes each person can produce, however, is much greater than that, 
because the crossing-over that takes place during meiosis provides a sec- 
ond source of randomized genetic reassortment. Between two and three 
crossovers occur on average between each pair of human homologs, 
generating new chromosomes with novel assortments of maternal and 
paternal alleles. Because crossing-over occurs at more or less random 
sites along the length of a chromosome, each meiosis will produce four 
sets of entirely novel chromosomes (Figure 19—15B). 


Taken together, the random reassortment of maternal and paternal chro- 
mosomes, coupled with the genetic mixing of crossing-over, provides a 
nearly limitless source of genetic variation in the gametes produced by a 
single individual. Considering that every person is formed by the fusion 
of such gametes, produced by two completely different individuals, the 
richness of human variation that we see around us, even within a single 
family, should not be very surprising. 


Meiosis Is Not Flawless 


The sorting of chromosomes that takes place during meiosis is a remark- 
able feat of molecular bookkeeping: in humans, each meiosis requires 
that the starting cell keep track of 92 chromosomes (23 pairs, each of 
which has duplicated), handing out one complete set to each gamete. 
Not surprisingly, mistakes can occur in the distribution of chromosomes 
during this elaborate process. 


Occasionally, homologs fail to separate properly—a phenomenon known 
as nondisjunction. As a result, some of the haploid cells that are produced 
lack a particular chromosome, while others have more than one copy. 
Upon fertilization, such gametes form abnormal embryos, most of which 
die. Some, however, survive. Down syndrome, for example—a disorder 
associated with cognitive disability and characteristic physical abnormal- 
ities—is caused by an extra copy of Chromosome 21. This error results 
from nondisjunction of a Chromosome 21 pair during meiosis I, giving 
rise to a gamete that contains two copies of that chromosome instead 
of one (Figure 19-16). When this abnormal gamete fuses with a normal 
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gamete at fertilization, the resulting embryo contains three copies of 
Chromosome 21 instead of two. This chromosome imbalance produces 
an extra dose of the proteins encoded by Chromosome 21 and thereby 
interferes with the proper development of the embryo and normal func- 
tions in the adult. 


The frequency of chromosome mis-segregation during the production 
of human gametes is remarkably high, particularly in females: nondis- 
junction occurs in about 10% of the meioses in human oocytes, giving 
rise to eggs that contain the wrong number of chromosomes (a condi- 
tion called aneuploidy). Aneuploidy occurs less often in human sperm, 
perhaps because sperm development is subjected to more stringent qual- 
ity control than egg development. If meiosis goes wrong in male cells, 
a cell-cycle checkpoint mechanism is activated, arresting meiosis and 
leading to cell death by apoptosis. Regardless of whether the segregation 
error occurs in the sperm or the egg, nondisjunction is thought to be one 
reason for the high rate of miscarriages—spontaneous early pregnancy 
losses—in humans. 


Fertilization Reconstitutes a Complete Diploid Genome 


Having seen how chromosomes are parceled out during meiosis to 
form haploid germ cells, we now briefly consider how they are reunited 
in the process of fertilization to form a new cell with a diploid set of 
chromosomes. 


Of the 300 million human sperm ejaculated during sexual intercourse, 
only about 200 reach the site of fertilization in the oviduct. Sperm are 
attracted to an ovulated egg by chemical signals released by both the 
egg and the supporting cells that surround it. Once a sperm finds the 
egg, it must migrate through a protective layer of cells and then bind to, 
and tunnel through, the egg coat, called the zona pellucida. Finally, the 
sperm must bind to and fuse with the underlying egg plasma membrane 
(Figure 19-17). Although fertilization normally occurs by this process 
of sperm-egg fusion, it can also be achieved artificially by injecting the 
sperm directly into the egg cytoplasm; this is often done in infertility clin- 
ics when there is a problem with natural sperm-egg fusion. 


Although many sperm may bind to an egg (see Figure 19-2), only one 
normally fuses with the egg plasma membrane and introduces its DNA 
into the egg cytoplasm. The control of this step is especially important 
because it ensures that the fertilized egg—also called a zygote—will 
contain two, and only two, sets of chromosomes. Several mechanisms 
prevent multiple sperm from entering an egg. In one mechanism, the first 
successful sperm triggers the release of a wave of Ca?* ions in the egg 
cytoplasm. This flood of Ca?*+ in turn triggers the secretion of enzymes 
that cause a “hardening” of the zona pellucida, which prevents “runner 
up” sperm from penetrating the egg. The Ca** wave also helps trigger the 
development of the egg. To watch a fertilization-induced calcium wave, 
see Movie 19.2. 


The process of fertilization is not complete, however, until the two 
haploid nuclei (called pronuclei) come together and combine their chro- 
mosomes into a single diploid nucleus. Soon after the pronuclei fuse, 
the diploid cell begins to divide, forming a ball of cells that—through 
repeated rounds of cell division and differentiation—will give rise to 
an embryo and, eventually, an adult organism. Fertilization marks the 
beginning of one of the most remarkable phenomena in all of biology— 
the process by which a single-celled zygote initiates the developmental 
program that directs the formation of a new individual. 
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Figure 19-17 A sperm binds to the plasma 
membrane of an egg. Shown here is a 
scanning electron micrograph of a human 
sperm coming in contact with a hamster 
egg. The egg has been stripped of its zona 
pellucida, exposing the plasma membrane, 
which is covered in fingerlike microvilli. Such 
uncoated hamster eggs were sometimes 
used in infertility clinics to assess whether a 
man’s sperm were capable of penetrating 
an egg. The zygotes resulting from this test 
do not develop. (David M. Phillips/ 

The Population Council/Science Source.) 
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Figure 19—18 One disproven theory 

of inheritance suggested that genetic 
traits are passed down solely from the 
father. In support of this particular theory 
of uniparental inheritance, some early 
microscopists fancied they could see a 
small, perfectly formed human crouched 
inside the head of each sperm. 


& 


Figure 19—19 Some people taste it, 
some people don't. The ability to taste 
the chemical phenylthiocarbamide (PTC) 

is governed by a single gene. Although 
geneticists have known since the 1930s that 
the inability to taste PTC is inherited, it was 
not until 2003 that researchers identified 
the responsible gene, which encodes a 
bitter-taste receptor. Nontasters produce 

a PTC receptor protein with amino acid 
substitutions that are thought to reduce the 
receptor's activity. 


MENDEL AND THE LAWS OF INHERITANCE 


In organisms that reproduce asexually, the genetic material of the parent 
is transmitted faithfully to its progeny. The resulting offspring are thus 
genetically identical to a single parent. Before Mendel started working 
with peas, some biologists suspected that inheritance might work that 
way in humans (Figure 19-18). 


Although children resemble their parents, they are not carbon copies of 
either the mother or the father. Thanks to the mechanisms of meiosis 
just described, sex breaks up existing collections of genetic information, 
shuffles alleles into new combinations, and produces offspring that tend 
to exhibit a mixture of traits derived from both parents, as well as novel 
ones. The ability to track characteristics that show some variation from 
one generation to the next enabled geneticists to begin to decipher the 
rules that govern heredity in sexually reproducing organisms. 


The simplest traits to follow are those that are easy to see or to meas- 
ure. In humans, these include the tendency to sneeze when exposed to 
bright sun, whether a person's earlobes are attached or pendulous, or the 
ability to detect certain odors or flavors (Figure 19-19). Of course, the 
laws of inheritance were not worked out by studying people with pendu- 
lous earlobes, but by following traits in organisms that are easy to breed 
and that produce large numbers of offspring. Gregor Mendel, the father 
of genetics, focused on peas. But similar breeding experiments can be 
performed in fruit flies, worms, dogs, cats, or any other plant or animal 
that possesses characteristics of interest, because the same basic laws 
of inheritance apply to all sexually reproducing organisms, from peas to 
people. 


In this section, we describe the logic of genetic inheritance in sexually 
reproducing organisms. We see how the behavior of chromosomes dur- 
ing meiosis—their segregation into gametes that then unite at random to 
form genetically unique offspring—explains the experimentally derived 
laws of inheritance. But first, we discuss how Mendel, breeding peas in 
his monastery garden, discovered these laws more than 150 years ago. 


Mendel Studied Traits That Are Inherited in a Discrete 
Fashion 


Mendel chose to study pea plants because they are easy to cultivate in 
large numbers and could be raised in a small space—such as an abbey 
garden. He controlled which plants mated with which by removing sperm 
(pollen) from one plant and brushing it onto the female structures of 
another. This careful cross-pollination ensured that Mendel could be cer- 
tain of the parentage of every pea plant he examined. 


Perhaps more important for Mendel’s purposes, pea plants were avail- 
able in many varieties. For example, one variety has purple flowers, 
another has white. One variety produces seeds (peas) with smooth skin, 
another produces peas that are wrinkled. Mendel chose to follow seven 
traits—including flower color and pea shape—that were distinct, easily 
observable, and, most importantly, inherited in a discrete fashion: for 
example, the plants have either purple flowers or white flowers—nothing 
in-between (Figure 19-20). 


Mendel Disproved the Alternative Theories of 
Inheritance 
The breeding experiments that Mendel performed were straightforward. 


He started with stocks of genetically pure, “true-breeding” plants—those 
that produce offspring of the same variety when allowed to self-fertilize. 


pea pea flower 
shape color color 


one form 
of trait 
(dominant) © O 
round (R) yellow (Y) purple axial flowers 
a second 
form 
of trait © © 
(recessive) 
wrinkled (r) green (y) white terminal flowers 


If he followed pea color, for example, he used plants with yellow peas 
that always produced offspring with yellow peas, and plants with green 
peas that always produced offspring with green peas. 


Mendel's predecessors had focused on organisms that varied in mul- 
tiple traits. These investigators often wound up trying to characterize 
offspring whose appearance differed in such a complex way that they 
could not easily be compared with their parents. But Mendel took the 
unique approach of studying each trait one at a time. In a typical experi- 
ment, he would cross-pollinate two of his true-breeding varieties. He 
then recorded the inheritance of the chosen trait in the next generation. 
For example, Mendel crossed plants producing yellow peas with plants 
producing green peas and discovered that the resulting hybrid offspring, 
called the first filial, or F;, generation, all had yellow peas (Figure 19—21). 
He obtained a similar result for every trait he followed: the F; hybrids all 
resembled only one of their two parents. 


Had Mendel stopped there—observing only the F; generation—he might 
have developed some mistaken ideas about the nature of heredity: these 
results appear to support the theory of uniparental inheritance, which 
states that the appearance of the offspring will match one parent or the 
other. Fortunately, Mendel took his breeding experiments to the next 
step: he crossed the F; plants with one another (or allowed them to self- 
fertilize) and examined the results. 


Mendel’s Experiments Revealed the Existence of 
Dominant and Recessive Alleles 


One look at the offspring of Mendel’s initial cross-fertilization experi- 
ments, such as those shown in Figure 19-21, raises an obvious question: 
what happened to the trait that disappeared in the F; generation? Did 
the plants bearing green peas, for example, fail to make a genetic con- 
tribution to their offspring? To find out, Mendel allowed the F; plants to 
self-fertilize. If the trait for green peas had been lost, then the F; plants 
would produce only plants with yellow peas in the next, F2, genera- 
tion. Instead, he found that the “disappearing trait” returned: although 


flower pod 
position 


inflated 


pinched 
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yellow short 


Figure 19—20 Mendel studied seven traits 
that are inherited in a discrete fashion. 
For each trait, the plants display either one 
variation or the other, with nothing in- 
between. As we will see shortly, one form of 
each trait is dominant, whereas the other is 
recessive. 


ge 


true-breeding 
green-pea 
plants 


ò 


true-breeding 
yellow-pea 
plants 


CROSS- 
FERTILIZATION 


OFFSPRING (F, GENERATION) 
100% yellow-pea plants 


Figure 19-21 True-breeding varieties, 
when cross-fertilized with each other, 
produce hybrid offspring that resemble 
one parent. In this case, true-breeding 
green-pea plants, crossed with true- 
breeding yellow-pea plants, always produce 
offspring with yellow peas. 
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OFFSPRING (F, GENERATION) 


yellow-pea plants 


SELF- 
FERTILIZATION 


ge 


OFFSPRING (F, GENERATION) 


75% yellow- 25% green-pea 
pea plants plants 


Figure 19—22 The appearance of the 

F2 generation shows that an individual 
carries two alleles of each gene. When 
the F; plants in Figure 19-21 are allowed to 
self-fertilize (or are bred with each other), 
25% of the progeny produce green peas. 


three-quarters of the offspring in the Fz generation had yellow peas, one- 
quarter had green peas (Figure 19-22). Mendel saw the same type of 
behavior for each of the other six traits he examined. 


To account for these observations, Mendel proposed that the inheritance 
of traits is governed by hereditary factors (which we now call genes) and 
that these factors come in alternative versions that account for the vari- 
ations seen in inherited characteristics. The gene dictating pea color, for 
example, exists in two “flavors’”—one that directs the production of yellow 
peas and one that directs production of green peas. Such alternative ver- 
sions of a gene are now called alleles, and the whole collection of alleles 
possessed by an individual—its genetic makeup—is called its genotype. 


Mendel’s major conceptual breakthrough was to propose that for each 
characteristic, an organism must inherit two copies, or alleles, of each 
gene—one from its mother and one from its father. The true-breed- 
ing parental strains, he theorized, each possessed a pair of identical 
alleles—the yellow-pea plants possessed two alleles for yellow peas, the 
green-pea plant two alleles for green peas. An individual that possesses 
two identical alleles is said to be homozygous for that trait. The F; hybrid 
plants, on the other hand, had received two dissimilar alleles—one speci- 
fying yellow peas and the other green. These plants were heterozygous 
for the trait of interest. 


The appearance, or phenotype, of the organism depends on which ver- 
sions of each allele it inherits. To explain the disappearance of a trait in 
the F; generation—and its reappearance in the F2 generation—Mendel 
supposed that for any pair of alleles, one allele is dominant and the other 
is recessive, or hidden. The dominant allele, whenever it is present, would 
dictate the plant’s phenotype. In the case of pea color, the allele that 
specifies yellow peas is dominant; the green-pea allele is recessive. 


One important consequence of heterozygosity, and of dominance and 
recessiveness, is that not all of the alleles that an individual carries can 
be detected in its phenotype. Humans have about 24,000 genes, and each 
of us is heterozygous for a very large number of these. Thus, we all carry 
a great deal of genetic information that remains hidden in our personal 
phenotype but that can turn up in future generations. 


Each Gamete Carries a Single Allele for Each Character 


Mendel’s theory—that for every gene, an individual inherits one copy from 
its mother and one copy from its father—raised some logistical issues. If 
an organism has two copies of every gene, how does it pass only one 
copy of each to its progeny? And how do these gene sets come together 
again in the resulting offspring? 


Mendel postulated that when sperm and eggs are formed, the two copies 
of each gene present in the parent separate so that each gamete receives 
only one allele for each trait. For his pea plants, each egg (ovum) and 
each sperm (pollen) receives only one allele for pea color (either yel- 
low or green), one allele for pea shape (round or wrinkled), one allele 
for flower color (purple or white), and so on. During fertilization, sperm 
carrying one or other allele unites with an egg carrying one or other 
allele to produce a fertilized egg or zygote with two alleles. Which sperm 
unites with which egg at fertilization—thus, which alleles the zygote will 
receive—is entirely a matter of chance. 


This principle of heredity is laid out in Mendel’s first law, the law of 
segregation. It states that the two alleles for each trait separate (or seg- 
regate) during gamete formation and then unite at random—one from 
each parent—at fertilization. According to this law, the F; hybrid plants 


Figure 19-23 Parent plants produce gametes that each contain 
one allele for each trait; the phenotype of the offspring depends 
on which combination of alleles it receives. Here we see both the 
genotype and phenotype of the pea plants that were bred in the 
experiments illustrated in Figures 19-21 and 19-22. The true-breeding 
yellow-pea plants produce only Y-bearing gametes; the true-breeding 
green plants produce only y gametes. The F offspring of a cross 
between these parents all produce yellow peas, and they have the 
genotype Yy. When these hybrid plants are bred with each other, 75% 
of the offspring have yellow peas, 25% have green. The gray box at 
the bottom, called a Punnett square after a British mathematician who 
was a follower of Mendel, allows one to track the segregation of alleles 
during gamete formation and to predict the outcomes of breeding 
experiments like the one outlined in Figure 19-22. According to the 
system established by Mendel, capital letters indicate a dominant 
allele and lowercase letters a recessive allele. 


with yellow peas will produce two classes of gametes: half the gametes 
will get a yellow-pea allele and half will get a green-pea allele. When the 
hybrid plants self-pollinate, these two classes of gametes will unite at 
random. Thus, four different combinations of alleles can come together in 
the Fz offspring (Figure 19-23). One-quarter of the Fz plants will receive 
two alleles specifying green peas; these plants will produce green peas. 
One-quarter of the plants will receive two yellow-pea alleles and will 
produce yellow peas. But one-half of the plants will inherit one allele for 
yellow peas and one allele for green. Because the yellow allele is domi- 
nant, these plants—like their heterozygous F; parents—will all produce 
yellow peas. All told, three-quarters of the offspring will have yellow peas 
and one-quarter will have green peas. Thus Mendel’s law of segregation 
explains the 3:1 ratio that he observed in the Fz generation. 


Mendel’s Law of Segregation Applies to All Sexually 
Reproducing Organisms 


Mendel's law of segregation explained the data for every trait he exam- 
ined in pea plants, and he replicated his basic findings with corn and 
beans. But his rules governing inheritance are not limited to plants: they 
apply to all sexually reproducing organisms (Figure 19-24). 


Consider a phenotype in humans that reflects the action of a single gene. 
The major form of albinism—Type II albinism—is a rare condition that 
is inherited in a recessive manner in many animals, including humans. 
Like the pea plants that produce green seeds, albinos are homozygous 
recessive: their genotype is aa. The dominant allele of the gene (denoted 
A) encodes an enzyme involved in making melanin, the pigment respon- 
sible for most of the brown and black color present in hair, skin, and the 
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Figure 19-24 Mendel'’s law of 
segregation applies to all sexually 
reproducing organisms. Dogs are bred 
specifically to enhance certain phenotypic 
traits, including a diverse range of body 
size, coat color, head shape, snout length, 
ear position, and fur patterns. Scientists 
have been conducting genetic analyses 
on scores of dog breeds to search for the 
alleles that underlie these common canine 
characteristics. A single growth factor 
gene has been linked to body size, and 
three additional genes have been shown 
to account for coat length, curliness, and 
the presence or absence of “furnishings“— 
bushy eyebrows and beards—in almost 

all dog breeds. (By Ester Inbar, available 
from http://commons.wikimedia.org/wiki/ 
User:ST.) 
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Figure 19-25 Recessive alleles all follow 
the same Mendelian laws of inheritance. 
Here we trace the inheritance of Type I 
albinism, a recessive trait that is associated 
with a single gene in humans. Note that 
normally pigmented individuals can be 


either homozygous (AA) or heterozygous albino man normally pigmented normally pigmented albino woman 
(Aa) for the dominant allele A. (aa) woman (AA) man (AA) (aa) 


= — 


all normally pigmented all normally pigmented 
offspring (Aa) offspring (Aa) 


— _—___— 


75% normally pigmented 25% albino 
(AA or Aa) (aa) 


retina of the eye. Because the recessive allele codes for a version of this 
enzyme that is only weakly active or completely inactive, albinos have 
white hair, white skin, and pupils that look pink because a lack of mela- 
nin in the eye allows the red color of the hemoglobin in blood vessels in 
the retina to be visible. 


The trait for albinism is inherited in the same manner as any other reces- 
sive trait, including Mendel’s green peas. If a man who is homozygous 
for the recessive albinism allele (genotype aa) has children with a woman 
who has the same genotype, all of their children will be albino (aa). 
However, if a homozygous nonalbino man (AA) marries and has children 
with an albino woman (aa), their children will all be heterozygous (Aa) 
and normally pigmented (Figure 19-25). If two nonalbino individuals 
with an Aa genotype start a family, each of their children would have a 
25% chance of being an albino (aa). 


Of course, humans generally don’t have families large enough to guaran- 
tee perfect Mendelian ratios. (Mendel arrived at his ratios by breeding and 
counting thousands of pea plants for most of his crosses.) Geneticists that 
follow the inheritance of specific traits in humans get around this problem 
by working with large numbers of families—or with several generations 
of a few large families—and preparing pedigrees that show the pheno- 
type of each family member for the relevant trait. Figure 19-26 shows 
the pedigree for a family that harbors a recessive allele for deafness. It 
also illustrates an important practical consequence of Mendel’s laws: 
marriages between related individuals—called consanguineous (from the 
Latin sanguis, “blood”)—create a greatly increased risk of producing chil- 
dren that are homozygous for a deleterious recessive mutation. 


Alleles for Different Traits Segregate Independently 


Mendel deliberately simplified the problem of heredity by starting with 
breeding experiments that focused on the inheritance of one trait at a time, 
called monohybrid crosses. He then turned his attention to multihybrid 


generation | 


generation II 


generation Ill 


generation IV 


crosses, examining the simultaneous inheritance of two or more appar- 
ently unrelated traits. 


In the simplest situation, a dihybrid cross, Mendel followed the inheritance 
of two traits at once: for example, pea color and pea shape. In the case 
of pea color, we have already seen that yellow is dominant over green; 
for pea shape, round is dominant over wrinkled (see Figure 19-20). What 
would happen when plants that differ in both of these characters are 
crossed? Again, Mendel started with true-breeding parental strains: the 
dominant strain produced yellow round peas (its genotype is YYRR), the 
recessive strain produced green wrinkled peas (yyrr). One possibility is 
that the two characters, pea color and shape, would be transmitted from 
parents to offspring as a linked package. In other words, plants would 
always produce either yellow round peas or green wrinkled ones. The 
other possibility is that pea color and shape would be inherited indepen- 
dently, which means that at some point plants bearing a novel mix of 
traits—yellow wrinkled peas or green round peas—would arise. 


The F; generation of plants all showed the expected phenotype: each 
produced peas that were yellow and round. But this result would occur 
whether or not the parental alleles were linked. When the F, plants were 
then allowed to self-fertilize, the results were clear: the two alleles for 
seed color segregated independently from the two alleles for seed shape, 
producing four different pea phenotypes: yellow-round, yellow-wrinkled, 
green-round, and green-wrinkled (Figure 19-27). Mendel tried his seven 
pea characters in various pairwise combinations and always observed 
a characteristic 9:3:3:1 phenotypic ratio in the F2 generation. The inde- 
pendent segregation of each pair of alleles during gamete formation is 
Mendel’s second law—the law of independent assortment. 


The Behavior of Chromosomes During Meiosis 
Underlies Mendel’s Laws of Inheritance 


So far we have discussed alleles and genes as if they are disembod- 
ied entities. We now know that Mendel’s “factors’—the things we call 
genes—are carried on chromosomes that are parceled out during the for- 
mation of gametes and then brought together in novel combinations in 
the zygote at fertilization. Chromosomes therefore provide the physical 
basis for Mendel’s laws, and their behavior during meiosis and fertiliza- 
tion—which we discussed earlier—explains these laws perfectly. 


During meiosis, the maternal and paternal homologs—and the genes that 
they contain—pair and then separate from each other as they are parceled 
out into gametes. These maternal and paternal chromosome copies will 
possess different variants—or alleles—of many of the genes they carry. 
Take, for example, a pea plant that is heterozygous for the yellow-pea 
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Figure 19-26 A pedigree shows the risks 
of first-cousin marriages. Shown here is an 
actual pedigree for a family that harbors a 
rare recessive mutation causing deafness. 
According to convention, squares represent 
males, circles are females. Here, family 
members that show the deaf phenotype are 
indicated by a blue symbol, those that do 
not by a gray symbol. A black horizontal line 
connecting a male and female represents a 
mating between unrelated individuals, and 
a red horizontal line represents a mating 
between blood relatives. The offspring of 
each mating are shown underneath, in order 
of their birth from left to right. 

Individuals within each generation are 
labeled sequentially from left to right for 
purposes of identification. In the third 
generation in this pedigree, for example, 
individual 2, a man who is not deaf, marries 
his first cousin, individual 3, who is also 
not deaf. Three out of their five children 
(individuals 7, 8, and 9 in the fourth 
generation) are deaf. Meanwhile, individual 
1, the brother of 2, also marries a first 
cousin, individual 4, the sister of 3. Two out 
of their five children are deaf. (Adapted from 
Z.M. Ahmed et al., BMC Med. Genet. 5:24, 
2004. With permission from BMC Medical 
Genetics.) 
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Figure 19—27 A dihybrid (two traits) cross © D 
demonstrates that alleles can segregate 
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independently = packaged injo gametes phenotype: yellow-round phenotype: green-wrinkled 
in all possible combinations. So the Y allele genotype: YYRR genotype:yyrr 

is equally likely to be packaged with the R 

or r allele during gamete formation; and the | | 

same holds true for the y allele. Thus four 
classes of gametes are produced in roughly 


equal numbers: YR, Yr, yR, and yr. When 
these gametes are allowed to combine at CROSS-FERTILIZATION 
random to produce the F2 generation, the 


resulting pea phenotypes are yellow-round, 
yellow-wrinkled, green-round, and green- 
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gene (Yy). During meiosis, the chromosomes bearing the Y and y alleles 
will be separated, producing two types of haploid gametes—ones that 
contain a Y allele and others that contain a y. In a plant that self-fertilizes, 
these haploid gametes come together to produce the diploid individuals 
of the next generation—which may be YY, Yy, or yy. Together, the meiotic 
mechanisms that distribute the alleles into gametes and the combining 
of gametes at fertilization provide the physical foundation for Mendel’s 
law of segregation. 


But what about independent assortment of multiple traits? Because each 
pair of duplicated homologs attaches to the spindle and lines up at the 
metaphase plate independently during meiosis, each gamete will inherit 
a random mixture of paternal and maternal chromosomes (see Figure 
19-15A). Thus the alleles of genes on different chromosomes will segre- 
gate independently. 


Consider a pea plant that is heterozygous for both seed color (Yy) and 
seed shape (Rr). The homolog pair carrying the color alleles will attach 
to the meiotic spindle with a certain orientation: whether the Y-bearing 
homolog or its y-bearing counterpart is captured by the microtubules 
from one pole or the other depends on which way the bivalent happens 
to be facing at the moment of attachment (Figure 19-28). The same 
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is true for the homolog pair carrying the alleles for seed shape. Thus, 
whether the final gamete receives the YR, Yr, yR, or yr allele combination 
depends entirely on which way the two homolog pairs were facing when 
they were captured by the meiotic spindle; each outcome has the same 
degree of randomness as the tossing of a coin. 


Genes That Lie on the Same Chromosome Can 
Segregate Independently by Crossing-Over 


Mendel studied seven traits, each of which is controlled by a separate 
gene. It turns out that most of these genes are located on different chromo- 
somes, which readily explains the independent segregation he observed. 
But the independent segregation of different traits does not necessarily 
require that the responsible genes lie on different chromosomes. If two 
genes are far enough away from each other on the same chromosome, 
they will also sort independently, because of the crossing-over that occurs 
during meiosis. As we discussed earlier, when duplicated homologs pair 
to form bivalents, the maternal and paternal homologs always undergo 
crossing-over. This genetic exchange can separate alleles that were for- 
merly together on the same chromosome, causing them to segregate into 
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Figure 19-28 The separation of 
duplicated homologous chromosomes 
during meiosis explains Mendel’s 

laws of segregation and independent 
assortment. Here we show independent 
assortment of the alleles for seed color, 
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yellow (Y) and green (y), and for seed shape, 


round (R) and wrinkled (r), as an example of 
how two genes on different chromosomes 
segregate independently. Although 
crossovers are not shown, they would not 
affect the independent assortment of these 
traits, as the two genes lie on different 
chromosomes. 
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(A) (B) Figure 19-29 Genes that lie far enough apart on the same 
F H) chromosome will segregate independently. (A) Because several 
crossover events occur randomly along each chromosome during 
i prophase of meiosis |, two genes on the same chromosome will obey 
A Mendel’s law of independent assortment if they are far enough apart. 
Thus, for example, there is a high probability of crossovers occurring in 
i iaci the long region between C/c and F/f, meaning that a gamete carrying 
p r between these p the F allele will wind up with the c allele as often as it will the C allele. 
C i 
B b o] 
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l I 
two genes i 3 In contrast, the A/a and B/b genes are close together, so there is only 


on average, 
several 
| crossover events 
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likely to be co-inherited with the B allele, and the a allele with the b 
allele. From the frequency of recombination, one can estimate the 
distances between the genes. (B) An example of a crossover that has 
separated the C/c and F/f alleles, but not the A/a and B/b alleles. 
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different gametes (Figure 19-29). We now know, for example, that the 
genes for pea shape and pod color that Mendel studied are located on 
the same chromosome, but because they are far apart they segregate 
independently. 


Not all genes segregate independently as per Mendel’s second law. If 
genes lie close together on a chromosome, they are likely to be inherited 
as a unit. For example, human genes associated with red-green color- 
blindness and hemophilia are typically inherited together for this reason. 
By measuring how frequently genes are co-inherited, geneticists can 
determine whether they reside on the same chromosome and, if so, how 
far apart they are. These measurements of genetic linkage have been used 
to map the relative positions of the genes on each chromosome of many 
organisms. Such genetic maps have been crucial for isolating and char- 
acterizing mutant genes responsible for human genetic diseases such as 
cystic fibrosis. 


Mutations in Genes Can Cause a Loss of Function or a 
Gain of Function 


Mutations produce heritable changes in DNA sequence. They can arise in 
various ways (discussed in Chapter 6) and can be classified by the effect 
they have on gene function. Mutations that reduce or eliminate the activ- 
ity of a gene are called loss-of-function mutations (Figure 19-30). An 
organism in which both alleles of a gene bear loss-of-function mutations 
will generally display an abnormal phenotype—one that differs from 
the most commonly occurring phenotype (although the difference may 
sometimes be subtle and hard to detect). By contrast, the heterozygote, 
which possesses one mutant allele and one normal, “wild-type” allele, 
generally makes enough active gene product to function normally and 
retain a normal phenotype. Thus loss-of-function mutations are usually 
recessive, because—for most genes—decreasing the normal amount of 
gene product by 50% has little impact. 


In the case of Mendel’s peas, the gene that dictates seed shape codes for 
an enzyme that helps convert sugars into branched starch molecules. The 
dominant, wild-type allele, R, produces an active enzyme; the recessive, 


Figure 19-30 Mutations in protein-coding 
genes can affect the protein product in 

a variety of ways. (A) In this example, the 
normal or “wild-type” protein has a specific 
function, denoted by the red rays. (B) 
Various loss-of-function mutations decrease 
or eliminate this activity. (C) Gain-of-function 
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mutant allele, r, does not. Because they lack this enzyme, plants that 
are homozygous for the r allele contain more sugar and less starch than 
plants that possess the dominant R allele, which gives their peas a wrin- 
kled appearance. The sweet peas available in the supermarket are often 
wrinkled mutants of the same type that Mendel studied. 


Although most loss-of-function mutations are recessive, some can be 
dominant. Take, for example, a mutation that causes a protein to mis- 
fold. In a heterozygote, 50% of the proteins produced would be misfolded 
and inactive, while the other 50% would function normally. However, the 
misfolded form of the protein could go on to form aggregates that cause 
severe problems for the cell (see Figure 4-19). Because of its widespread 
impact, this particular loss-of-function mutation would be dominant. 


Mutations that increase the activity of a gene or its product, or result 
in the gene being expressed in inappropriate circumstances, are called 
gain-of-function mutations (see Figure 19-30). Such mutations are 
usually dominant. For example, as we saw in Chapter 16, certain muta- 
tions in the Ras gene generate a form of the protein that is always active. 
Because the normal Ras protein is involved in controlling cell prolifera- 
tion, the mutant protein drives cells to multiply inappropriately, even in 
the absence of signals that are normally required to stimulate cell divi- 
sion—thereby promoting the development of cancer. About 30% of all 
human cancers contain such dominant, gain-of-function mutations in 
the Ras gene. 


Each of Us Carries Many Potentially Harmful Recessive 
Mutations 


AS we saw in Chapter 9, mutations that occur in the germ line provide 
the fodder for evolution. They can alter the fitness of an organism, mak- 
ing it either less or more likely for the individual to survive and leave 
progeny. Natural selection determines whether these mutations are pre- 
served: those that confer a selective advantage on an organism tend to 
be perpetuated, whereas those that compromise an organism’s fitness or 
ability to procreate tend to be lost. 


The great majority of chance mutations are either neutral, with no 
effect on phenotype, or deleterious. A deleterious mutation that is domi- 
nant—one that exerts its negative effects when present even in a single 
copy—will be eliminated almost as soon as it arises. In extreme cases, 
if a mutant organism is unable to reproduce, the mutation that causes 
that failure will be lost from the population when the mutant individual 
dies. For deleterious mutations that are recessive, things are a little more 
complicated. When such a mutation first arises, it will generally be pre- 
sent in only a single copy. The organism that carries the mutation can 
produce just as many progeny as other individuals; some of these prog- 
eny will inherit a single copy of the mutation, and they too will appear fit 
and healthy. But as they and their descendants begin to mate with one 
another, some individuals will inherit two copies of the mutant allele and 
display an abnormal phenotype. 


If such a homozygous individual fails to reproduce, two copies of the 
mutant allele will be lost from the population. Eventually, an equilibrium 
is reached, where the rate at which new mutations occur in the gene 
balances the rate at which these mutant alleles are lost through matings 
that yield abnormal, homozygous mutant individuals. As a consequence, 
many deleterious recessive mutations are present in heterozygous 
individuals at a surprisingly high frequency, even though homozygous 
individuals showing the deleterious phenotype are rare. For example, the 
most common form of hereditary deafness (due to mutations in a gene 
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QUESTION 19-3 


Imagine that each chromosome 
undergoes one and only one 


crossover event on each chromatid 


during each meiosis. How would 


the co-inheritance of traits that are 


determined by genes at opposite 
ends of the same chromosome 
compare with the co-inheritance 


observed for genes on two different 


chromosomes? How does this 
compare with the actual situation? 
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Figure 19-31 DNA-damaging agents 
produce various types of mutations. 
Some common types of mutation are shown 
here. Different mutagens each produce 

a characteristic spectrum of mutations. 
Other types of mutation involve changes 

in larger segments of DNA, including 
deletions, duplications, and chromosomal 
rearrangements (not shown). 


that encodes a gap-junction protein; see Figure 20-28) occurs in about 
one in 4000 births, but about one in 30 of us are carriers of a loss-of- 
function mutant allele of the gene. 


GENETICS AS AN EXPERIMENTAL TOOL 


Unraveling how chromosomes shuttle genetic information from one gen- 
eration to the next did more than demystify the basis of inheritance: it 
united the science of genetics with other life sciences, from cell biol- 
ogy and biochemistry to physiology and medicine. Genetics provides a 
powerful way to discover what specific genes do and how variations in 
those genes underlie the differences between one species and another or 
between individuals within a species. Such knowledge also has practical 
benefits, as understanding the genetic and biological basis of diseases 
can help us to better diagnose, treat, and prevent them. 


In this section, we outline the classical genetic approach to identifying 
genes and determining how they influence the phenotype of experimental 
organisms such as yeast or flies. The process begins with the genera- 
tion of a very large number of mutants and the identification of those 
rare individuals that show a phenotype of interest. By analyzing these 
rare mutant individuals and their progeny, we can track down the genes 
responsible and work out what these genes normally do—and how muta- 
tions that alter their activity affect how an organism looks and behaves. 


The Classical Genetic Approach Begins with Random 
Mutagenesis 


Before the advent of DNA technology (discussed in Chapter 10), most 
genes were identified and characterized by observing the processes dis- 
rupted when the gene was mutated. This type of analysis begins with the 
isolation of mutants that have an interesting or unusual phenotype: fruit 
flies that have white eyes or curly wings or that become paralyzed when 
exposed to high temperatures, for example. Working backward from 
the abnormal phenotype, one then determines the change in DNA that 
is responsible. This classical genetic approach—searching for mutant 
phenotypes and then isolating the responsible genes—is most easily 
performed in model organisms that reproduce rapidly and are amena- 
ble to genetic manipulation, such as bacteria, yeasts, nematode worms, 
zebrafish, and fruit flies. A few of the principles behind this classical 
approach are outlined in Panel 19-1, (p. 675). 


Although spontaneous mutants with interesting phenotypes can be found 
by combing through a collection of thousands or millions of organisms, 
the process can be made much more efficient by generating mutations 
artificially with agents that damage DNA, called mutagens. Different muta- 
gens generate different types of DNA mutations (Figure 19-31). Not all 
mutations will lead to a noticeable change in phenotype. But by treating 
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GENES AND PHENOTYPES 
Gene: a functional unit of inheritance, corresponding to the segment 

of DNA coding for a protein or noncoding RNA molecule. 
Genome: all of an organism's DNA sequences. 
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the individual 
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In the example above, the phenotype of the heterozygote is the same as that of one of the homozygotes; 
in cases where it is different from both homozygotes, the two alleles are said to be co-dominant. 
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TWO GENES OR ONE? Given two mutations that produce the same phenotype, how can we tell whether they are mutations 
in the same gene? If the mutations are recessive (as they most often are), the answer can be found by 
a complementation test. In the simplest type of complementation test, an individual who is 
homozygous for one mutation is mated with an individual who is homozygous for the other. The 
phenotype of the offspring gives the answer to the question. 
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Figure 19-32 Genetic screens can be 
used to identify mutations that affect 
an animal's behavior. (A) Wild-type 

C. elegans dine alone. (B) Mutant worms 
engage in social feeding. (Courtesy of 
Cornelia Bargmann.) 


large numbers of organisms with mutagens, collections of mutants can 
be generated quickly, increasing the odds of finding an interesting phe- 
notype, as we discuss next. 


Genetic Screens Identify Mutants Deficient in Specific 
Cell Processes 


A genetic screen typically involves examining many thousands of muta- 
genized individuals to find the few that show a specific altered phenotype 
of interest. To search for genes involved in cell metabolism, for example, 
one might screen mutagenized bacterial or yeast cells to pick out those 
that have lost the ability to grow in the absence of a particular amino acid 
or other nutrient (see Figure 9-5). 


Even genes involved in complex phenotypes, such as social behavior, 
can be identified by genetic screens in multicellular organisms. For exam- 
ple, by screening for worms that feed alone rather than in clusters as do 
wild-type individuals, scientists identified and isolated a gene that affects 
this “social behavior” (Figure 19-32). 


Advances in modern technologies have made it possible to carry out 
high-throughput genetic screens using large collections of individuals, 
each of which has a different gene inactivated. Such mutant collections 
can often be screened using automated robots. For example, investiga- 
tors have made use of RNA interference (explained in Figure 8-28) to 
generate a collection of nematode worms in which the activity of every 
protein-coding gene has been disrupted, with each worm being deficient 
in just one gene. These collections can be screened rapidly for dramatic 
changes in phenotype, such as stunted growth, uncoordinated move- 
ment, decreased fertility, or impaired embryonic development (Figure 
19-33). Using this strategy, the genes needed to produce a particular 
characteristic can be identified. 


Conditional Mutants Permit the Study of Lethal 
Mutations 


Genetic screens are a powerful approach for isolating and characterizing 
mutations that are compatible with life—those that change the appear- 
ance or behavior of an organism without killing it. A problem arises, 
however, if we wish to study essential genes—those that are absolutely 
required for fundamental cell processes, such as RNA synthesis or cell 
division. Defects in these genes are usually lethal, which means that spe- 
cial strategies are needed to isolate and propagate such mutants: if the 
mutants cannot be bred, their genes cannot be studied. 
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If the organism is diploid—a mouse or a pea plant, say—and the mutant 
phenotype is recessive, there is a simple solution. Individuals that are 
heterozygous for the mutation will have a normal phenotype and can be 
propagated. When they are mated with one another, 25% of the progeny 
will be homozygous mutants and will show the lethal mutant phenotype; 
50% will be heterozygous carriers of the mutation like their parents and 
can be used to maintain the breeding stock. 


But what if the organism is haploid, as is the case for many yeast and 
bacteria? One way to study lethal mutations in such organisms makes 
use of conditional mutants, in which the protein product of the mutant 
gene is only defective under certain conditions. For example, in mutants 
that are temperature-sensitive, the protein functions normally within a 
certain range of temperatures (called the permissive temperature) but 
can be inactivated by a shift to a nonpermissive temperature outside this 
range. Thus the abnormal phenotype can be switched on and off simply 
by changing the temperature. A cell containing a temperature-sensitive 
mutation in an essential gene can be propagated at the permissive tem- 
perature and then be driven to display its mutant phenotype by a shift to 
a nonpermissive temperature (Figure 19-34). 


Many temperature-sensitive bacterial mutants were isolated to identify 
the genes that encode the bacterial proteins required for DNA replication; 
investigators treated large populations of bacteria with mutagens and 
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out in Petri dish grow into 
colonies at 23°C 


cells proliferate at the cooler, 
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fail to proliferate at the warmer, 
nonpermissive temperature 
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Figure 19-33 RNA interference provides 
a convenient method for conducting 
genome-wide genetic screens. In this 
experiment, each well in this 96-well plate 
is filled with E. coli that produce a different 
double-stranded (ds), interfering RNA. 

E. coli are a standard diet for C. elegans 
raised in the laboratory. Each interfering 
RNA matches the nucleotide sequence of 

a single C. elegans gene. About 10 worms 
are added to each well, where they ingest 
the genetically modified bacteria. The plate 
is incubated for several days, which gives 
the RNAs time to bind to and inactivate 
their target genes—and the worms time 

to grow, mate, and produce offspring. The 
plate is then examined in a microscope, 
which can be controlled robotically, to 
screen for genes that affect the worms’ 
ability to survive, reproduce, develop, and 
behave. Because the investigator knows 
which interfering RNA was added to each 
well, the gene responsible for any resulting 
defect can be readily identified. Shown here 
are wild-type worms alongside a mutant 
that shows an impaired ability to reproduce. 
(Adapted from Lehner et al., Nat. Genet. 
38:896-903, 2006.) 


Figure 19-34 Temperature-sensitive 
mutants are valuable for identifying 
the genes and proteins involved 

in essential cell processes. In this 
example, yeast cells are treated with 

a mutagen, spread on a culture plate 
at a relatively cool temperature, 

and allowed to proliferate to form 
colonies. The colonies are then 
transferred to two identical Petri 
plates using a technique called 
replica plating. One of these plates is 
incubated at a cool temperature, the 
other at a warmer temperature. Those 
cells that contain a temperature- 
sensitive mutation in a gene essential 
for proliferation can be readily 
identified, because they form a 
colony only at the cooler, permissive 
temperature. 
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Figure 19-35 A complementation test 
can reveal that mutations in two different 
genes are responsible for the same 
abnormal phenotype. When an albino 
(white) bird from one strain is bred with an 
albino from a different strain, the resulting 
offspring have normal coloration. This 
restoration of the wild-type plumage 
implies that the two white breeds lack 
color because of recessive mutations in 
different genes. (From W. Bateson, 
Mendel’s Principles of Heredity, 1st ed. 
Cambridge, UK: Cambridge University 
Press, 1913.) 


then screened for cells that stopped making DNA when they were warmed 
from 30°C to 42°C. Similarly, temperature-sensitive yeast mutants were 
used to identify many of the proteins involved in regulating the cell cycle 
(see How We Know, pp. 30-31) and in transporting proteins through the 
secretory pathway (see Figure 15-28). 


The DNA technologies discussed in Chapter 10 have made it possible to 
construct an entirely different type of conditional mutant—one in which 
the gene of interest is engineered such that it can be deleted at a par- 
ticular time in development or in a particular tissue (see Figure 10-30). 
Essential genes—for example, those whose complete inactivation would 
cause an organism to die early in development—can be studied in this 
way because the organism can be allowed to develop past the critical 
period before the gene is deleted. This strategy is particularly useful for 
studying genes that are active in many different tissues, as the role they 
play in each one can be independently assessed. 


A Complementation Test Reveals Whether Two 
Mutations Are in the Same Gene 


A large-scale genetic screen can turn up many mutant organisms with 
the same phenotype. These mutations might affect the same gene or 
they might affect different genes that function in the same process. How 
can we distinguish between the two? If the mutations are recessive, a 
complementation test can reveal whether they affect the same or dif- 
ferent genes. 


In the simplest type of complementation test, an individual that is 
homozygous for one recessive mutation is mated with an individual that 
is homozygous for the other mutation. If the two mutations affect the 
same gene, the offspring will show the mutant phenotype, because they 
carry only defective copies of the gene in question. If, in contrast, the 
mutations affect different genes, the resulting offspring will show the 
normal, wild-type phenotype, because they will have one normal copy 
(and one mutant copy) of each gene (see Panel 19-1, p. 675). 


Whenever the normal phenotype is restored in such a test, the alleles 
inherited from the two parents are said to complement each other (Figure 
19-35). For example, complementation tests on mutants identified dur- 
ing genetic screens have revealed that five genes are required for yeast 
cells to digest the sugar galactose, that 20 genes are needed for E. coli 
to build a functional flagellum, and many hundreds are essential for the 
normal development of an adult nematode worm from a fertilized egg. 


EXPLORING HUMAN GENETICS 


Genetic screens in model experimental organisms have been spec- 
tacularly successful in identifying genes and relating them to various 
phenotypes, including many that are conserved between these organ- 
isms and humans. But the same approach cannot be used in humans. 
Unlike flies, worms, yeast, and bacteria, humans do not reproduce rap- 
idly. More importantly, intentional mutagenesis in humans is out of the 
question for ethical reasons. 


Nonetheless, humans are becoming increasingly attractive subjects for 
genetic studies. Because the human population is so large, spontane- 
ous nonlethal mutations have arisen in all human genes—many times 
over. A substantial proportion of these nonlethal mutations remain in 
the genomes of present-day humans, and the most harmful of these 
mutations are discovered when the affected individuals call attention to 
themselves by seeking medical help—a uniquely human behavior. 


With the recent advances that have enabled the sequencing of entire 
human genomes rapidly and inexpensively (discussed in Chapter 10), we 
can now identify such mutations and study their evolution and inherit- 
ance in ways that were impossible even a few years ago. By comparing 
the sequences of tens of thousands of human genomes, we can now 
identify directly the DNA differences that distinguish one individual from 
another. In this section, we discuss how analyses of DNA collected from 
human families and populations all over the world are providing clues 
about our evolutionary history and about the genes that influence our 
susceptibility to disease. 


Linked Blocks of Polymorphisms Have Been Passed 
Down from Our Ancestors 


As discussed in Chapter 9, when we compare the sequences of multiple 
human genomes, we find that any two individuals will differ in about 1 
nucleotide pair in 1000. Most of these variations are common and rela- 
tively harmless. When two sequence variants coexist at the same site and 
are common in the population, the variants are called polymorphisms. 
The majority of polymorphisms are due to the substitution of a single 
nucleotide, called single-nucleotide polymorphisms or SNPs (see 
Figure 9-38). The rest are due largely to insertions or deletions—called 
indels when the change is small, or copy number variants (CNVs) when it 
is large. 


Although these common variants can be found throughout the genome, 
they are not scattered randomly—or even independently. Instead, they 
tend to travel in groups called haplotype blocks—combinations of poly- 
morphisms or other DNA markers that are inherited as a unit. 


To understand why such haplotype blocks exist, we need to consider our 
evolutionary history. It is thought that modern humans expanded from 
a relatively small population—perhaps around 10,000 individuals—that 
existed in Africa about 200,000 years ago. Among that small group of our 
ancestors, some individuals might have carried one set of genetic vari- 
ants, others a different set. The chromosomes of a present-day human 
represent a shuffled combination of chromosome segments from differ- 
ent members of this small ancestral group of people. Because only about 
two thousand generations separate us from them, large segments of 
these ancestral chromosomes have passed from parent to child, unbro- 
ken by the crossover events that occur during meiosis. (Remember, only 
a few crossovers occur between each set of homologous chromosomes, 
as shown in Figure 19-12.) 


As a result, certain sets of DNA sequences—and their associated poly- 
morphisms—have been inherited in linked groups, with little genetic 
rearrangement across the generations. These are the haplotype blocks. 
Like genes that exist in different allelic forms, haplotype blocks also come 
in a limited number of variants that are common in the human popu- 
lation, each representing a combination of DNA polymorphisms passed 
down from a particular ancestor long ago. 


Polymorphisms Provide Clues to Our Evolutionary 
History 


A detailed examination of haplotype blocks has provided intriguing 
insights into the history of human populations. Our DNA sequences are 
constantly being altered by mutation; many of these changes will be 
neutral, in that they will not affect the reproductive success of the indi- 
vidual. Each of these variants has a chance of becoming common in the 
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QUESTION 19-4 


When two individuals from different 
isolated, inbred subpopulations of 
a species come together and mate, 
their offspring often show “hybrid 
vigor”: that is, they appear more 
robust, healthy, and fertile than 
either parent. Can you suggest 

a possible explanation for this 
phenomenon? 
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population. The more time that has elapsed since the origin of a relatively 
common polymorphism like a SNP, the smaller should be the haplotype 
block that surrounds it: that’s because, over the course of many gen- 
erations, crossover events will have had many chances to separate an 
ancient allele from other variants nearby. Thus by comparing the sizes of 
haplotype blocks from different human populations, it is possible to esti- 
mate how many generations have elapsed since the origin of a specific 
neutral mutation. By combining such genetic comparisons with archaeo- 
logical findings, scientists have been able to deduce the most probable 
routes our ancestors took when they left Africa (see Figure 9-37). 


Genome analyses can also be used to estimate when and where humans 
acquired mutations that have conferred an evolutionary benefit, such as 
resistance to infection. Such favorable mutations will rapidly accumulate 
in the population because individuals that carry them will be more likely 
to survive an epidemic and pass the mutation on to their offspring. A hap- 
lotype analysis can be used to “date” the appearance of such a favorable 
mutation. If it cropped up in the population relatively recently, there will 
have been fewer opportunities for recombination to break up the DNA 
sequence around it, so the surrounding haplotype block will be large. 


Such is the case for sickle-cell anemia, a disorder caused by a single 
nucleotide substitution that changes a glutamic acid to a valine in one of 
the protein subunits of hemoglobin (see Figure 6-32). Although individu- 
als who are homozygous for this allele experience the harmful effects 
of anemia, heterozygotes who carry one normal and one sickle-cell 
allele show no ill effects and, in addition, are resistant to malaria. This 
allele—which confers a benefit under the right set of circumstances—is 
widespread in Africa, where malaria is rife. A comparison of numerous 
human genes reveals that the sickle-cell allele is embedded in an unusu- 
ally large haplotype block, indicating that it arose relatively recently in 
the African gene pool—probably about 2000 years ago. In this way, analy- 
ses of modern human genomes can highlight important events in human 
evolution, including our initial exposures to specific infections. 


Genetic Studies Aid in the Search for the Causes of 
Human Diseases 


Like the wrinkled peas studied by Mendel, our susceptibility to disease is 
a phenotypic trait—albeit an unfortunate one. Thus, for many diseases, 
the causes are rooted in our genomes. In some cases, the genetic under- 
pinnings of disease are clear and unequivocal. For example, mutations in 
specific genes give rise, in a reproducible way, to clearly defined condi- 
tions such as congenital deafness, albinism, hemophilia, and sickle-cell 
anemia. Other times, the genetic connections are more complex. Many of 
the most common human disorders, such as diabetes or arthritis, involve 
many genes working together to give rise to the “disease phenotype.” 


Most diseases are also influenced by environmental factors: availabil- 
ity of nutrition or exposure to toxins, carcinogens, infectious viruses or 
microorganisms—even to sunlight (see Figure 6-25). Yet even diseases 
that are clearly environmental in nature, such as infection by specific 
pathogens can be modified by genetic factors. For example, individu- 
als bearing a sickle-cell allele are resistant to malaria, as we discussed 
earlier. Others carry an allele that renders them genetically resistant to 
infection with HIV, the virus that causes AIDS, as we discuss shortly. The 
ultimate outcome, in terms of disease phenotype, thus depends on an 
intricate interplay amongst genetic and environmental factors. 


Despite these complexities, genetic studies—particularly those that 
involve a comparison of human genome sequences—are expanding 
our understanding of the fundamental causes of human disease. In the 


remainder of this chapter, we explore the genetic underpinnings of dis- 
ease, and the approaches that investigators use to identify them. 


Many Severe, Rare Human Diseases Are Caused by 
Mutations in Single Genes 


About 3000 human diseases are caused by mutations in single genes. 
These single-gene, or monogenic, disorders are sometimes referred to as 
Mendelian because they show a pattern of inheritance that is as simple to 
trace as that of the wrinkled peas and purple flowers that Mendel studied. 
Most of these mutations are recessive; individuals who carry only one 
copy of the mutant allele are largely asymptomatic, while those homozy- 
gous for the mutation are severely impaired. For example, individuals 
with Tay-Sachs disease, which is characterized by seizures, blindness, 
and neurodegeneration, usually do not survive infancy. 


Monogenic disorders also tend to be very rare, affecting only a fraction 
of one percent of the human population. This low prevalence can be 
attributed to a number of factors. First, many of these diseases are “early 
onset,” meaning that affected individuals die early in life, often before 
reproducing. The disease-causing alleles carried by these individuals are 
thus eliminated from the gene pool. At the same time, because many 
monogenic disorders are recessive, heterozygous individuals typically 
lead normal lives and show no signs of the disease. Thus, the disease- 
causing alleles are never entirely eradicated from the population, and 
instead persist at a low frequency. The disease then manifests itself only 
in those rare individuals who inherit two mutant alleles. Monogenic dis- 
orders can occur more frequently, however, in families or populations 
in which the parents are genetically related. Such consanguineous mar- 
riages are more likely to produce offspring that are homozygous for the 
mutant, disease-causing alleles than are marriages between unrelated 
individuals (Figure 19-36). 
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Figure 19-36 The prevalence of 
consanguineous marriage can 
increase the likelihood of inheriting 
disease-causing alleles. A comparison 
of these two maps indicates the 

large degree of overlap between 

the percentage of consanguineous 
marriages and the incidence of birth 
defects in countries around the 

world. (A) The percentage of birth 
defects is indicated by blue shading. 
Here, a birth defect is defined as any 
abnormality affecting body structure 
or function that is present from birth. 
These include conditions caused by 
simple, monogenic diseases, as well as 
environmental factors, such as exposure 
to chemicals that cause mutations. 

(B) The proportion of marriages 
between second cousins or closer is 
indicated in orange. Gray shading 
indicates countries for which data were 
not available. (Adapted from M.A. 
Jobling et al., Human Evolutionary 
Genetics, 2nd ed. New York: Garland 
Science, 2014. With permission from 
Garland Science.) 


QUESTION 19-5 


In a recent automated analysis, 
thousands of SNPs across the 
genome were analyzed in pooled 
DNA samples from humans who had 
been sorted into groups according 
to their age. For the vast majority of 
these sites, there was no change in 
the relative frequencies of different 
variants as these humans aged. 
Sometimes, albeit rarely, a particular 
variant at one position was found to 
decrease in frequency progressively 
for people over 50 years old. Which 
of the possible explanations seems 
most likely? 

A. The nucleotide in that SNP 

at that position is unstable, and 
mutates with age. 

B. Those people born more 

than 50 years ago came from a 
population that tended to lack the 
disappearing SNP variant. 

C. The SNP variant alters an 
important gene product in a way 
that shortens the human life-span, or 
is linked to a neighboring allele that 
has this effect. 
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Some monogenic diseases are more common in certain populations 
than in others. In some cases, this prevalence is due to natural selection. 
The mutant hemoglobin allele that can provide resistance to malaria, for 
example, is present in higher frequencies in geographic regions where 
malaria is common. In other cases, the preponderance of a particular 
mutation is likely due to a founder effect; that is, a subpopulation of 
humans arose from a small number of individuals, some of which hap- 
pened to carry a particular mutation. As this subpopulation expanded, 
the frequency of the mutant allele became higher than it is in the human 
population as a whole. This appears to be the case for Tay-Sachs disease, 
which is more prevalent in Ashkenazi Jews. 


When we explore the mechanisms by which these rare, single-gene 
mutations lead to disease, we find that monogenic disorders affect nearly 
all aspects of cell and molecular biology. Tay-Sachs disease, for exam- 
ple, is caused by loss-of-function mutations in the gene that encodes 
the enzyme hexosaminidase. Without this enzyme, brain and nerve cells 
become increasingly damaged, with tragic consequences. Another dis- 
ease, called cystic fibrosis, arises from mutations in the gene coding for 
a specialized form of chloride channel. Some of these mutations prevent 
the channel from opening, while others inhibit its proper folding, which 
leads to the protein being destroyed. Knowing how a particular mutation 
affects protein function can point the way toward the most effective treat- 
ment. For example, certain drugs can help direct a misfolded channel to 
its proper place in the plasma membrane and help it to function more 
effectively. Such treatments can “rescue” mutant proteins and restore 
enough of their activity to alleviate some of the worst symptoms of cystic 
fibrosis. Although the consequences of such diseases can be devastating 
to individuals, families, and communities, their study has provided criti- 
cal insights into the function of many human genes. 


Finally, it should be noted that not all loss-of-function mutations in 
humans are deleterious. For example, individuals that are homozygous 
for mutations that destroy a cell-surface receptor called CCR5 are resist- 
ant to infection by HIV because the virus uses this receptor to enter 
human immune cells. Although individuals lacking this receptor may be 
slightly more sensitive to other viral infections, they appear normal in all 
other respects. This mutation can thus be seen as largely beneficial in 
a world in which HIV continues to be a major public health issue—and 
could point toward therapeutic strategies for combatting the spread of 
the virus. 


Common Human Diseases Are Often Influenced by 
Multiple Mutations and Environmental Factors 


Although the monogenic disorders discussed above are rare, the genes 
responsible tend to be relatively simple to track down. Analyzing the 
genomes from an affected family—or a small population in which the 
disease is prevalent—is often sufficient to locate the disease-causing 
mutation. However, many of the most common human diseases—such as 
type 2 diabetes, coronary artery disease, and obesity—are influenced by 
many different genetic factors, as well as environmental conditions. For 
such complex, multigenic conditions, no single allele—whether homozy- 
gous or heterozygous—is sufficient to precipitate the disease. Instead, a 
given allele might increase the risk of having the disease, but—in the 
absence of other contributing alleles (or environmental factors)—would 
be unlikely to cause it. 


Unlike the relatively simple inheritance patterns associated with mono- 
genic disorders, those of multigenic disorders are often bewilderingly 
complex: although the individual alleles involved are each inherited 
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according to the laws of Mendel, their sheer number greatly complicate 
the analysis. More importantly, unlike monogenic disorders—which tend 
to occur early in life—complex, multigenic diseases often occur much 
later. Because of this delay, many of these disorders do not affect an indi- 
vidual’s likelihood of reproducing. As a result, some of the risk-enhancing 
alleles have become quite common in the population, as they are not 
eliminated by selection. The resulting disorders can thus affect a large 
proportion of the population (Figure 19-37). 


Genome-wide Association Studies Can Aid the Search 
for Mutations Associated with Disease 


Given the complexity of many of the most common human diseases, iden- 
tifying the associated genes can be a difficult task. One way of uncovering 
these genetic risk factors involves analyzing the patterns of inherited 
polymorphisms. Investigators typically collect DNA samples from a large 
number of people who have the disease and compare them to samples 
from a group of people who do not. They look for variants—SNPs, for 
example—that are more common among the people who have the dis- 
ease. Because DNA sequences that are close together on a chromosome 
tend to be inherited together, the presence of such SNPs could indicate 
that these variants themselves—or alleles that lie nearby— increase the 
risk of the disease (Figure 19-38). 


Such genome-wide association studies (GWAS)—which initially focused on 
SNPs—have been used to search for genes that predispose individuals to 
common diseases, including diabetes, coronary artery disease, rheuma- 
toid arthritis, and even depression. One such study is described in How 
We Know, pp. 684-685. For many of these conditions, environmental as 
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Figure 19-37 Complex diseases are 
widespread geographically and are 
common in the human population. This 
map shows the global distribution of 
overweight and obese males, as determined 
by an elevated body mass index (a measure 
of an individual's weight and height). 
Conditions such as obesity, which depend 
on complex interactions between genetics 
and the environment, often occur at 
frequencies that can be thousands of times 
higher than those of simple, monogenic 
disorders. (Adapted from M.A. Jobling 

et al., Human Evolutionary Genetics, 

2nd ed. New York: Garland Science, 2014. 
With permission from Garland Science.) 


Figure 19-38 Genes that affect the 

risk of developing a common disease 
can often be tracked down through 
linkage to SNPs. Here, the patterns 

of SNPs are compared between two 

sets of individuals—a set of healthy 
controls and a set affected by a particular 
common disease. A segment of a typical 
chromosome is shown. For most SNPs 

in this segment, it is a random matter 
whether an individual has one SNP variant 
(red vertical bars) or another (blue vertical 
bars); the same randomness is seen both 
for the control group and for the affected 
individuals. However, in the part of the 
chromosome that is shaded in darker 

gray, a bias is seen, such that most normal 
individuals have the blue SNP variants, 
whereas most affected individuals have the 
red SNP variants. This finding suggests that 
this region contains, or is very close to, a 
gene that is involved in the disease. Using 
carefully selected controls and thousands of 
affected individuals, this approach can help 
track down disease-related genes, even 
when they confer only a slight increase in 
the risk of developing the disease. 
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“ HOW WE KNOW 


USING SNPs TO GET A HANDLE ON HUMAN DISEASE 


For diseases that have their roots in genetics, finding the 
gene or genes responsible can be the first step toward 
improved diagnosis, treatment, and even prevention. 
The task is not simple, but having access to polymor- 
phisms such as SNPs can help. In 1999, an international 
group of scientists set out to collect and catalog 300,000 
SNPs—the single-nucleotide polymorphisms that are 
common in the human population (see Figure 9-38). 
Today, the database has grown to include a catalog of 
millions upon millions of genetic variations. These SNPs 
do not only help to define the differences between one 
individual and another; for geneticists, they also serve 
as signposts that can point the way toward the genes 
involved in common human disorders, such as diabetes, 
obesity, asthma, arthritis, and even gallstones and rest- 
less leg syndrome. 


Making a map 

One way that SNPs have facilitated the search for alleles 
that predispose to disease is by providing the physical 
markers needed to construct detailed genetic linkage 
maps. A genetic linkage map displays the relative loca- 
tions of genetic markers along each chromosome. Such 
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maps are based on the frequency with which these mark- 
ers are co-inherited. Those that lie close to one another 
on the same chromosome will be inherited together 
much more frequently than those that lie farther apart. 
By determining how often crossing-over separates two 
markers, the relative distance between them can be cal- 
culated (see Panel 19-1, p. 675). 


The same sort of analysis can be used to discover link- 
age between a SNP and an allele—for example, one that 
might cause an inherited disease. We simply look for co- 
inheritance of the SNP with a certain phenotype—in this 
case, the disease. Finding such a linkage indicates that 
the mutation responsible for the phenotype is either the 
SNP itself or, more likely, lies close to the SNP (Figure 
19-39). And because we know the exact location in the 
human genome sequence of every SNP we examine, the 
linkage tells us the neighborhood in which the causa- 
tive mutation resides. A more detailed analysis of the 
DNA in that region—to look for deletions, insertions, or 
other functionally significant abnormalities in the DNA 
sequence of affected individuals—can then lead to a 
precise identification of the critical gene. 
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TESTS PERFORMED ON 7 OFFSPRING 


OBSERVATION: Disease is seen only in progeny with SNP genotype aa. 
CONCLUSION: Recessive mutation causing the disease is co-inherited with SNP a. If this same 
correlation is observed in other families that have been examined, the mutation causing the disease 


must lie close to SNP a. 


Figure 19-39 SNP analysis can pin down the location of a mutation that causes a genetic disease. In this approach, one studies 
the co-inheritance of a specific human phenotype (here a genetic disease) with a particular set of SNPs. The figure shows the logic for 
the common case of a family in which both parents are carriers of a recessive mutation. If individuals with the disease, and only such 
individuals, are homozygous for a particular SNP, then the SNP and the recessive mutation that causes the disease are likely to be close 
together on the same chromosome, as shown here. To prove that an apparent linkage is statistically significant, a few dozen individuals 
from such families may need to be examined. With more individuals and using more SNPs, it is possible to locate the mutation more 
precisely. These days it can be just as fast and cheap to use whole-genome sequencing to find the mutation. 


Such linkage analyses are usually carried out in fami- 
lies that are particularly prone to a disorder—the larger 
the family, the better. And the method works best where 
there is a simple cause-and-effect relationship, such that 
a particular mutant gene directly and reliably causes the 
disorder—as is the case, for example, for the mutant 
gene that causes cystic fibrosis. But most common dis- 
orders are not like this. Instead, many factors affect the 
disease risk—some genetic, some environmental, some 
just a matter of chance. For such conditions, a different 
approach is needed to identify risk genes. 


Making associations 


Genome-wide association studies (GWAS, for short) allow 
us to discover common genetic variants that affect the 
risk for a common disease, even if each variant alters 
susceptibility only slightly. Because mutations that 
destroy the activity of a key gene are likely to have a 
disastrous effect on the fitness of the mutant individual, 
they tend to be eliminated from the population by nat- 
ural selection and so are rarely seen. Genetic variants 
that alter a gene’s function only slightly, on the other 
hand, are much more common. By tracking down these 
common variants, or polymorphisms, we can sniff out 
some of the genes that contribute to the biology of com- 
mon diseases. 


GWAS rely on genetic markers, such as SNPs, that are 
located throughout the genome to compare directly the 
DNA sequences of two populations: individuals who 
have a particular disease and those who do not. The 
approach identifies SNPs that are present in the peo- 
ple who have the disease more often than would be 
expected by chance. 


Consider the case of age-related macular degeneration 
(AMD), a degenerative disorder of the retina that is 
a leading cause of blindness in the elderly. To search 
for genetic variations that are associated with AMD, 
researchers looked at a panel of just over 100,000 SNPs 
that spanned the genome. They determined the nucleo- 
tide sequence at each of these SNPs in 96 people who 
had AMD, and 50 who did not. Among the 100,000 SNPs, 
they discovered that one particular SNP was present 
significantly more often in the individuals who had the 
disease (Figure 19—40). 


The SNP is located in a gene called Cfh (complement fac- 
tor H). But it falls within one of the gene’s introns and 
appears unlikely to have any effect on the protein prod- 
uct. This SNP itself, therefore, did not seem likely to be 
the cause of the increase in susceptibility to AMD. But 
it focused the researchers’ attention on the Cfh gene. 
So they resequenced the region to look for additional 
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polymorphisms that might also be inherited more often 
by people with AMD, along with the SNP that they had 
already identified. They discovered three variants that 
change the amino acid sequence of the Cfh protein. 
One substitutes a histidine for a tyrosine at one particu- 
lar place in the protein, and it was strongly associated 
with the disease (and almost always coupled with the 
original SNP that had put the researchers on the track 
of the Cfh gene). Individuals who carried two copies of 
this risky allele were five to seven times more likely to 
develop AMD than those who harbored a different allele 
of the Cfh gene. 


Several other research teams, using a similar genetic 
association approach, have also pointed to Cfh variants 
as increasing the likelihood of developing AMD, mak- 
ing it almost certain that the Cfh gene has something 
to do with the biology of the disease. The Cfh protein is 
part of the complement system, an important compo- 
nent of immunity; the protein helps prevent the system 
from becoming overactive, a condition that can lead 
to inflammation and tissue damage. Interestingly, the 
environmental risk factors associated with the disease— 
smoking, obesity, and age—also affect inflammation and 
the activity of the complement system. Thus, whatever 
the detailed mechanism by which the Cfh gene influ- 
ences the risk of AMD, the finding that complement is 
critical could lead to new tests for the early diagnosis 
of the disorder, as well as potential new avenues for 
treatment. 
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Figure 19-40 Genome-wide association studies identify 

DNA variations that are significantly more frequent in people 
with age-related macular degeneration (AMD). In this study, 
scientists examined more than 100,000 SNPs in each of 146 
people. The x-axis of the graph shows the relative position of 
each SNP in the genome, starting at the left with the SNPs on 
Chromosome 1. The y-axis shows the strength of each SNP’s 
observed correlation with AMD. The blue region indicates a cutoff 
level for statistical significance, corresponding to a probability of 
less than 5% of finding that strength of correlation by pure chance 
anywhere among the whole set of 100,000 tested SNPs. The SNP 
marked in red is the one that led the way to the relevant gene, 
Cth. The initial association of the other prominent SNP (black) 
with the disease was rendered insignificant when additional 
sequencing at that site was performed. (Adapted from R.J. Klein 
et al., Science 308:385-389, 2005.) 
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well as genetic factors play an important part in determining which indi- 
viduals will develop the disease. 


Disappointingly, most of the DNA polymorphisms identified through this 
strategy increase the risk of disease only slightly. Many of them fall within 
regulatory DNA sequences and only subtly alter the expression of the 
genes they control. However, by identifying these “risky alleles,” such 
studies provide insights into the molecular mechanisms underlying these 
complex disorders, these results are leading to an improved understand- 
ing of the molecular basis of common inherited diseases. 


We Still Have Much to Learn about the Genetic Basis 
of Human Variation and Disease 


The polymorphisms that have thus far allowed us to track our ancestors 
and identify genes that increase our risk of disease have to be relatively 
common to be detected by the methods we have described. Because they 
arose so long ago in our evolutionary past they are now present, in one 
form or another, in a substantial portion (1% or more) of the population. 
Such genetic variants are thought to account for about 90% of the differ- 
ences between one person's genome and another. But when we try to tie 
these common alterations to differences in disease susceptibility or other 
heritable traits, such as height, we find that they do not have as much 
predictive power as we had anticipated: thus, for example, most confer 
relatively small increases—less than twofold—in the risk of developing a 
common disease. 


Part of the problem is that many of the mutations that are directly 
responsible for complex human diseases appeared more recently in 
our evolutionary history—during a period when the human population 
underwent an explosive expansion in size, from the few million individu- 
als that existed a mere 10,000 years ago to the 7 billion or so that inhabit 
the planet today. Because such mutations occur more rarely than the 
ancient polymorphisms that are common in the human population, they 
could slip through the type of GWAS approach just described. 


Now that the price of DNA sequencing has plummeted, the most efficient 
and cost-effective way to identify these recent mutations is by sequenc- 
ing and comparing the genomes of many thousands of individuals—those 
affected by diseases and those who are not. Such DNA sequencing must 
be very accurate, so that rare DNA variants in the population can be 
unambiguously identified (and distinguished from sequencing errors). 


Once these variants are identified, the next challenge is to determine how 
they affect the phenotype of the individuals that carry them. When a vari- 
ant falls within the coding region of a gene, it is simple to assess whether 
it would alter the amino acid sequence of the resulting protein. However, 
as we have seen, many important DNA variants lie outside coding 
regions. This mechanism could help explain why many alleles have only 
a small, but statistically significant, effect on the probability of precipitat- 
ing a particular disease. It is often difficult to predict—from inspection of 
a genome sequence alone—what the effect of such variants might be; in 
these cases, additional experiments in cultured cells or animal models 
are needed to determine the consequences of such a mutation. 


As genome sequencing efforts continue, we are discovering many previ- 
ously unreported genetic variants in people affected by disease—and in 
apparently healthy individuals. Based on one study, each of us harbors 
about 100 loss-of-function mutations in protein-coding genes—some of 
which eliminate the activity of both gene copies. This surprising result 
means that our genome still holds many secrets, and that we can develop 


and function as “normal” humans in today’s world despite the enormous 
variations across the human population. 


ESSENTIAL CONCEPTS 


Sexual reproduction involves the cyclic alternation of diploid and 
haploid states: diploid germ-line cells divide by meiosis to form hap- 
loid gametes, and the haploid gametes from two individuals fuse at 
fertilization to form a new diploid cell—the zygote. 


During meiosis, the maternal and paternal homologs are parceled 
out to gametes such that each gamete receives one copy of each 
chromosome. Because the segregation of these homologs occurs 
randomly, and crossing-over occurs between them, many genetically 
different gametes can be produced from a single individual. 


In addition to enhancing genetic mixing, crossing-over helps ensure 
the proper segregation of chromosomes during meiosis. 


Although most of the mechanical features of meiosis are similar to 
those of mitosis, the behavior of the chromosomes is different: meio- 
sis produces four genetically distinct haploid cells by two consecutive 
cell divisions, whereas mitosis produces two genetically identical 
diploid cells by a single cell division. 


Mendel unraveled the laws of heredity by studying the inheritance of 
a handful of discrete traits in pea plants. 


Mendel’s law of segregation states that the maternal and paternal 
alleles for each trait separate from one another during gamete forma- 
tion and then reunite randomly during fertilization. 


Mendel’s law of independent assortment states that, during gamete 
formation, different pairs of alleles segregate independently of one 
another. 


The behavior of chromosomes during meiosis explains both of 
Mendel’s laws. 


If two genes are close to each other on a chromosome, they tend 
to be inherited as a unit; if they are far apart, they will typically be 
separated by crossing-over. The frequency with which two genes 
are separated by crossovers can be used to construct a genetic map 
that shows their order on a chromosome. 


Mutant alleles can be either dominant or recessive. If a single copy 
of the mutant allele alters the phenotype of an individual that also 
possesses a wild-type allele, the mutant allele is dominant; if not, it 
is recessive. 


Complementation tests reveal whether two mutations that produce 
the same phenotype affect the same gene or different genes. 


Mutant organisms can be generated by treating animals with muta- 
gens, which damage DNA. Such mutants can then be screened to 
identify phenotypes of interest and, ultimately, to isolate the respon- 
sible genes. 


With the possible exception of identical twins, no two human genomes 
are alike. Each of us carries a unique set of polymorphisms—varia- 
tions in nucleotide sequence that in some cases contribute to our 
individual phenotypes. 


Some of the common polymorphisms—including SNPs, indels, and 
CNVs—provide useful markers for genetic mapping. 


The human genome consists of large haplotype blocks—segments of 
nucleotide sequence that have been passed down intact from our 
distant ancestors and, in most individuals, have not yet been broken 
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up by crossovers. The relative sizes of haplotype blocks can give 
us clues to our evolutionary history and help to identify common 
disease-associated alleles. 


e Many rare, inherited human diseases are due to mutations in a single 
gene. 


e The most common human disorders are due to many genes acting 
together; DNA sequencing studies are identifying mutations in these 
genes that increase the risk of developing these diseases. 


KEY TERMS 
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QUESTIONS 


QUESTION 19-6 


It is easy to see how deleterious mutations in bacteria, 
which have a single copy of each gene, are eliminated by 
natural selection: the affected bacteria die and the mutation 
is thereby lost from the population. Eukaryotes, however, 
have two copies of most genes—that is, they are diploid. 
Often an individual with two normal copies of the gene 
(homozygous normal) is indistinguishable in phenotype from 
an individual with one normal copy and one defective copy 
of the gene (heterozygous). In such cases, natural selection 
can operate only against an individual with two copies of 
the defective gene (homozygous defective). Consider the 
situation in which a defective form of the gene is lethal 
when homozygous, but without effect when heterozygous. 
Can such a mutation ever be eliminated from the population 
by natural selection? Why or why not? 


QUESTION 19-7 


Which of the following statements are correct? Explain your 
answers. 


A. The egg and sperm cells of animals contain haploid 
genomes. 


B. During meiosis, chromosomes are allocated so that each 
germ cell obtains one and only one copy of each of the 
different chromosomes. 


C. Mutations that arise during meiosis are not transmitted 
to the next generation. 
QUESTION 19-8 


What might cause chromosome nondisjunction, where 
two copies of the same chromosome end up in the same 
daughter cell? What could be the consequences of this 
event occurring (a) in mitosis and (b) in meiosis? 
QUESTION 19-9 


Why do sister chromatids have to remain paired in division | 
of meiosis? Does the answer suggest a strategy for washing 
your socks? 

QUESTION 19-10 

Distinguish between the following genetic terms: 

A. Gene and allele. 


B. Homozygous and heterozygous. 


C. Genotype and phenotype. 


D. Dominant and recessive. 


QUESTION 19-11 


You have been given three wrinkled peas, which we shall call 
A, B, and C, each of which you plant to produce a mature 
pea plant. Each of these three plants, once self-pollinated, 
produces only wrinkled peas. 


A. Given that you know that the wrinkled-pea phenotype 
is recessive, as a result of a loss-of-function mutation, what 
can you say about the genotype of each plant? 


B. How do you determine if each of the three plants carries 
a mutation in the same gene or in different genes that 
produce the phenotype? 

QUESTION 19-12 


Susan's grandfather was deaf, and passed down a hereditary 
form of deafness within Susan’s family as shown in 
Figure Q19-12. 


A. Is this mutation most likely to be dominant or recessive? 
B. Is it carried on a sex chromosome? Why or why not? 


C. A complete SNP analysis has been done for all of the 11 
grandchildren (4 affected and 7 unaffected). In comparing 
these 11 SNP results, how long a haplotype block would 
you expect to find around the critical gene? How might you 


detect it? 
iia grandfather 


Figure Q19-12 


Susan 


QUESTION 19-13 


Given that the mutation causing deafness in the family 
shown in Figure 19-26 is very rare, what is the most 
probable genotype of each of the four children in 
generation II? 


brick-red 


Figure Q19-17 


flies with other eye colors 
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QUESTION 19-14 


In the pedigree shown in Figure Q19-14, the first born in 
each of three generations is the only person affected by 
a dominant genetically inherited disease, D. Your friend 
concludes that the first child born has a greater chance of 
inheriting the mutant D allele than do later children. 


A. According to Mendel’s laws, is this conclusion plausible? 


B. What is the probability of obtaining this result by 
chance? 


C. What kind of additional data would be needed to test 
your friend's idea? 


D. Is there any way in which your friend’s hypothesis might 
turn out to be right? 


Figure O19-14 


children 


grandchildren 


great-grandchildren 


QUESTION 19-15 


Suppose one person in 100 is a carrier of a fatal recessive 
mutation, such that babies homozygous for the mutation die 
soon after birth. In a population where there are 1,000,000 
births per year, how many babies per year will be born with 
the lethal homozygous condition? 


QUESTION 19-16 


Certain mutations are called dominant-negative mutations. 
What do you think this means and how do you suppose 
these mutations act? Explain the difference between 

a dominant-negative mutation and a gain-of-function 
mutation. 


QUESTION 19-17 


Early genetic studies in Drosophila laid the foundation for 
our current understanding of genes. Drosophila geneticists 
were able to generate mutant flies with a variety of easily 
observable phenotypic changes. Alterations from the fly's 
normal brick-red eye color have a venerable history because 
the very first mutant found by Thomas Hunt Morgan was 

a white-eyed fly (Figure Q19-17). Since that time, a large 
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TABLE 019-17 COMPLEMENTATION ANALYSIS OF Drosophila EYE-COLOR MUTATIONS 


Poon [ote [oaet [ty [vernon [cory [oot [ooriot [oak [erator | 


ruby - 4 ae a + + + 
TRE 
| carnation | : 


+ indicates that progeny of a cross between individuals showing the indicated eye colors are phenotypically normal; — indicates that 
the eye color of the progeny is abnormal. 


= + + + + + + + 


number of mutant flies with intermediate eye colors have 
been isolated and given names that challenge your color 
sense: garnet, ruby, vermilion, cherry, coral, apricot, buff, 
and carnation. The mutations responsible for these eye- 
color phenotypes are all recessive. To determine whether 
the mutations affected the same or different genes, 
homozygous flies for each mutation were bred to one 
another in pairs and the eye colors of their progeny were 
noted. In Table O19-17, a + or a — indicates the phenotype 
of the progeny flies produced by mating the fly listed at the 
top of the column with the fly listed to the left of the row; 
brick-red wild-type eyes are shown as + and other colors are 
indicated as -. 


A. How is it that flies with two different eye colors—ruby 
and white, for example—can give rise to progeny that all 
have brick-red eyes? 


B. Which mutations are alleles of the same gene and which 
affect different genes? 


C. How can different alleles of the same gene give different 
eye colors? 
QUESTION 19-18 


What are single-nucleotide polymorphisms (SNPs), and 
how can they be used to locate a mutant gene by linkage 
analysis? 


Cell Communities: Tissues, 
Stem Cells, and Cancer 


Cells are the building blocks of multicellular organisms. Although this 
seems a relatively simple statement, it raises deep questions. Cells are 
not like bricks: they are small and squishy and enclosed in a flimsy mem- 
brane less than a hundred-thousandth of a millimeter thick. How, then, 
can cells be joined together robustly to construct a giraffe’s neck, a red- 
wood tree, or muscles that can support an elephant’s weight? How are all 
the different cell types in a plant or an animal produced, and how do they 
assemble so that each is in its proper place ( )? Most mysteri- 
ous of all, if cells are the building blocks, where is the builder and where 
are the architect’s plans? 


Most of the cells in multicellular organisms are organized into coopera- 
tive assemblies called tissues, such as the nervous, muscular, epithelial, 
and connective tissues found in vertebrates; tissues, in turn, are organ- 
ized into organs, such as heart, lung, brain and kidney ( ). In 
this chapter, we begin by discussing the architecture of tissues from a 
mechanical point of view. We see that tissues are composed not only of 
cells, with their internal framework of cytoskeletal filaments (discussed 
in Chapter 17), but also of extracellular matrix, the material that cells 
secrete around themselves; it is this matrix that gives supportive tissues 
such as bone or wood their strength. At the same time, cells can also 
attach to one another directly. Thus, we also discuss the cell junctions 
that link cells together in the flexible epithelial tissues of animals. These 
junctions transmit forces either from the cytoskeleton of one cell to that 
of the next, or from the cytoskeleton of a cell to the extracellular matrix. 


But there is more to the organization of tissues than mechanics. Just as 
a building needs plumbing, telephone lines, and other fittings, so an ani- 
mal tissue requires blood vessels, nerves, and other components formed 
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Figure 20-2 Cells are organized into 
tissues, and tissues often assemble into 
organs. Simplified drawing of a cross 
section through part of the wall of the 
intestine of a mammal. This long, tubelike 
organ is constructed from epithelial tissues 
(red), connective tissues (green), and muscle 
tissues (yellow). Each tissue is an organized 
assembly of cells, held together by cell-cell 
adhesions, extracellular matrix, or both. 


smooth l 
muscle| longitudinal 


Figure 20-1 Multicellular organisms are built from organized 
collections of cells. This thin section shows cells in the urine-collecting 
ducts of a human kidney. Each duct is made of closely packed 
“principal” cells, which form an epithelial tube, seen here in cross 
section as rings of cells. The ducts are embedded in an extracellular 
matrix populated by other types of cells. (Jose Luis Calvo/Shutterstock.) 


from a variety of specialized cell types. All the tissue components have 
to be appropriately organized and functionally coordinated, and many of 
them require continual maintenance and renewal. Cells die and have to 
be replaced with new cells of the right type, in the right places, and in the 
right numbers. In the third section of this chapter, we discuss how these 
processes are organized, as well as the crucial role that stem cells—self- 
renewing, undifferentiated cells—play in the renewal and repair of some 
tissues. 


Disorders of tissue renewal are a major medical concern, and those due 
to the misbehavior of mutant cells underlie the development of cancer. 
We discuss cancer in the final section of this chapter and of the book as 
a whole. The study of cancer requires a synthesis of knowledge of cells 
and tissues at every level, from the molecular biology of DNA repair to 
the principles of natural selection and the social interactions of cells in 
tissues. Many fundamental advances in cell biology have been driven by 
cancer research, and basic cell biology in return continues to deepen our 
understanding of cancer and provide us with renewed optimism about 
its treatment. 


EXTRACELLULAR MATRIX AND CONNECTIVE 
TISSUES 


Plants and animals have evolved their multicellular organization inde- 
pendently, and their tissues are constructed on different principles. 
Animals prey on other living things—and often are preyed on by other 
animals—and for these reasons they must be strong and agile: they must 
possess tissues capable of rapid movement, and the cells that form those 
tissues must be able to generate and transmit forces and to change shape 
quickly. Plants, by contrast, are sedentary: their tissues are more or less 
rigid, although their cells are weak and fragile if isolated from the stiff 
supporting matrix that surrounds them. 


In plants, the supportive matrix is called the cell wall, a boxlike structure 
that encloses, protects, immobilizes, and shapes each cell (Figure 20-3). 
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Figure 20-3 Plant tissues are strengthened 
by cell walls. (A) A cross section of part of 
the stem of the flowering plant Arabidopsis 
is shown, stained with fluorescent 

dyes that label two different cell wall 
polysaccharides—cellulose in blue, and 
pectin in green. The cells themselves are 
unstained and invisible in this preparation. 
Regions rich in both cellulose and pectin 
appear white. Pectin predominates in the 
outer layers of cells, which have only primary 
cell walls (deposited while the cell is still 
growing). Cellulose is more plentiful in 

the inner layers, which have thicker, more 
rigid secondary cell walls (deposited after 
cell growth has ceased). (B) Cells and their 
primary cell walls are clearly seen in this 
electron micrograph of the young cells in 
the root of the same plant. These cells are 
much smaller than those in the stem, as can 
be seen by the different scale bars in the two 
micrographs. (Courtesy of Paul Linstead.) 


(A) | 
20 um 


Plant cells themselves synthesize, secrete, and control the composition 
of this extracellular matrix: a cell wall can be thick and hard, as in wood, 
or thin and flexible, as in a leaf. But the principle of construction is the 
same in either case: many tiny boxes are cemented together, with a deli- 
cate cell living inside each one. Indeed, as we noted in Chapter 1, it was 
the close-packed mass of microscopic chambers that Robert Hooke saw 
in a slice of cork three centuries ago that inspired the term “cell.” 


Animal tissues are more diverse. Like plant tissues, they consist of both 
cells and extracellular matrix, but these components are organized in 
many different ways. In specialized connective tissues, such as bone or 
tendon, extracellular matrix is plentiful and mechanically all-important; 
in other tissues, such as muscle or the epidermis of the skin, extracellular 
matrix is scanty, and the cytoskeletons of the cells themselves carry the 
mechanical load. We begin this section with a brief discussion of plant 
cells and tissues before considering those of animals. 


Plant Cells Have Tough External Walls 


A naked plant cell, artificially stripped of its wall, is a delicate and vul- 
nerable thing. With care, it can be kept alive in culture; but it is easily 
ruptured, and even a small decrease in the osmotic strength of the cul- 
ture medium can cause the cell to swell and burst. Its cytoskeleton lacks 
the tension-bearing intermediate filaments found in animal cells, and as 
a result, it has virtually no tensile strength. An external wall, therefore, 
is essential. 


The plant cell wall has to be tough, but it does not necessarily have to 
be rigid. Osmotic swelling of the cell, limited by the resistance of the cell 
wall, can keep the chamber distended, and a mass of such swollen cham- 
bers cemented together forms a semirigid tissue. Such is the state of a 
crisp lettuce leaf. If water is lacking, the cells shrink and the leaf wilts. 


Most newly formed cells in a plant initially make relatively thin primary 
cell walls, which can slowly expand to accommodate cell growth (see 
Figure 20-3B). The driving force for cell growth is the same as that keep- 
ing the lettuce leaf crisp—a swelling pressure, called the turgor pressure, 
that develops as the result of an osmotic imbalance between the interior 
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Figure 20-4 A cellulose microfibril 

is made from a bundle of cellulose 
molecules. Cellulose molecules are long, 
unbranched chains of glucose. Each 
glucose subunit is inverted with respect 
to its neighbors and joined to them via a 
B1,4-linkage. The resulting disaccharide 
repeat occurs hundreds of times in each 
individual cellulose molecule. About 

16 cellulose molecules are held together 
via hydrogen bonds in a single cellulose 
microfibril, as shown. 


Figure 20-5 A scale model shows a 
portion of a primary plant cell wall. 
Cellulose microfibrils (blue) provide tensile 
strength. Other polysaccharides (red strands) 
cross-link the cellulose microfibrils, while the 
polysaccharide pectin (green strands) fills the 
spaces between the microfibrils, providing 
resistance to compression. The middle 
lamella (yellow) is rich in pectin and is the 
layer that cements one cell wall to another. 


of the plant cell and its surroundings. Once cell growth stops and the 
wall no longer needs to expand, a more rigid secondary cell wall is often 
produced (see Figure 20-3A)—either by thickening of the primary wall or 
by deposition of new layers with a different composition underneath the 
old ones. When plant cells become specialized, they generally produce 
specially adapted types of walls: waxy, waterproof walls for the surface 
epidermal cells of a leaf; hard, thick, woody walls for the xylem cells of 
the stem; and so on. 


Cellulose Microfibrils Give the Plant Cell Wall Its Tensile 
Strength 


Like all extracellular matrices, plant cell walls derive their tensile strength 
from long fibers oriented along the lines of stress. In higher plants, the 
long fibers are generally made from the polysaccharide cellulose, the 
most abundant organic macromolecule on Earth (Figure 20-4). These 
cellulose microfibrils are interwoven with other polysaccharides and 
some structural proteins, all bonded together to form a complex struc- 
ture that resists both compression and tension (Figure 20-5). In woody 
tissue, a highly cross-linked network of lignin (a complex polymer built 
from aromatic alcohol groups) is deposited within this matrix to make it 
more rigid and waterproof. 


For a plant cell to grow or change its shape, the cell wall has to stretch 
or deform. Because the cellulose microfibrils resist stretching, their ori- 
entation governs the direction in which the growing cell enlarges: if, for 
example, they are arranged circumferentially as a corset, the cell will 
grow more readily in length than in girth (Figure 20-6). By controlling the 
way that it lays down its wall, the plant cell consequently controls its own 
shape and thus the direction of growth of the tissue to which it belongs. 


Cellulose is produced in a radically different way from most other 
extracellular macromolecules. Instead of being made inside the cell and 
then exported by exocytosis (discussed in Chapter 15), it is synthesized 
on the outer surface of the cell by enzyme complexes embedded in the 
plasma membrane. These complexes transport glucose monomers from 
the cytosol across the plasma membrane and incorporate them into a set 
of growing cellulose chains at their points of membrane attachment. The 
resulting cellulose chains assemble to form a cellulose microfibril (see 
Figure 20-4). 


The paths followed by the membrane-embedded enzyme complexes dic- 
tate the orientation in which cellulose is deposited in the cell wall. But 
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what directs the enzyme complexes? Just beneath the plasma membrane, 
microtubules are aligned exactly with the cellulose microfibrils outside 
the cell. The microtubules serve as tracks that help guide the movement 
of the enzyme complexes (Figure 20-7). In this curiously indirect way, 
the cytoskeleton controls the shape of the plant cell and the modeling of 
the plant tissues. We will see that animal cells use their cytoskeleton to 
control tissue architecture in a much more direct manner. 


Animal Connective Tissues Consist Largely of 
Extracellular Matrix 


It is traditional to distinguish four major types of tissues in animals: con- 
nective, epithelial, nervous, and muscular. But the basic architectural 
distinction is between connective tissues and the rest. In connective 
tissues, extracellular matrix is abundant and carries the mechanical 
load. In other tissues, such as epithelia, extracellular matrix is sparse, 
and the cells are directly joined to one another and carry the mechanical 
load themselves. We discuss connective tissues first. 


Animal connective tissues are enormously varied. They can be tough and 
flexible like tendons or the dermis of the skin; hard and dense like bone; 
resilient and shock-absorbing like cartilage; or soft and transparent like 
the jelly that fills the interior of the eye. In all these examples, the bulk of 
the tissue is occupied by extracellular matrix, and the cells that produce 
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Figure 20-6 The orientation of cellulose 
microfibrils within the plant cell wall 
influences the direction in which the cell 
elongates. The cells in (A) and (B) start 

off with identical shapes (shown here as 
cubes) but with different orientations of 
cellulose microfibrils (blue) in their walls. 
Although turgor pressure is uniform in all 
directions, each cell tends to elongate in a 
direction perpendicular to the orientation 
of the microfibrils, which have great tensile 
strength. The final shape of an organ, such 
as a shoot, is determined by the direction in 
which its cells expand. 


Figure 20-7 Microtubules help direct the 
deposition of cellulose in the plant cell 
wall. Electron micrographs show (A) oriented 
cellulose microfibrils in a plant cell wall and 
(B) microtubules just beneath a plant cell's 
plasma membrane. (C) The orientation of 
the newly deposited extracellular cellulose 
microfibrils (dark blue strands) is determined 
by the orientation of the underlying 
intracellular microtubules (dark green). The 
large cellulose synthase enzyme complexes 
(light blue) are integral membrane proteins 
that continuously synthesize cellulose 
microfibrils on the outer face of the plasma 
membrane. The distal ends of the stiff 
microfibrils become integrated into the 
texture of the cell wall (not shown), and their 
elongation at the other end pushes the 
synthase complex along in the plane of the 
plasma membrane (blue arrow). The cortical 
array of microtubules attached to the plasma 
membrane by transmembrane proteins 
(light green vertical bars) helps determine 
the direction in which the microfibrils are 
laid down. (A, courtesy of Brian Wells and 
Keith Roberts; B, courtesy of Brian Gunning: 
from Plant Cell Biology on DVD, Information 
for Students and a Resource for Teachers. 
Springer 2009.) 
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Figure 20-8 Extracellular matrix is 
plentiful in connective tissue such as 
bone. This micrograph shows a cross 
section of bone in which the cells have been 
lost during preparation. The spaces where 
the cells had been appear as small, dark, 
antlike shapes in the bone matrix, which 
occupies most of the volume of the tissue 
and provides all its mechanical strength. 
The alternating light and dark bands are 
layers of matrix, consisting almost entirely 
of oriented fibrils of type | collagen (made 
visible with the help of polarized light). 
Calcium phosphate crystals (not visible) 
fill the interstices between the collagen 
fibrils, strengthening the bone matrix and 
hardening it like reinforced concrete. 


QUESTION 20-1 


Cells in the stem of a seedling that 
is grown in the dark orient their 
microtubules horizontally. How 
would you expect this to affect the 
growth of the plant? 


Figure 20-9 Collagen fibrils are organized 
into bundles. The drawings show the 

steps of collagen assembly, from individual 
polypeptide chains to triple-stranded 
collagen molecules, then to fibrils and, 
finally, fibers. The electron micrograph 
shows fully assembled collagen fibers in the 
connective tissue of embryonic chick skin. 
The fibrils are bundled into fibers, some 
running in the plane of the section, others 
approximately at right angles to it. The 

cell in the micrograph is a fibroblast, which 
secretes collagen and other extracellular 
matrix components. (Photograph from 

C. Ploetz, E.l. Zycband, and D.E. Birk, 

J. Struct. Biol. 106:73-81, 1991. With 
permission from Elsevier.) 
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the matrix are scattered within it like raisins in a pudding (Figure 20-8); 
the tensile strength—whether great or small—is chiefly provided not by a 
polysaccharide, as it is in the cell wall of plants, but by fibrous proteins, 
principally collagens. The various types of connective tissues owe their 
specific characters to the type of collagen that they contain, to its quan- 
tity, and, most importantly, to the other molecules that are interwoven 
with it in varying proportions. These other molecules include the rubbery 
protein elastin, which gives the walls of arteries their resilience as blood 
pulses through them, as well as a host of specialized polysaccharide mol- 
ecules, which we discuss shortly. 


Collagen Provides Tensile Strength in Animal Connective 
Tissues 


The collagens are a family of proteins that come in many varieties. 
Mammals have over 40 different collagen genes coding for the various 
collagens that support the structure and function of different tissues. 
Collagens are the chief proteins in bone, tendon, and skin (leather is 
pickled collagen), and they constitute 25% of the total protein mass in a 
mammal—more than any other type of protein. Type I collagen, which is 
the most abundant, accounts for 90% of the body's collagen. 


The characteristic feature of a typical collagen molecule is its long, 
stiff, triple-stranded helical structure, in which three collagen polypep- 
tide chains are wound around one another in a ropelike superhelix (see 
Figure 4-29A). Some types of collagen molecules in turn assemble into 
ordered polymers called collagen fibrils, which are thin cables 10-300 nm 
in diameter and many micrometers long; these can pack together into still 
thicker collagen fibers (Figure 20-9). Other types of collagen molecules 
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decorate the surface of collagen fibrils and link the fibrils to one another 
and to other components in the extracellular matrix. 


The connective-tissue cells that manufacture and inhabit the extracel- 
lular matrix go by various names according to their tissue type: in skin, 
tendon, and many other connective tissues, they are called fibroblasts 
(see Figure 20-9); in bone, they are called osteoblasts. These cells make 
both the collagen and the other macromolecules of the matrix. Almost 
all of these molecules are synthesized intracellularly and then secreted 
in the standard way by exocytosis (discussed in Chapter 15). Outside the 
cell, they assemble into huge, cohesive aggregates. If assembly were to 
occur prematurely, before secretion, the cell would become choked with 
its own products. In the case of collagen, the cells avoid this catastrophe 
by secreting collagen molecules in a precursor form, called procolla- 
gen, which has additional peptide extensions at each end that obstruct 
premature assembly into collagen fibrils. Extracellular enzymes—called 
procollagen proteinases—cut off these terminal extensions to allow 
assembly only after the molecules have emerged into the extracellular 
space (Figure 20-10). 


Some people have a genetic defect in one of the extracellular protein- 
ases, So that their collagen fibrils do not assemble correctly. As a result, 
their connective tissues have a lower tensile strength and are extraordi- 
narily stretchable (Figure 20-11). 


Cells in tissues have to be able to degrade extracellular matrix as well as 
make it. This ability is essential for tissue growth, repair, and renewal; 
it is also important where migratory cells, such as macrophages, need 
to burrow through the thicket of collagen and other matrix polymers. 
Matrix proteases that cleave extracellular proteins play a part in many 
disease processes, ranging from arthritis, where they contribute to the 
breakdown of cartilage in affected joints, to cancer, where they help can- 
cer cells invade normal tissue. 


Cells Organize the Collagen They Secrete 


To do their job, collagen fibrils must be correctly aligned. In skin, for 
example, they are woven in a wickerwork pattern, or in alternating layers 
with different orientations so as to resist tensile stress in multiple direc- 
tions (Figure 20-12). In tendons, which attach muscles to bone, they are 
aligned in parallel bundles along the major axis of tension. 


The connective-tissue cells that produce collagen control this orienta- 
tion, first by depositing the collagen in an oriented fashion and then by 
rearranging it. During development of the tissue, fibroblasts work on the 
collagen they have secreted, crawling over it and pulling on it—helping 
to compact it into sheets and draw it out into cables. This mechanical 
role of fibroblasts in shaping collagen matrices has been demonstrated 
dramatically in cell culture. When fibroblasts are mixed with a mesh- 
work of randomly oriented collagen fibrils that form a gel in a culture 
dish, the fibroblasts tug on the meshwork, drawing in collagen from 
their surroundings and compacting it. If two small pieces of embryonic 
tissue containing fibroblasts are placed far apart on a collagen gel, the 


Figure 20-10 Procollagen precursors are cleaved to form mature 
collagen outside the cell. Collagen is synthesized as a procollagen 
molecule that has unstructured peptides at either end. These peptides 
prevent collagen fibrils from assembling inside the fibroblast. When the 
procollagen is secreted, extracellular procollagen proteinases remove 
its terminal peptides, producing mature collagen molecules. These 
molecules can then self-assemble into ordered collagen fibrils (see also 
Figure 20-9). 


QUESTION 20-2 


Mutations in the genes encoding 
collagens often have detrimental 
consequences, resulting in severely 
crippling diseases. Particularly 
devastating are mutations that 
change glycines, which are required 
at every third position in the 
collagen polypeptide chain so that it 
can assemble into the characteristic 
triple-helical rod (see Figure 

20-9). Would you expect collagen 
mutations to be detrimental if only 
one of the two copies of a collagen 
gene is defective? 
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Figure 20-11 Incorrect collagen assembly can cause the skin to 

be hyperextensible. James Morris, “the elastic skin man,” from a 
photograph taken in about 1890. Abnormally stretchable skin is part 
of a genetic syndrome that results from a defect in collagen assembly. 
In some individuals, this condition arises from a lack of an enzyme that 
converts procollagen to collagen; in others, it is caused by a defect in 
procollagen itself. 


intervening collagen becomes organized into a dense band of aligned fib- 
ers that connect the two explants (Figure 20-13). The fibroblasts migrate 
out from the explants along the aligned collagen fibers. In this way, the 
fibroblasts influence the alignment of the collagen fibers, and the collagen 
fibers in turn affect the distribution of the fibroblasts. Fibroblasts presum- 
ably play a similar role in generating long-range order in the extracellular 
matrix inside the developing body—in helping to create tendons, for 
example, and the tough, dense layers of connective tissue that ensheathe 
and bind together most organs. Fibroblast migration is also important for 
healing wounds (Movie 20.1). 


Integrins Couple the Matrix Outside a Cell to the 
Cytoskeleton Inside It 


Cells are able to interact with the collagen in the extracellular matrix 
thanks to a family of transmembrane receptor proteins called integ- 
rins. The extracellular domain of an integrin binds to components of the 
matrix, while its intracellular domain interacts with the cell cytoskeleton. 
This internal mooring provides a strong and stable point of attachment; 
without it, integrins would be easily torn from the flimsy lipid bilayer, and 
cells would be unable to anchor themselves to the matrix. 


Integrins do not, however, interact directly with collagen fibers in the 
extracellular matrix. Instead, another extracellular matrix protein, 
fibronectin, provides a linkage: part of the fibronectin molecule binds to 
collagen, while another part forms an attachment site for integrins. 


When the extracellular domain of the integrin binds to fibronectin, the 
intracellular domain binds (through a set of adaptor molecules) to an 
actin filament inside the cell (Figure 20-14). For many cells, it is the for- 
mation and breakage of these attachments on either end of an integrin 
molecule that allows the cell to crawl through a tissue, grabbing hold of 
the matrix at its front end and releasing its grip at the rear (see Figure 
17-33). Integrins coordinate these “catch-and-release” maneuvers by 
undergoing remarkable conformational changes. Binding to a molecule 
on one side of the plasma membrane causes the integrin molecule to 
stretch out into an extended, activated state so that it can then latch 
onto a different molecule on the opposite side—an effect that operates 
in either direction across the membrane (Figure 20-15). Thus, an intra- 
cellular signaling molecule can activate the integrin from the cytosolic 
side, causing it to reach out and grab hold of an extracellular structure. 
Similarly, binding to an external structure can switch on a variety of intra- 
cellular signaling pathways by activating protein kinases that associate 
with the intracellular end of the integrin. In this way, a cell’s external 
attachments can help regulate its behavior—and even its survival. 


Figure 20-12 Collagen fibrils in the skin of some animals are 
arranged in a plywoodlike pattern. The electron micrograph shows 

a cross section of tadpole skin. Successive layers of fibrils are laid 
down nearly at right angles to each other (see also Figure 20-9). This 
arrangement is also found in mature bone and in the cornea, but not in 
mammalian skin. (Courtesy of Jerome Gross.) 
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Figure 20-13 Fibroblasts influence 

the alignment of collagen fibers. This 
micrograph shows a region between two 
pieces of embryonic chick heart (rich in 
fibroblasts and heart muscle cells), which 
have grown in culture on a collagen gel for 
four days. A dense tract of aligned collagen 
fibers has formed between the explants, 
presumably as a result of the fibroblasts, 
which have proliferated and migrated out 
from the explants, tugging on the collagen. 
Elsewhere in the culture dish, the collagen 
remains disorganized and unaligned, so that 
it appears uniformly gray. (From D. Stopak 
and A.K. Harris, Dev. Biol. 90:383-398, 1982. 
With permission from Elsevier.) 
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Humans make at least 24 kinds of integrins, each of which recognizes 
distinct extracellular molecules and has distinct functions, depending 
on the cell type in which it resides. For example, the integrins on white 
blood cells (leukocytes) help the cells crawl out of blood vessels at sites 
of infection so as to deal with marauding microbes. People who lack this 
type of integrin develop a disease called leucocyte adhesion deficiency and 
suffer from repeated bacterial infections. A different form of integrin is 
found on blood platelets, and individuals who lack this integrin bleed 
excessively because their platelets cannot bind to the necessary blood- 


clotting protein in the extracellular matrix. 
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Figure 20-14 Fibronectin and transmembrane integrin proteins help attach a cell to the extracellular matrix. Fibronectin 
molecules bind to collagen fibrils outside the cell. Integrins in the plasma membrane bind to the fibronectin and tether it to the 
cytoskeleton inside the cell. (A) Diagram and (B) electron micrograph of a molecule of fibronectin. (C) The transmembrane linkage 
mediated by an integrin protein (blue and green dimer). The integrin molecule transmits tension across the plasma membrane: it 

is anchored inside the cell via adaptor proteins to the actin cytoskeleton and externally via fibronectin to other extracellular matrix 
proteins, such as the collagen fibril shown. The integrin shown here links fibronectin to an actin filament inside the cell. Other integrins 
can connect different extracellular proteins to the cytoskeleton (usually to actin filaments, but sometimes to intermediate filaments). 
(B, from J. Engel et al., J. Mol. Biol. 150:97-120, 1981. With permission from Elsevier.) 
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Figure 20-15 An integrin protein switches 
to an active conformation when it binds 
to molecules on either side of the plasma 
membrane. An integrin protein consists 

of two different subunits, a (green) and B 
(blue), both of which can switch between 

a folded, inactive form and an extended, 
active form. The switch to the activated 
state can be triggered by binding to an 
extracellular matrix molecule (such as 
fibronectin) or to intracellular adaptor 
proteins that then link the integrin to the 
cytoskeleton (see Figure 20-14). In both 
cases, the conformational change alters 
the integrin so that its opposite end 
rapidly forms a counterbalancing 
attachment to the appropriate structure. 

In this way, the integrin establishes a 
reversible mechanical linkage across the 
plasma membrane. (Based on T. Xiao et al., 
Nature 432:59-67, 2004.) 


Figure 20-16 Glycosaminoglycans (GAGs) 
are built from repeating disaccharide 
units. Hyaluronan, a relatively simple GAG, 
is shown here. It consists of a single long 
chain of up to 25,000 repeated disaccharide 
units, each carrying a negative charge 

(red). As in other GAGs, one of the sugar 
monomers (green) in each disaccharide 
unit is an amino sugar. Many GAGs have 
additional negative charges, often from 
sulfate groups (not shown). 
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Gels of Polysaccharides and Proteins Fill Spaces and 
Resist Compression 


While collagen provides tensile strength to resist stretching, a completely 
different group of macromolecules in the extracellular matrix of animal 
tissues provides the complementary function, resisting compression. 
These are the glycosaminoglycans (GAGs), negatively charged poly- 
saccharide chains made of repeating disaccharide units (Figure 20-16). 
Chains of GAGs are usually covalently linked to a core protein to form 
proteoglycans, which are extremely diverse in size, shape, and chem- 
istry. Typically, many GAG chains are attached to a single core protein 
that may, in turn, be linked to another GAG, creating a macromolecule 
that resembles a bottlebrush. Aggrecan in cartilage, for example, is one 
of the most abundant proteoglycans; it has more than 100 GAG chains on 
a single core protein, and it interacts extracellularly with another GAG, 
hyaluronan (see Figure 20-16), creating an enormous aggregate with a 
molecular weight in the millions (Figure 20-17). 


In dense, compact connective tissues such as tendon and bone, the 
proportion of GAGs is small, and the matrix consists almost entirely of 
collagen (or, in the case of bone, of collagen plus calcium phosphate crys- 
tals). At the other extreme, the jellylike substance in the interior of the eye 
consists almost entirely of one particular type of GAG, plus water, with 
only a small amount of collagen. In general, GAGs are strongly hydro- 
philic and tend to adopt highly extended conformations, which occupy a 
huge volume relative to their mass (see Figure 20-17). Thus GAGs act as 
effective “space fillers” in the extracellular matrix of connective tissues. 
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Figure 20-17 Proteoglycans and GAGs can form large aggregates. (A) Electron micrograph of an aggrecan- 
hyaluronan aggregate from cartilage, spread out on a flat surface. (B) Schematic drawing of the giant aggregate 
illustrated in (A), showing how it is built up from aggrecan subunits bristling with numerous GAG chains—chondroitin 
sulfate (long blue lines) and keratan sulfate (short blue lines)—attached to a core protein (light green). These 


subunits then aggregate via link proteins (green) to the GAG hyaluronan (blue). The mass of such a complex can be 


108 daltons or more, and it occupies a volume equivalent to that of a bacterium, which is about 2 x 10712 cm3. 


(A, from J.A. Buckwalter, PJ. Roughley, and L.C. Rosenberg, Microscopy Research & Technique 28:398-408, 1994. 
With permission from John Wiley & Sons.) 


Even at very low concentrations, GAGs form hydrophilic gels: their mul- 
tiple negative charges attract a cloud of cations, such as Nat, that are 
osmotically active, causing large amounts of water to be sucked into the 
matrix. This gives rise to a swelling pressure, which is balanced by ten- 
sion in the collagen fibers interwoven with the GAGs. When the matrix 
is rich in collagen and large quantities of GAGs are trapped in the mesh, 

both the swelling pressure and the counterbalancing tension are enor- QUESTION 20-3 

mous. Such a matrix is tough, resilient, and resistant to compression. The 
cartilage matrix that lines the knee joint, for example, has this character: 
it can support pressures of hundreds of kilograms per square centimeter. 


Proteoglycans are characterized by 
the abundance of negative charges 
on their sugar chains. How would 


Proteoglycans perform many sophisticated functions in addition to pro- the properties of these molecules 
viding hydrated space around cells. They can form gels of varying pore differ if the negative charges were 
size and charge density that act as filters to regulate the passage of mol- not as abundant? 


ecules through the extracellular medium. They can bind secreted growth 
factors and other proteins that serve as extracellular signals for cells. 
They can block, encourage, or guide cell migration through the matrix. 
In all these ways, the matrix components influence the behavior of cells, 
often the same cells that make the matrix—a reciprocal interaction that 
has important effects on cell differentiation and the arrangement of cells 
in a tissue. Much remains to be learned about how cells weave the tapes- 
try of matrix molecules and how the chemical messages they deposit in 
this intricate biochemical fabric are organized and act. 


EPITHELIAL SHEETS AND CELL JUNCTIONS 


There are more than 200 visibly different cell types in the body of a 
vertebrate. The majority of these are organized into epithelia (singu- 
lar epithelium)—multicellular sheets in which adjacent cells are joined 
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Figure 20-18 Cells can be packed 
together in different ways to form an 
epithelial sheet. Micrographs along 
with drawings show four basic types 

of epithelia. In each case, the cells are 
sitting on a thin mat of extracellular 
matrix, the basal lamina (yellow), as 
discussed shortly. (From D.W. Fawcett, 
A Textbook of Histology, 12th ed. 1994. 
With permission from Taylor & Francis 
Books UK.) 
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Figure 20-19 A sheet of epithelial 
cells has an apical and a basal 
surface. The basal surface sits on 

a specialized sheet of extracellular 
matrix called the basal lamina, while 
the apical surface is free. 
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tightly together. In some cases, the sheet is many cells thick, or stratified, 
as in the epidermis (the outer layer of the skin); in other cases, it is a simple 
epithelium, only one cell thick, as in the lining of the gut. The epithelial 
cells, themselves, can also take many forms. They can be tall and colum- 
nar, squat and cuboidal, or flat and squamous (Figure 20-18). Within a 
given sheet, the cells may be all the same type or a mixture of differ- 
ent types. Some epithelia, like the epidermis, act mainly as a protective 
barrier; others have complex biochemical functions. Some secrete spe- 
cialized products such as hormones, milk, or tears; others, such as the 
epithelium lining the gut, absorb nutrients; yet others detect signals, such 
as light, sensed by the layer of photoreceptors in the retina of the eye, or 
sound, sensed by the epithelium containing the auditory hair cells in the 
ear (see Figure 12-28). 


Despite these and many other variations, one can recognize a standard 
set of features that virtually all animal epithelia share. Epithelia cover 
the external surface of the body and line all its internal cavities, and they 
must have been an early feature in the evolution of animals. Cells joined 
together into an epithelial sheet create a barrier, which has the same 
significance for the multicellular organism that the plasma membrane 
has for a single cell. It keeps some molecules in, and others out; it takes 
up nutrients and exports wastes; it contains receptors for environmental 
signals; and it protects the interior of the organism from invading micro- 
organisms and fluid loss. 


The arrangement of cells into epithelia is so commonplace that we some- 
times take it for granted. Yet, as we discuss in this section, establishing 
epithelia requires a collection of specialized structures and molecular 
devices, which are common to a wide variety of epithelial cell types. 


Epithelial Sheets Are Polarized and Rest on a Basal 
Lamina 


An epithelial sheet has two faces: the apical surface is free and exposed 
to the air or to a bodily fluid; the basal surface is attached to a sheet 
of connective tissue called the basal lamina (Figure 20-19). The basal 
lamina consists of a thin, tough sheet of extracellular matrix, composed 
mainly of a specialized type of collagen (type IV collagen) and a pro- 
tein called laminin (Figure 20-20). Laminin provides adhesive sites for 
integrin molecules in the basal plasma membranes of epithelial cells, and 
it thus serves a linking role like that of fibronectin in other connective 
tissues. 


The apical and basal faces of an epithelium are different: each contains a 
different set of molecules that reflect the polarized organization of the indi- 
vidual epithelial cells: each has atop anda bottom, with different properties 
and functions. This polarity is crucial for epithelial function. Consider, for 
example, the simple columnar epithelium that lines the small intestine of 
a mammal. It mainly consists of two intermingled cell types: absorptive 
cells, which take up nutrients, and goblet cells (so called because of their 
shape), which secrete the mucus that protects and lubricates the gut lining 
(Figure 20-21). Both cell types are polarized. The absorptive cells import 
food molecules from the gut lumen through their apical surface and export 
these molecules from their basal surface into the underlying tissues. To 
do this, absorptive cells require different sets of membrane transport pro- 
teins in their apical and basal plasma membranes (see Figure 12-17). 
The goblet cells also have to be polarized, but in a different way: they 
have to synthesize mucus and then discharge it from their apical end 
only (see Figure 20-21); their Golgi apparatus, secretory vesicles, and 
cytoskeleton are all polarized so as to bring this about. For both types 
of epithelial cells, polarity depends on the junctions that the cells form 
with one another and with the basal lamina. These cell junctions in turn 
control the arrangement of an elaborate system of membrane-associated 
intracellular proteins that create the polarized organization of the cyto- 
plasm, as we discuss next. 


Tight Junctions Make an Epithelium Leakproof and 
Separate Its Apical and Basolateral Surfaces 


Epithelial cell junctions can be classified according to their function. 
Some provide a tight seal to prevent the leakage of molecules across 
the epithelium through the gaps between its cells; some provide strong 
mechanical attachments; and some provide for an intimate type of inter- 
cytosolic exchange. In most epithelia, all these types of junctions are 
present. As we will see, each type of junction is characterized by its own 
class of transmembrane proteins. 


In vertebrates, the barrier function of epithelial sheets is made possible 
by tight junctions. These junctions seal neighboring cells together so 
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Figure 20-20 The basal lamina supports a 
sheet of epithelial cells. Light micrograph 
of the epithelial sheet that lines the small 
intestine. The sheet of columnar cells sits 
on a thin mat-like structure, the basal lamina 
(red arrowheads), which is woven from type 
IV collagen and laminin proteins. A network 
of other collagen fibrils and fibers in the 
underlying connective tissue interacts with 
the lower face of the lamina. (Jose Luis 
Calvo/Shutterstock.) 


Figure 20-21 Different types of 
functionally polarized cell types line the 
intestine. Absorptive cells, which take up 
nutrients from the intestine, are mingled 

in the gut epithelium with goblet cells 
(brown), which secrete mucus into the gut 
lumen. The absorptive cells are often called 
brush-border cells, because of the brushlike 
mass of microvilli on their apical surface; 
the microvilli serve to increase the area of 
apical plasma membrane for the transport 
of small molecules into the cell. The goblet 
cells owe their gobletlike shape to the mass 
of mucus-containing secretory vesicles 

that distends the cytoplasm in their apical 
region. (Adapted from R. Krstić, Human 
Microscopic Anatomy. Berlin: Springer, 1991. 
With permission from Springer-Verlag.) 
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Figure 20-22 Tight junctions allow epithelial cell sheets to serve as barriers to 7o 4 
solute diffusion. (A) Schematic drawing showing how a small, extracellular tracer 5° 
molecule (yellow) added on one side of an epithelial cell sheet cannot traverse the 
tight junctions that seal adjacent cells together. (B) Electron micrographs of cells in an 
epithelium where a small, extracellular tracer molecule (dark stain) has been added to 
either the apical side (on the left) or the basolateral side (on the right); in both cases, [= 
the tracer is stopped by the tight junction. (C) A simplified model of the structure CELL CELL 03 im 
of a tight junction, showing how the cells are sealed together by branching strands (C) 1 2 


of transmembrane proteins (green), called claudins and occludins, in the plasma 
membranes of the interacting cells. Each type of protein binds to the same type in 
the apposed membrane (not shown). (B, courtesy of Daniel Friend, by permission of 
E.L. Bearer.) 


that water-soluble molecules cannot easily leak between them. If a small 
tracer molecule is added to one side of an epithelial cell sheet, it will usu- 
ally not pass beyond the tight junction (Figure 20-22A and B). The tight 
junction is formed from proteins called claudins and occludins, which 
are arranged in strands along the lines of the junction to create the seal 
(Figure 20-22C). Without tight junctions to prevent leakage, the pump- 
ing activities of absorptive cells such as those in the gut would be futile, 
and the composition of the extracellular fluid would become the same on 
both sides of the epithelium. 


Tight junctions also play a key part in maintaining the polarity of the 
individual epithelial cells in two ways. First, the tight junctions around 
the apical region of each cell prevent diffusion of proteins in the plasma 
membrane and so keep the contents of apical domain of the plasma 
membrane separate—and different—from the basolateral domain (see 
Figure 11-32). Second, in many epithelia, the tight junctions are sites 
of assembly for the complexes of intracellular proteins that govern the 
apico-basal polarity of the cell interior. 


Cytoskeleton-linked Junctions Bind Epithelial Cells 
Robustly to One Another and to the Basal Lamina 


The cell junctions that hold an epithelium together by forming strong 
mechanical attachments are of three main types. Adherens junctions and 
desmosomes bind one epithelial cell to another, while hemidesmosomes 
bind epithelial cells to the basal lamina. All of these junctions provide 
mechanical strength to the epithelium by the same strategy: the proteins 
that form the junctions span the plasma membrane and are linked inside 
the cell to cytoskeletal filaments. In this way, the cytoskeletal filaments 
are tied into a network that extends from cell to cell across the whole 
expanse of the epithelial sheet. 


Adherens junctions and desmosomes are both built around transmem- 
brane proteins that belong to the cadherin family: a cadherin molecule in 
the plasma membrane of one cell binds directly to an identical cadherin 
molecule in the plasma membrane of its neighbor (Figure 20-23). Such 
interaction of like-with-like is called homophilic binding. In the case of 
cadherins, binding also requires that Ca?* be present in the extracellular 
medium—hence the name. 


At an adherens junction, each cadherin molecule is tethered inside its 
cell, via several linker proteins, to actin filaments. Often, the adherens 
junctions form a continuous adhesion belt around each of the interact- 
ing epithelial cells; this belt is located near the apical end of the cell, 
just below the tight junctions (Figure 20-24). Bundles of actin filaments 
are thus connected from cell to cell across the epithelium. This network 
of actin filaments also contains myosin filaments and can thus contract, 
giving the epithelial sheet the capacity to develop tension and to change 
its shape in remarkable ways. By shrinking the apical surface of an epi- 
thelial sheet along one axis, the sheet can roll itself up into a tube (Figure 
20-25A and B). Alternatively, by shrinking its apical surface locally along 
all axes at once, the sheet can invaginate into a cup and eventually create 
a spherical vesicle by pinching off from the rest of the epithelium (Figure 
20-25C). Epithelial movements such as these are crucial during embry- 
onic development, when they create structures such as the neural tube 
(see Figure 20-25B), which gives rise to the brain and spinal cord, and the 
lens vesicle, which develops into the lens of the eye (see Figure 20-25C). 


At a desmosome, a different set of cadherin molecules connects to kera- 
tin filaments—the intermediate filaments found specifically in epithelial 
cells (see Figure 17-5). Bundles of ropelike keratin filaments criss-cross 
the cytoplasm and are “spot-welded” via desmosome junctions to 
the bundles of keratin filaments in adjacent cells (Figure 20-26). This 
arrangement confers great tensile strength to the epithelial sheet and is 
characteristic of tough, exposed epithelia such as the epidermis of the 
skin. 
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Figure 20-23 Cadherin proteins mediate 
mechanical attachment of one cell to 
another. Identical cadherin molecules in the 
plasma membranes of adjacent cells bind 
to each other extracellularly; inside the cell, 
they are attached, via linker proteins, to 
cytoskeletal filaments—either actin 
filaments or keratin intermediate filaments. 
Movie 20.2 shows how, when epithelial 
cells in culture touch one another, their 
cadherins become concentrated at the 
point of attachment, leading to the 
formation of adherens junctions. 


Figure 20-24 Adherens junctions form 
adhesion belts around epithelial cells in 
the small intestine. A contractile bundle of 
actin filaments runs along the cytoplasmic 
surface of the plasma membrane near 

the apex of each cell. These bundles 

are linked to those in adjacent cells via 
transmembrane cadherin molecules (see 
Figure 20-23). 
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Figure 20-26 Desmosomes link the 
keratin intermediate filaments of one 
epithelial cell to those of another. 

(A) An electron micrograph of a 
desmosome joining two cells in the 
epidermis of newt skin, showing the 
attachment of keratin filaments. 

(B) Schematic drawing of a desmosome. 
On the cytoplasmic surface of each 
interacting plasma membrane is a 
dense plaque composed of a mixture 
of intracellular linker proteins. A bundle 
of keratin filaments is attached to the 
surface of each plaque. The cytoplasmic 
tails of transmembrane cadherin proteins 
bind to the outer face of each plaque; 
their extracellular domains interact 

to hold the cells together. (A, from 

D.E. Kelly, J. Cell Biol. 28:51-72, 1966. 
With permission from The Rockefeller 
University Press.) 
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Figure 20-25 Epithelial sheets can bend to form an epithelial tube or vesicle. 
Contraction of apical bundles of actin filaments linked from cell to cell via adherens 
junctions causes the epithelial cells to narrow at their apex. Depending on whether 
the contraction of the epithelial sheet is oriented along one axis, or is equal in 

all directions, the epithelium will either roll up into a tube or invaginate to form a 
vesicle, respectively. (A) Diagram showing how an apical contraction along one axis 
of an epithelial sheet can cause the sheet to form a tube. (B) Scanning electron 
micrograph of a cross section through the trunk of a two-day chick embryo, showing 
the formation of the neural tube by the process shown in (A). Part of the epithelial 
sheet that covers the surface of the embryo has thickened and rolled up by apical 
contraction; the opposing folds are about to fuse, after which the structure will pinch 
off to form the neural tube. (C) Scanning electron micrograph of a chick embryo 
showing the formation of the eye cup and lens. A patch of surface epithelium 
overlying the forming eye cup has become concave and has pinched off as a 
separate vesicle—the lens vesicle—within the eye cup. This process is driven by an 
apical narrowing of epithelial cells in all directions. (B, courtesy of Jean-Paul Revel; 
C, courtesy of K.W. Tosney.) 


Blisters are a painful reminder that it is not enough for epidermal cells 
to be firmly attached to one another: they must also be anchored to the 
underlying connective tissue. As we noted earlier, the anchorage is medi- 
ated by integrins in the cells’ basal plasma membranes. The extracellular 
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domains of these integrins bind to laminin in the basal lamina; inside 
the cell, the integrin tails are bound via linker proteins to keratin fila- 
ments, creating a structure that looks superficially like half a desmosome. 
These attachments of epithelial cells to the basal lamina beneath them 
are therefore called hemidesmosomes (Figure 20-27). 


Gap Junctions Allow Cytosolic Inorganic lons and Small 
Molecules to Pass from Cell to Cell 


The final type of epithelial cell junction, found in virtually all epithelia 
and in many other types of animal tissues, serves a totally different pur- 
pose from the junctions discussed so far. In the electron microscope, gap 
junctions appear as regions where the plasma membranes of two cells 
lie close together and exactly parallel, with a very narrow gap of 2-4 nm 
between them. The gap, however, is not entirely empty; it is spanned 
by the protruding ends of many identical, transmembrane protein com- 
plexes that reside in the plasma membranes of the two apposed cells. 
These complexes, called connexons, are aligned end-to-end to form nar- 
row, water-filled channels across the interacting membranes (Figure 
20-28). The channels allow inorganic ions and small, water-soluble mol- 
ecules (up to a molecular mass of about 1000 daltons) to move directly 
from the cytosol of one cell to the cytosol of the other. This flow creates 
an electrical and a metabolic coupling between the cells. Gap junctions 
between cardiac muscle cells, for example, provide the electrical cou- 
pling that allows electrical waves of excitation to spread synchronously 
through the heart, triggering the coordinated contraction of the cells that 
produces each heart beat. 


Gap junctions in many tissues can be opened or closed in response to 
extracellular or intracellular signals. The neurotransmitter dopamine, for 
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Figure 20-27 Hemidesmosomes anchor 
the keratin filaments in an epithelial 

cell to the basal lamina. The linkage is 
mediated by a transmembrane attachment 
complex containing integrins, rather than 
cadherins. 


QUESTION 20-4 


Analogs of hemidesmosomes are 
the focal contacts described in 
Chapter 17, which are also sites 
where the cell attaches to the 
extracellular matrix. These junctions 
are prevalent in fibroblasts but 
largely absent in epithelial cells. On 
the other hand, hemidesmosomes 
are prevalent in epithelial cells but 
absent in fibroblasts. In focal contact 
sites, intracellular connections are 
made to actin filaments, whereas, 

in hemidesmosomes, connections 
are made to intermediate filaments. 
Why do you suppose these two 
different cell types attach differently 
to the extracellular matrix? 


Figure 20-28 Gap junctions provide 
neighboring cells with a direct channel of 
intercytosolic communication. (A) Electron 
micrograph of a gap junction between 

two cells in culture. (B) A model of a gap 
junction. The drawing shows the interacting 
plasma membranes of two adjacent cells. 
The apposed membranes are penetrated 
by protein assemblies called connexons 
green), each of which is formed from six 
identical protein subunits. Two connexons 
join across the intercellular gap to form an 
aqueous channel connecting the cytosols 
of the two cells. (A, from N.B. Gilula, in Cell 
Communication [R.P. Cox, ed.], pp. 1-29. 
New York: Wiley, 1974. With permission 
from John Wiley & Sons, Inc.) 
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Figure 20-29 Extracellular signals injected neuron 
can regulate the permeability of gap 
junctions. (A) A neuron in a rabbit retina 
(center) was injected with a dye (dark 

stain) that passes readily through gap 
junctions. The dye diffuses rapidly from 

the injected cell to label the surrounding 
neurons, which are connected by gap 
junctions. (B) Treatment of the retina with 
the neurotransmitter dopamine prior to dye 
injection decreases the permeability of the 
gap junctions and hampers the spread of 
the dye. (Courtesy of David Vaney.) 
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Gap junctions are dynamic structures 
that, like conventional ion channels, example, reduces gap-junction communication between certain neurons 
are aes P cell close bya in the mammalian retina when secreted in response to an increase in 
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in response to changes in the cell. 
The permeability of gap junctions 
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alters the pattern of electrical signaling and helps the retina switch from 
using rod photoreceptors, which are good detectors of low light levels, 
to cone photoreceptors, which detect color and fine detail in bright light. 
The function of gap junctions—and of the other junctions found in animal 


why this form of regulation by Ca2* cells—are summarized in Figure 20-30. 
we e important for üns heel ci Plant tissues lack all the types of cell junctions we have discussed so far, 
a tissue. 


as their cells are held together by their cell walls. Importantly, however, 
they have a functional counterpart of the gap junction. The cytoplasms of 
adjacent plant cells are connected via minute communicating channels 
called plasmodesmata, which span the intervening cell walls. In contrast 
to gap-junction channels, plasmodesmata are cytoplasmic channels lined 
Figure 20-30 Several types of cell with plasma membrane (Figure 20-31). Thus in plants, the cytoplasm is, 
junctions are found in epithelia in animals. in principle, continuous from one cell to the next, allowing the passage of 
Whereas tight junctions are peculiar to both landi leculces includi tei d lat 
eein oha oo a oon oth small and large molecules—including some proteins and regulatory 
RNAs. The controlled traffic of transcription regulators and regulat- 
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Figure 20-31 The cytoplasms of adjacent plant cells are connected via plasmodesmata. (A) The intercytoplasmic 
channels of plasmodesmata pierce the plant cell walls and connect the interiors of all cells in a plant. (B) Each 
plasmodesma is lined with plasma membrane common to the two connected cells. It usually also contains a fine tubular 
structure, the desmotubule, derived from smooth endoplasmic reticulum. (C) Micrograph of plasmodesmata. (C, courtesy 


of L.G. Tilney.) 


STEM CELLS AND TISSUE RENEWAL 


One cannot contemplate the organization of tissues without wondering 
how these astonishingly patterned structures come into being. This ques- 
tion raises an even more challenging one—a puzzle that is one of the 
most ancient and fundamental in all of biology: how is a complex multi- 
cellular organism generated from a single fertilized egg? 


In the process of development, the fertilized egg cell divides repeatedly to 
give a clone of cells—about 10,000,000,000,000 for a human—essentially 
all containing the same genome but specialized in different ways. This 
clone has a remarkable structure. It may take the form of a daisy or an 
oak tree, a sea urchin, a whale, or a mouse. The structure is determined 
by the genome that the fertilized egg contains (Figure 20-32). The linear 
sequence of A, G, C, and T nucleotides in the DNA directs the production 
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Figure 20-32 The genome of the 
fertilized egg determines the ultimate 
structure of the clone of cells that will 
develop from it. (A and B) A sea-urchin 
egg gives rise to a sea urchin; (C and D) 
a mouse egg gives rise to a mouse. 

(A, courtesy of David McClay; B, courtesy 
of Alaska Department of Fish and Game; 
C, courtesy of Patricia Calarco; D, US 
Department of Agriculture, Agricultural 
Research Service.) 
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of a variety of distinct cell types, each expressing different sets of genes 
and arranged in a precise, intricate, three-dimensional pattern. No one 
builds these amazing organisms: they self-assemble during development. 


Although the final structure of an animal’s body may be enormously com- 
plex, it is generated by a limited repertoire of cell activities. Examples 
of all these activities have been discussed in earlier pages of this book. 
Cells grow, divide, migrate, establish connections with other cells, and 
die. They form mechanical attachments and generate forces that bend 
epithelial sheets (see Figure 20-25). They differentiate by switching on or 
off the production of specific sets of proteins and regulatory RNAs. They 
produce molecular signals to influence neighboring and distant cells, and 
they respond to signals that other cells deliver to them. They remember 
the effects of previous signals they have received, and so progressively 
become more and more specialized in the characteristics they adopt. 
The genome, identical in virtually every cell, defines the rules by which 
these various cell activities are called into play. Through its operation in 
each cell individually, the genome guides the whole intricate process by 
which a multicellular organism assembles itself, starting from a fertilized 
egg. Movie 1.1, Movie 20.3, and Movie 20.4 offer some visual examples 
of how development unfurls for the embryos of a frog, a fruit fly, and a 
zebrafish, respectively. 


For developmental biologists, the challenge is to explain how genes 
orchestrate the entire sequence of interlocking events that lead from the 
egg to the adult organism. We will not attempt to set out an answer to 
this problem here: we do not have space to do it justice, even though 
a great deal of the genetic and cell-biological basis of development is 
now understood. But the same basic activities that combine to create the 
organism during development continue even in the adult body, where 
fresh cells are continually generated in precisely controlled patterns. It 
is this more limited topic that we discuss in this section, focusing on the 
organization and maintenance of the tissues of adult vertebrates. 


Tissues Are Organized Mixtures of Many Cell Types 


Although the specialized tissues in our body differ in many ways, they 
all have certain basic requirements, usually fulfilled by a mixture of cell 
types, as illustrated for the skin in Figure 20-33. As discussed earlier, all 
tissues need mechanical strength, which is often supplied by a supporting 
framework of connective tissue laid down and inhabited by fibroblasts 
and related cell types. In this connective tissue, blood vessels lined with 
endothelial cells satisfy the need for oxygen, nutrients, and waste dis- 
posal. Likewise, most tissues are innervated by nerve cell axons, which 
are ensheathed by Schwann cells, some of which wrap around large 
axons to provide electrical insulation. Macrophages dispose of dead and 
damaged cells and other unwanted debris, and, together with lympho- 
cytes and other white blood cells, they help combat infection. Most of 
these cell types originate outside the tissue and invade it, either early in 
the course of its development (endothelial cells, nerve cell axons, and 
Schwann cells) or continuously throughout life (macrophages and other 
cells derived from the blood). 


A similar supporting apparatus is required to maintain the principal spe- 
cialized cells of many tissues: the contractile cells of muscle, the secretory 
cells of glands, or the blood-forming cells of bone marrow, for example. 
Almost every tissue is therefore an intricate mixture of many cell types 
that must remain different from one another while coexisting in the same 
environment. Moreover, in almost all adult tissues, cells are continually 
dying and being replaced; throughout this hurly-burly of cell replacement 
and tissue renewal, the organization of the tissue must be preserved. 
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Three main factors contribute to this stability. 


1. 


Cell communication: each type of specialized cell continually monitors 
its environment for signals from other cells and adjusts its behavior 
accordingly; the proliferation and even the survival of most vertebrate 
cells depends on such social signals (discussed in Chapters 16 and 
18). This communication ensures that new cells are produced and 
survive only when and where they are required. 


. Selective cell adhesion: because different cell types have different 


cadherins and other cell adhesion molecules in their plasma membrane, 
they tend to stick selectively, by homophilic binding, to other cells of 
the same type. They may also form selective attachments to certain 
other cell types and to specific extracellular matrix components. The 
selectivity of these cell adhesions keeps cells in their proper positions. 
Cell memory: as discussed in Chapter 8, specialized patterns of 
gene expression, evoked by signals that acted during embryonic 
development, are afterward stably maintained, so that cells 
autonomously preserve their distinctive character and pass it on to 
their progeny. A fibroblast divides to produce more fibroblasts, an 
endothelial cell divides to produce more endothelial cells, and so on. 


Different Tissues Are Renewed at Different Rates 


Human tissues vary enormously in their rate and pattern of cell turnover. 
At one extreme is the intestinal epithelium, in which cells are replaced 
every three to six days. At the other extreme is nervous tissue, in which 
most of the nerve cells last a lifetime without replacement. Between 
these extremes there is a spectrum of different speeds and styles of tis- 
sue renewal. Bone (see Figure 20-8) has a turnover time of about ten 
years, and it involves renewal of the matrix as well as of cells: old bone 


collagen fibers 


Stem Cells and Tissue Renewal 


Figure 20-33 Mammalian skin is made 
of a mixture of cell types. Schematic 
diagrams showing the cellular architecture 
of the main layers of thick skin. Skin can 

be viewed as a large organ composed 

of two main tissues: epithelial tissue (the 
epidermis) on the outside, and connective 
tissue on the inside. The outermost layer 
of the epidermis consists of flat, dead 

cells, whose intracellular organelles have 
disappeared (see Figure 20-36). The 
underlying connective tissue consists of the 
tough dermis (from which leather is made) 
and the deeper, fatty hypodermis. The 
dermis and hypodermis are richly supplied 
with blood vessels and nerves; some of the 
nerves extend into the epidermis, as shown. 
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matrix is slowly eaten away by a set of cells called osteoclasts, akin to 
macrophages, while new matrix is deposited by another set of cells, oste- 
oblasts, akin to fibroblasts. New red blood cells are generated continually 
by blood-forming precursor cells in the bone marrow; they are released 
into the bloodstream, where they recirculate continually for about 120 
days before being removed and destroyed by phagocytic cells in the liver 
and spleen. In the skin, dead cells in the outer layers of the epidermis are 
continually flaking off and being replaced from below, so that the epider- 
mis is renewed with a turnover time of about two months. And so on. 


Our life depends on these renewal processes, as evidenced by our 
response to excessive exposure to radiation. In high enough doses, ion- 
izing radiation blocks cell division and thus halts tissue renewal: within 
a few days, the lining of the intestine, for example, becomes denuded of 
cells, leading to the devastating diarrhea and water loss characteristic of 
acute radiation sickness. 


Clearly, there have to be elaborate control mechanisms to keep cell pro- 
duction and cell loss in balance in the normal, healthy adult body. Cancers 
originate through violation of these controls, allowing rare mutant cells 
in the self-renewing tissues to survive and proliferate prodigiously. To 
understand cancer, therefore, it is important to understand the normal 
social controls on cell turnover that cancer perverts. 


Stem Cells and Proliferating Precursor Cells Generate a 
Continuous Supply of Terminally Differentiated Cells 


Most of the specialized, differentiated cells that need continual replace- 
ment are themselves unable to divide. This is true of red blood cells, the 
epidermal cells in the upper layers of the skin, and the absorptive and 
goblet cells of the gut epithelium. Such cells are referred to as terminally 
differentiated: they lie at the dead end of their developmental pathway. 


The cells that replace the terminally differentiated cells that are lost are 
generated from a stock of proliferating precursor cells, which themselves 
usually derive from a much smaller number of self-renewing stem cells. 
Stem cells are not differentiated and can divide without limit (or at least 
for the lifetime of the animal). When a stem cell divides, though, each 
daughter has a choice: either it can remain a stem cell, or it can embark 
on a course leading to terminal differentiation, usually via a series of 
precursor-cell divisions (Figure 20-34). The job of the stem cells and pre- 
cursor cells, therefore, is not to carry out the specialized function of the 
differentiated cells, but rather to produce cells that will. 


Both stem cells and proliferating precursor cells are usually retained in 
their resident tissue along with their differentiated progeny cells. Stem 
cells are mostly present in small numbers and often have a nondescript 
appearance, making them difficult to spot; in some tissues, specific 
molecular markers can help identify them. Despite being undifferenti- 
ated, stem cells and precursor cells are nonetheless developmentally 
restricted: under normal conditions, they stably express sets of transcrip- 
tion regulators that ensure that their differentiated progeny will be of the 
appropriate cell types. 


Figure 20-34 When a stem cell divides, each daughter can either 
remain a stem cell (self-renewal) or go on to become terminally 
differentiated. The terminally differentiated cells usually develop 

from proliferating precursor cells (sometimes called transit amplifying 
cells) that divide a limited number of times before they terminally 
differentiate. Stem-cell divisions can also produce two stem cells or two 
precursor cells, as long as the pool of stem cells is maintained. 
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Figure 20-35 Renewal occurs continuously in the epithelial lining of the adult mammalian intestine. 

(A) Micrograph of a section of part of the lining of the small intestine showing the villi and crypts. Mucus-secreting 
goblet cells (stained purple) are interspersed among the absorptive brush-border cells in the epithelium covering 
the villi. Smaller numbers of two other secretory cell types—enteroendocrine cells (not visible here), which secrete 
gut hormones, and Paneth cells, which secrete antibacterial proteins—are also present and derive from the same 
stem cells. (B) Drawings showing the pattern of cell turnover and the proliferation of stem cells and precursor cells. 
The stem cells (red) give rise mainly to proliferating precursor cells (yellow), which slide continuously upward and 
terminally differentiate into secretory (purple) or absorptive (blue) cells, which are shed from the tip of the villus. 
The stem cells also give rise directly to terminally differentiated Paneth cells (gray), which move down to the 
bottom of the crypt. 


The pattern of cell replacement varies from one stem-cell-based tissue to 
another. In the lining of the small intestine, for example, the absorptive 
and secretory cells are arranged as a single-layered, simple epithelium 
covering the surfaces of the fingerlike villi that project into the gut lumen. 
This epithelium is continuous with the epithelium lining the crypts, which 
descends into the underlying connective tissue (Figure 20-35A). The 
stem cells lie near the bottom of the crypts, where they give rise mostly 
to proliferating precursor cells, which move upward in the plane of the 
epithelial sheet. As they move upward, the precursor cells terminally dif- 
ferentiate into absorptive or secretory cells, which are shed into the gut 
lumen and die when they reach the tips of the villi (Figure 20-35B). 


A contrasting example is the epidermis, a stratified epithelium. In the epi- 
dermis, proliferating stem cells and precursor cells are confined to the 
basal layer, adhering to the basal lamina. The differentiating cells travel QUESTION 20-7 

outward from their site of origin in a direction perpendicular to the plane 

of the cell sheet; terminally differentiated cells and their corpses are Why do you suppose epithelial cells 
eventually shed from the skin surface (Figure 20-36). lining the gut are lost and replaced 
(renewed) frequently, whereas most 


neurons last for the lifetime of the 
organism? 


Often, a single type of stem cell gives rise to several types of differentiated 
progeny: the stem cells of the intestine, for example, produce absorptive 
cells, goblet cells, and several other secretory cell types. The process of 
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Figure 20-36 The epidermis of the skin 
is a stratified epithelium renewed from 
stem cells in its basal layer. (A) The basal 
layer contains a mixture of stem cells and 
dividing precursor cells that are produced 
from the stem cells. On emerging from 
the basal layer, the precursor cells stop 
ividing and move outward, progressively 
ifferentiating as they go. Eventually, 

the cells undergo a special form of cell 
death: the nucleus and other organelles 
disintegrate, and the cell shrinks to the form 
of a flattened scale, packed with keratin 
filaments. These scales are ultimately shed 
from the skin surface. (B) Light micrograph 
of a cross section through the sole of a 
human foot, stained with hematoxylin 

and eosin. 
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blood-cell formation, or hemopoiesis, provides an extreme example of 
this phenomenon. All of the different cell types in the blood—both the red 
blood cells that carry oxygen and the many types of white blood cells that 
fight infection (Figure 20-37)—ultimately derive from a shared hemopoi- 
etic stem cell found in the bone marrow (Figure 20-38). 


Specific Signals Maintain Stem-Cell Populations 


Every stem-cell system requires control mechanisms to ensure that new 
differentiated cells are generated in the appropriate places and in the 
right numbers. The controls depend on extracellular signals exchanged 
between the stem cells, their progeny, and other cell types. These signals, 
and the intracellular signaling pathways they activate, fall into a sur- 
prisingly small number of families, corresponding to half-a-dozen basic 
signaling mechanisms, some of which are discussed in Chapter 16. These 
few mechanisms are used again and again—in different combinations— 
evoking different responses in different contexts in both the embryo and 
the adult. 


Almost all these signaling mechanisms contribute to the task of main- 
taining the complex organization of a stem-cell system such as that of 
the intestine. In this system, a class of signal molecules known as the 
Wnt proteins serves to promote the proliferation of the stem cells and 
precursor cells at the base of each intestinal crypt (Figure 20-39). Cells 
in the crypt produce, in addition, other signals that act at longer range 
to prevent activation of the Wnt pathway outside the crypts. The crypt 
cells also exchange yet other signals that control cell diversification, so 
that some precursor cells differentiate into secretory cells while others 
become absorptive cells. 


Figure 20-37 Blood contains many circulating cell types, all derived 
from a single type of stem cell. A sample of blood is smeared onto 

a glass cover slip, fixed (see Panel 1-1, p. 12), and stained with a dye 
that mainly stains the nucleus blue and cytoplasm red. Microscopic 
examination reveals numerous small erythrocytes (red blood cells), 
which lack a nucleus and DNA. The nucleated cells are different types 
of white blood cell: lymphocytes, eosinophils, basophils, neutrophils, 
and monocytes. Blood smears of this kind are routinely used as a 
clinical test in hospitals to look for increases or decreases in specific 
types of blood cells; for example, an increase in specific types of white 
blood cells could signal infection, inflammatory disorders, or leukemia. 
(Courtesy of Peter Takizawa.) 
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Disorders of these signaling mechanisms disrupt the structure of the 
gut lining. In particular, as we see later, defects in the regulation of Wnt 
signaling underlie colorectal cancer—the commonest forms of human 
intestinal cancer. 


Stem Cells Can Be Used to Repair Lost or Damaged 
Tissues 


Because stem cells can proliferate indefinitely and produce progeny that 
differentiate, they provide for both continual renewal of normal tissue 
and repair of tissue lost through injury. For example, by transfusing a few 
hemopoietic stem cells into a mouse whose own blood stem cells have 
been destroyed by irradiation, it is possible to fully repopulate the ani- 
mal with new blood cells and ultimately rescue it from death by anemia, 
infection, or both. A similar approach is used in the treatment of human 
leukemia with irradiation (or cytotoxic drugs) followed by bone marrow 
transplantation. 


Although stem cells taken directly from adult tissues such as bone mar- 
row have already proven their clinical value, another type of stem cell, 
first identified through experiments in mice, may have even greater poten- 
tial—both for treating and understanding human disease. It is possible, 
through cell culture, to derive from early mouse embryos an extraordi- 
nary class of stem cells called embryonic stem cells, or ES cells. Under 
appropriate conditions, these cells can be kept proliferating indefinitely 
in culture and yet retain unrestricted developmental potential, and are 
thus said to be pluripotent: if the cells from the culture dish are put back 
into an early embryo, they can give rise to all the tissues and cell types in 
the body, including the reproductive germ-line cells. Their descendants 
in the embryo are able to integrate perfectly into whatever site they come 
to occupy, adopting the character and behavior that normal cells would 
show at that site. Such an approach can be used to study gene func- 
tion: in this case, the ES cells are genetically manipulated—to inactivate a 
gene or insert a modified one—prior to being returned to an embryo (see 
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Figure 20-38 A hemopoietic stem cell 
divides to generate more stem cells, 

as well as various types of precursor 

cells (not shown) that proliferate and 
differentiate into the mature blood cell 
types found in the circulation. Note that 
monocytes give rise to both macrophages, 
which are found in many tissues of the 
body, and osteoclasts, which eat away bone 
matrix. Megakaryocytes give rise to blood 
platelets by shedding cell fragments (Movie 
20.5). A large number of extracellular signal 
molecules are known to act at various 
points in this cell lineage to help control the 
production of each cell type and to maintain 
appropriate numbers of precursor cells and 
stem cells. 
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Figure 20-39 The Wnt signaling pathway 
maintains the proliferation of the stem 
cells and precursor cells in the intestinal 
crypt. The Wnt proteins are secreted 

by cells in and around the crypt base, 
especially by the Paneth cells—a subclass 
of terminally differentiated secretory cells 
that are generated from the gut stem cells. 
Newly formed Paneth cells, which move 
down to the crypt bottom instead of up to 
the tip of the villus, have a dual function: 
they secrete antimicrobial peptides to keep 
infection at bay, and at the same time they 
provide the signals to sustain the stem-cell 
population. 
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Figure 20-40 Mouse ES cells can be 
induced to differentiate into specific cell 

types in culture. ES cells are harvested e 
from the inner cell mass of an early mouse 
embryo and can be maintained indefinitely 
as pluripotent stem cells in culture. If they 
are allowed to aggregate (not shown) 

and are then exposed to the appropriate 
extracellular signal molecules, in the correct 
sequence and at the right time, these cells 
can be induced to differentiate into specific 
cell types of interest (Movie 20.6). (Based 
on data from E. Fuchs and J.A. Segré, Cell 
100:143-155, 2000.) 
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Figure 10-28). ES cells can also be induced, by the appropriate extracel- 
lular signal molecules, to differentiate in culture into a large variety of cell 
types (Figure 20-40). 


Cells with properties similar to those of mouse ES cells can also be derived 
from early human embryos, and these cells can be induced to differenti- 
ate into a variety of cell types as illustrated in Figure 20-40. In principle, 
human ES cells provide a potentially inexhaustible supply of cells that 
might be used for the replacement or repair of mature human tissues that 
are damaged. For example, experiments in mice suggest that it should be 
possible to use cells derived from human ES cells to replace the skeletal 
muscle fibers that degenerate in victims of muscular dystrophy, the nerve 
cells that die in patients with Parkinson’s disease, the insulin-secreting 
cells that are destroyed by the immune system in type 1 diabetics, and 
the cardiac muscle cells that die during a heart attack. Perhaps one day it 
might even become possible to grow entire organs from human ES cells 
by a recapitulation of embryonic development, as we discuss shortly. 


There are, however, many hurdles to be cleared before such dreams can 
become reality. One major problem concerns immune rejection: if the 
transplanted cells are genetically different from the cells of the patient 
into whom they are grafted, they are likely to be rejected and destroyed 
by the immune system. Beyond the practical scientific difficulties, there 
have been ethical concerns about the use of human embryos to produce 
human ES cells. One way around both of these problems is to generate 
human pluripotent cells in another way, as we now discuss. 


Induced Pluripotent Stem Cells Provide a Convenient 
Source of Human ES-like Cells 


It is now possible to produce pluripotent stem cells without the use 
of embryos. Differentiated cells can be taken from an adult mouse or 
human tissue, grown in culture, and reprogrammed into an ES-like state 
by artificially driving the expression of a set of three or four transcription 


regulators, including Oct4, Sox2, and KIf4. This treatment is sufficient 
to permanently convert fibroblasts into cells with practically all the 
properties of ES cells, including the ability to proliferate indefinitely, dif- 
ferentiate in diverse ways, and, in the case of mouse cells, contribute 
to the formation of any tissue. These ES-like cells are called induced 
pluripotent stem cells (iPS cells). The drawbacks to this approach 
include a low conversion rate—only a small proportion of the fibroblasts 
make the switch to become iPS cells—and concerns over the safety 
of implanting cells with such an abnormal developmental history into 
humans. Much work remains to be done before this approach can be 
used to treat human diseases effectively and ethically. 


Better ways of producing human iPS cells are continually being devel- 
oped. In the meantime, human ES cells, and especially human iPS cells, 
are proving to be valuable in other ways. They can be used to generate 
large, homogeneous populations of differentiated human cells of specific 
types in culture; these can be used to test for potential toxic or beneficial 
effects of candidate drugs. Moreover, it is possible to generate iPS cells 
from patients who suffer from a genetic disease and to use these iPS cells 
to produce affected, differentiated cell types, which can then be studied 
to learn more about the disease mechanism and to search for poten- 
tial treatments. An example is Timothy syndrome, a rare genetic disease 
caused by mutations in a gene that encodes a specific type of Ca2* chan- 
nel. The altered channel fails to close properly after opening, leading to 
multiple defects, including abnormal heart rhythm and, in some individ- 
uals, autism. The iPS cells produced from such individuals have been 
coaxed to differentiate in culture into neurons and heart muscle cells, 
which are now being used to study the physiological consequences of 
the Ca?+ channel abnormality and to hunt for drugs that can correct the 
defects. 


In addition, experiments on pluripotent ES and iPS cells themselves are 
providing insights into some of the many unsolved mysteries of devel- 
opmental and stem-cell biology, including the molecular mechanisms 
that maintain pluripotency and those that restrict specific developmental 
fates. 


Mouse and Human Pluripotent Stem Cells Can Form 
Organoids in Culture 


Remarkably, under appropriate conditions, mouse or human ES cells 
and iPS cells can proliferate, differentiate, and self-assemble in culture 
to form miniature, three-dimensional organs called organoids, which 
closely resemble the normal organ in its organization. An early striking 
example is shown in Figure 20-41, where a developing eye-like structure 
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Figure 20-41 Cultured ES cells can give 
rise to a three-dimensional organoid. 

(A) Schematic drawing shows how, under 
appropriate conditions, mouse or human 
pluripotent cells in culture can proliferate, 
differentiate, and self-assemble to form a 
three-dimensional, eye-like structure (an 
optic cup), which includes a multilayered 
retina similar in organization to the one 
that forms during normal eye development 
in vivo. (B) Fluorescence micrograph of 

an optic cup formed by human ES cells in 
culture. The structure includes a developing 
retina containing multiple layers of neural 
cells (stained green) and an underlying layer 
of pigmented epithelium, the apical surface 
of which is stained red. All nuclei are stained 
blue. (A, adapted from M. Eiraku and Y. 
Sasai, Curr. Opin. Neurobiol. 22:768-777, 
2012; B, adapted from T. Nakano et al., Cell 
Stem Cell 10:771-785, 2012.) 
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is formed from human ES cells, including a multilayered retina similar in 
organization to that seen in the developing human eye in vivo. 


Mouse and human iPS cells, and precursor cells derived from them, have 
now been used to form organoids that resemble a variety of develop- 
ing organs, including the human brain, arguably the most complex and 
sophisticated structure on Earth. Such organoids provide powerful mod- 
els for studying organ development in a culture dish, where one can 
identify and manipulate the genes involved and explore the roles of cell- 
cell interactions in ways not possible in an intact organism. In addition, 
organoids can be used to investigate how developmental pathways can 
be derailed by disease. For example, brain organoids have been produced 
using human iPS cells derived from an individual with microcephaly, a 
condition characterized by severely stunted brain growth and develop- 
ment. Careful analysis of these developing brain organoids revealed that 
the microcephaly in this case was probably caused by the premature 
cessation of proliferation and differentiation of the brain precursor cells, 
resulting in a decreased production of brain cells. 


The development of iPS cells and organoid technology has opened up an 
entirely new way to study human development and disease. It also opens 
up promising avenues for treatment. 


CANCER 


Humans pay a price for having tissues that can renew and repair them- 
selves. The delicately balanced mechanisms that control these processes 
can break down, leading to catastrophic disruption of tissue structure. 
Foremost among the diseases of tissue renewal is cancer, which stands 
alongside infection, malnutrition, war, and heart disease as a major 
cause of death in human populations. In Europe and North America, for 
example, at least one in five of us will die of cancer. 


Cancer arises from violations of the basic rules of social cell behavior. To 
make sense of the origins and progression of the disease, and to devise 
treatments, we have to draw upon almost every part of our knowledge 
of how cells work and interact in tissues. In this section, we examine 
the causes and mechanisms of cancer, the types of cell misbehavior that 
contribute to its progress, and the ways in which we hope to use our 
understanding to defeat these misbehaving cells and, hence, the disease. 
Although there are many types of cancer, each with distinct properties, 
we will refer to them collectively by the umbrella term “cancer,” as they 
are united by certain common principles and abnormalities. 


Cancer Cells Proliferate Excessively and Migrate 
Inappropriately 


As tissues grow and renew themselves, each individual cell must adjust 
its behavior according to the needs of the organism as a whole. The cell 
must divide only when new cells of that type are needed, and refrain from 
dividing when they are not; it must live as long as it is needed, and be 
removed when it is not; it must maintain its specialized character; and it 
must occupy its proper place and not stray into inappropriate territories. 


In a large organism, no significant harm is done if an occasional single 
cell misbehaves. But a potentially devastating breakdown of order occurs 
when a single cell suffers genetic alterations that allow it to survive and 
divide when it should not, producing daughter cells that behave in the 
same antisocial way. Such a relentlessly expanding clone of abnormal 
cells can disrupt the organization of the tissue, and eventually that of the 
body as a whole. It is this catastrophe that occurs in cancer. 
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Cancer cells are defined by two heritable properties: they and their prog- 
eny (1) proliferate in defiance of the normal constraints and (2) invade 
and colonize territories normally reserved for other cells (Movie 20.7). 
It is the combination of these socially deviant features that creates 
the lethal danger. Cells that have the first property but not the second 
proliferate excessively but remain clustered together in a single mass, 
forming a tumor. Such a tumor is said to be benign, and it can usually be 
removed cleanly and completely by surgery. A tumor is cancerous only if 
its cells have the ability to invade surrounding tissue, in which case the 
tumor is said to be malignant. Malignant tumor cells with this invasive 
property often break loose from the primary tumor and enter the blood- 
stream or lymphatic vessels, from where they form secondary tumors, or 
metastases, at other sites in the body (Figure 20-42). The more widely 
the cancer spreads, the harder it is to eradicate. 


Epidemiological Studies Identify Preventable Causes of 
Cancer 


Prevention is always better than cure, but to prevent cancer we need to 
know what causes it. Do factors in our environment or features of our 
way of life trigger the disease and help it to progress? If so, what are they? 
Answers to these questions come mainly from epidemiology—the statisti- 
cal analysis of human populations in search of factors that correlate with 
disease incidence. This approach has provided strong evidence that the 
environment plays an important part in the causation of most cases of 
human cancer. The types of cancers that are common, for example, vary 
from country to country, and studies of migrants show that it is usually 
where people live, rather than where they were born, that governs their 
cancer risk. 


Although it is still hard to discover which specific factors in the environ- 
ment or lifestyle are significant, and many remain unknown, some have 
been precisely identified. For example, it was noted long ago that cervi- 
cal cancer, which arises in the epithelium lining the cervix (neck) of the 
uterus, was much more common in women who were sexually active 
than in those who were not, suggesting a cause related to sexual activity. 
We now know that most cases of cervical cancer depend on infection of 
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Figure 20-42 Cancers invade surrounding 
tissues and often metastasize to distant 
sites. (A) To give rise to a colony ina 

new site—called a secondary tumor or 
metastasis—the cells of a primary tumor 

in an epithelium must typically cross the 
protective barrier provided by the basal 
lamina (yellow), migrate through connective 
tissue (blue), and get into either blood or 
lymphatic vessels. They then have to exit 
from the bloodstream or lymph and settle, 
survive, and proliferate in a new location 
(not shown). (B) Secondary tumors in a 
human liver, originating from a primary 
tumor in the colon. (C) Higher-magnification 
view of one of the secondary tumors, 
stained differently to show the contrast 
between the normal liver cells and the 
cancer cells. (B and C, courtesy of Peter 
Isaacson.) 
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the cervical epithelium with certain subtypes of a common virus called 
human papillomavirus. These viruses are transmitted through sexual 
intercourse and can sometimes, if one is unlucky, provoke uncontrolled 
proliferation of the infected cells. Knowing this, we can attempt to pre- 
vent the cancer by preventing the infection—for example, by vaccination 
against the relevant papillomaviruses. Such a vaccine is now available, 
conferring a high level of protection if given to young people before they 
become sexually active. 


In the great majority of human cancers, however, viruses do not appear 
to play a part: as we will see, cancer is not an infectious disease. But epi- 
demiology reveals that other factors increase the risk of cancer. Obesity 
is one such factor. Smoking tobacco is another: tobacco smoke is not 
only responsible for the great majority of lung cancer cases, but it also 
raises the incidence of several other cancers, such as those of the blad- 
der. By stopping the use of tobacco, we could prevent about 30% of all 
cancer deaths. No other single policy or treatment is known that would 
have such a dramatic impact on the number of cancer deaths. 


As we will explain, although environmental factors affect the incidence of 
cancer and are critical for some forms of the disease, it would be wrong 
to conclude that they are the only cause of cancers. No matter how hard 
we try, healthy living alone will not reduce our risk of cancer to zero. 


Cancers Develop by an Accumulation of Somatic 
Mutations 


Cancer is fundamentally a genetic disease: it arises as a consequence of 
pathological changes in the information carried by DNA. It differs from 
other genetic diseases in that the mutations underlying cancer are mainly 
somatic mutations—those that occur in individual somatic cells of the 
body—as opposed to germ-line mutations, which are handed down via 
the germ cells from which the entire multicellular organism develops. 


Most of the identified agents known to contribute to the causation of 
cancer, including ionizing radiation and most chemical carcinogens, are 
mutagens: they cause changes in the nucleotide sequence of DNA. But 
even in an environment that is free of tobacco smoke, radioactivity, and 
all the other external mutagens that worry us, mutations will occur spon- 
taneously as a result of the fundamental limitations on the accuracy of 
DNA replication and DNA repair (discussed in Chapter 6). In fact, environ- 
mental carcinogens other than tobacco smoke probably account for only 
a small fraction of the mutations responsible for cancer, and elimination 
of all these external risk factors would still leave us prone to the disease. 


Although DNA is replicated and repaired with great accuracy, an average 
of one mistake slips by for every 10? or 10!° nucleotides copied, as we 
discuss in Chapter 6. This means that spontaneous mutations occur at an 
estimated rate of about 10 or 1077 mutations per gene per cell division, 
even without encouragement by external mutagens. About 10!¢ cell divi- 
sions take place in a human body in the course of an average lifetime; 
thus, every single gene is likely to have acquired a mutation on more 
than 10? separate occasions in any individual. From this point of view, 
the problem of cancer seems to be not why it occurs, but why it occurs 
so infrequently. 


The explanation is that most mutations do not contribute to cancer—even 
though they may happen to be found in a cancer cell; these mutations are 
called passenger mutations. At the same time, for those mutations that do 
promote cancer, a single mutation is generally not sufficient. Precisely 
how many of these cancer-critical, or driver, mutations are required is still 
a matter of debate, but for most full-blown cancers it could be at least 
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Figure 20-43 Cancer incidence increases dramatically with age. 180 
The number of newly diagnosed cases of cancer of the colon in 
women in England and Wales in one year is plotted as a function of 
age at diagnosis. Colon cancer, like most human cancers, is caused by 
the accumulation of multiple mutations. Because cells are continually 
experiencing accidental changes to their DNA—which accumulate and 140 
are passed on to progeny cells when the mutated cells divide—the 
chance that a cell will become cancerous increases greatly with age. 
(Data from C. Muir et al., Cancer Incidence in Five Continents, Vol. V. 
Lyon: International Agency for Research on Cancer, 1987.) 
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mutations do not all occur at once but occur sequentially, usually over a 
period of many years. 


cancer incidence rate per 100,000 


Cancer, therefore, is most often a disease of old age, because it takes a 
long time for an individual clone of cells—those derived from a common 
founder—to accumulate a sufficient number of cancer-critical muta- 
tions (Figure 20-43). Many human cancer cells are able to speed up this 
acquisition of mutations because they are also genetically unstable. This o0 20 30 40 50 60 70 80 
genetic instability results from mutations that interfere with the accu- age (years) 
rate replication and maintenance of the genome and thereby increase the 

rate at which mutations accumulate. Sometimes, the increased mutation 

rate may result from a defect in one of the many proteins needed to repair 

damaged DNA or to correct errors in DNA replication. Sometimes, there 

may be a defect in the cell-cycle checkpoint mechanisms that normally 

prevent a cell with damaged DNA from attempting to divide before the 

damage has been fully repaired (discussed in Chapter 18). Sometimes, 

there may be a fault in the machinery of mitosis, which can lead to chro- 

mosomal damage, loss, or gain; an abnormal chromosome number can 

then, itself, increase mitotic errors. These potential sources of genetic 

instability are summarized in Table 20-1. 


Genetic instability can produce chromosome breaks and rearrange- 
ments—even complete chromosome duplications. Such gross abnormal- 
ities can often be seen in a karyotype of the cancer cell (Figure 20-44). 


Cancer Cells Evolve, Acquiring an Increasing 
Competitive Advantage 


The mutations that lead to cancer do not cripple the mutant cells. On the 
contrary, they give these cells a competitive advantage over their neigh- 
bors. It is this advantage enjoyed by the mutant cells that leads to disaster 
for the organism as a whole. As an initial population of mutant cells 
grows, it slowly evolves: new chance mutations occur, some of which are 
favored by natural selection because they enhance cell proliferation, cell 
survival, or both. This process of random mutation followed by selec- 
tion culminates in the genesis of cancer cells that run riot within the 
population of cells that form the body, upsetting its regular structure TABLE 20-1 A VARIETY OF 


(Figure 20-45). FACTORS CAN CONTRIBUTE TO 


Certain environmental or lifestyle factors, such as obesity, may further GENETIC INSTABILITY 
favor the development of cancer by altering the selection pressures that Defects in DNA replication 
operate in tissues. A glut of circulating nutrients, or abnormal increases 
in hormones, survival factors, mitogens, or growth factors, for example, 
may help cells with dangerous mutations to survive, grow, and prolifer- Defects in cell-cycle checkpoint 
ate. Eventually, cells emerge that have all the abnormalities required for mechanisms 

full-blown cancer. 


Defects in DNA repair 


Mistakes in mitosis 
To be successful, a cancer cell must acquire a whole range of abnor- 


s . : ; ; ; Abnormal chromosome numbers 
mal properties—a collection of subversive behaviors. A proliferating 
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Figure 20-44 Cancer cells often have highly abnormal chromosomes, reflecting genetic instability. Shown 
here are karyotypes displaying the chromosomes of (A) a normal human cell and (B) a breast cancer cell. The 
chromosomes are “painted” with a combination of fluorescent stains that give each chromosome a different 
color. The breast cancer karyotype shows multiple translocations, including two instances of a translocation 

of material from chromosome 6 (red) to chromosome 4 (light blue), one of which also includes a piece of 
chromosome 1 (yellow). Whereas the normal cell contains 46 chromosomes, the breast cancer cell has 51; several 
of its chromosomes are missing, and it has an extra copy of a handful of chromosomes—along with 6 copies of 
chromosome 19. Such abnormalities in chromosome number can cause chromosome-segregation errors when the 
cell divides, so that the degree of genetic disruption goes from bad to worse over time (see Table 20-1). (Courtesy 
of Mira Grigorova and Paul Edwards.) 
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precursor cell in the epithelial lining of the gut, for example, must undergo 
changes that permit it to carry on dividing when it would normally stop 
(see Figure 20-35). That cell and its progeny must also be able to avoid 
cell death, displace their normal neighbors, and attract a blood supply 
to nourish continued tumor growth (Movie 20.9). For the tumor cells to 
then become invasive, they must be able to detach from the epithelial 
sheet and digest their way through the basal lamina into the underlying 
connective tissue. To spread to other organs and form metastases, they 
must be able to get in, and then out, of blood or lymph vessels and settle, 
survive, and proliferate in new sites (see Figure 20-42). 


Different cancers display different combinations of properties. 
Nevertheless, we can draw up a general list of characteristics that distin- 
guish cancer cells from normal cells. 


1. Cancer cells have a reduced dependence on signals from other cells 
for their survival, growth, and division. Often, this is because they 
contain mutations in components of the cell signaling pathways that 
normally respond to such stimuli. An activating mutation in a Ras 
gene (discussed in Chapter 16), for instance, can cause an intracellular 
signal for proliferation even in the absence of the extracellular cue 
that would normally be needed to turn Ras on—like a faulty doorbell 
that rings even when nobody is pressing the button. 


Figure 20-45 Tumors evolve by repeated rounds of mutation, 
proliferation, and natural selection. The final outcome is a fully 
malignant tumor. At each step, a single cell undergoes a mutation 
that enhances its ability to proliferate, or survive, or both, so that its 
progeny become a dominant clone in the tumor. Proliferation of this 
clone then hastens occurrence of the next step of tumor progression 
by increasing the size of the cell population at risk of undergoing an 
additional mutation. Some cancers contain multiple malignant clones, 
each with its own collection of mutations, in addition to a common set 
of mutations that reflect the tumor’s origin from a founding mutant cell 
(not shown). 


2. Cancer cells can survive levels of stress and internal derangement 
that would cause normal cells to kill themselves by apoptosis. This 
avoidance of cell suicide is often the result of mutations in genes that 
regulate the intracellular death program responsible for apoptosis 
(discussed in Chapter 18). For example, about 50% of all human 
cancers have an inactivating mutation in the p53 gene. The p53 
protein normally acts as part of a DNA damage response that causes 
cells with DNA damage to either cease dividing (see Figure 18-15) or 
die by apoptosis. Chromosome breakage, for example, if not repaired, 
will generally cause a cell to commit suicide; but if the cell is defective 
in p53, it may survive and divide, creating highly abnormal daughter 
cells that have the potential for further mischief (Movie 20.8). 


3. Unlike most normal human cells, cancer cells can often proliferate 
indefinitely. Most normal human somatic cells will only divide a 
limited number of times in culture, after which they permanently 
stop; this cessation of proliferation—called cell senescence—occurs, 
at least in part, because many cells in the adult organism lose the 
ability to produce the enzyme telomerase. The telomeres at the ends of 
their chromosomes thus become progressively shorter with each cell 
division (see Figure 6-22). In cultured cells, this erosion of telomerase 
sequences can activate a DNA damage response that permanently 
halts cell proliferation. Cancer cells typically break through this 
proliferation barrier by reactivating production of telomerase, enabling 
them to maintain telomere length indefinitely. 


4. Most cancer cells are genetically unstable, with a greatly increased 
mutation rate and an abnormal number of chromosomes (see Table 
20-1 and Figure 20-44). 


5. Cancer cells are abnormally invasive, at least partly because they 
often lack certain cell adhesion molecules, such as cadherins, that 
help hold normal cells in their proper place. 

6. Cancer cells are abnormally avid for nutrients, which they use to 
generate much of their ATP by glycolysis in the cell cytosol. This process 
is less efficient than producing ATP by oxidative phosphorylation in 
the mitochondria, but is useful for fast-growing tumors in which cells 
in the interior are often oxygen-deprived. 

7. Cancer cells can survive and proliferate in abnormal locations, 
whereas most normal cells die when misplaced. This colonization 
of unfamiliar territory may result from the ability of cancer cells to 
produce their own extracellular survival signals and to suppress their 
apoptosis program (as described in #2, above). 

8. As cancer cells evolve, they secrete signals that influence the behavior 
of cells in the surrounding connective tissue, thereby modifying the 
tumor’s microenvironment. Cells in the remodeled microenvironment, 
in return, produce signals that support the survival and proliferation 
of the cancer cells, which renders the microenvironment even more 
hospitable for tumor growth. 

To understand these abnormal properties of cancer cells, we have to 

identify the mutations responsible. 


Two Main Classes of Genes Are Critical for Cancer: 
Oncogenes and Tumor Suppressor Genes 


Investigators have made use of a variety of approaches to track down the 
genes and mutations that are critical for cancer—from studying viruses 
that cause cancer in chickens to following families in which a particular 
cancer occurs unusually often. Though many of the most important of 
these genes have now been identified, the hunt for others continues. 


For many of the cancer-critical genes, the most dangerous mutations are 
ones that render the encoded protein hyperactive. These gain-of-function 
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QUESTION 20-8 


About 1016 cell divisions take place 
in a human body during a lifetime, 
yet an adult human body consists 
of only about 1013 cells. How can 
you reconcile these apparently 
conflicting two numbers? 
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Figure 20-46 Genes that are critical (A) dominant mutation (gain of function) 
for cancer are classified as proto- 
oncogenes or tumor suppressor 


genes, according to whether the MUTATION TATON IN REATES ONCOGENE -ONCOGENE 
dangerous mutations are dominant — 
or recessive. (A) Oncogenes act in a 


dominant manner: a gain-of-function proto-oncogene 
mutation in a single copy of the proto- 
oncogene can drive a cell toward 
cancer. (B) Loss-of-function mutations 
in tumor suppressor genes generally 
act in a recessive manner: the function 
of both copies of the gene must be SECOND MUTATION 
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mutations have a dominant effect: only one gene copy needs to be 
mutated to promote the development of cancer. The resulting mutant 
gene is called an oncogene, and the corresponding normal form of the 
gene is called a proto-oncogene (Figure 20-46A). Figure 20-47 shows 
a variety of ways in which a proto-oncogene can be converted into its 
corresponding oncogene. 


For other cancer-critical genes, the danger lies in mutations that destroy 
their activity. These loss-of-function mutations are generally recessive: 
both copies of the gene must be lost or inactivated to contribute to can- 
cer development; the normal gene is called a tumor suppressor gene 
(Figure 20-46B). In addition to such genetic alterations, tumor suppres- 
sor genes can also be silenced by epigenetic changes, which alter gene 
expression without changing the gene’s nucleotide sequence (as dis- 
cussed in Chapter 8). Epigenetic changes are thought to silence some 
tumor suppressor genes in most human cancers. Figure 20-48 highlights 
a few of the ways in which the activity of a tumor suppressor gene can 
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Figure 20-47 Several kinds of gain-of-function mutations can convert a proto-oncogene into an oncogene. In each case, the 
change leads to an increase in the gene's function. 
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Proto-oncogenes and tumor suppressor genes code for proteins of many 
different types, contributing to the many kinds of misbehavior that cancer 
cells display. Some of these proteins are involved in signaling pathways 
that regulate cell survival, cell growth, cell division, or some combination 
of these. Others take part in DNA repair, help mediate the DNA damage 
response, modify chromatin, or help regulate the cell cycle or apopto- 
sis. Still others (such as cadherins) are involved in cell adhesion or other 
properties critical for metastasis, or have roles that we do not yet prop- 
erly understand. 


Cancer-critical Mutations Cluster in a Few Fundamental 
Pathways 


From the point of view of a cancer cell, proto-oncogenes and tumor sup- 
pressor genes—and the mutations that affect them—are flip sides of the 
same coin. Activation of a proto-oncogene and inactivation of a tumor 
suppressor gene can both promote the development of cancer. And both 
types of mutations contribute to the development of most cancers. In 
classifying cancer-critical genes, it seems that the type of mutation— 
gain-of-function or loss-of-function—matters less than the pathway in 
which it acts. 


Today, rapid, low-cost DNA sequencing is providing an unprecedented 
amount of information about the mutations that drive a variety of can- 
cers. We can now compare the complete genome sequences of the cancer 
cells from a patient's tumor to the genome sequence of the noncancer- 
ous cells in the same individual—or of cancer cells that have spread to 
another location in the body. By putting together such data from many 
different patients, we can begin to draw up exhaustive lists of the genes 
that are critical for specific classes of cancer. And by analyzing data from 
a single patient, we can deduce the “family tree” of his or her cancer cells, 
showing how the progeny of the original founder cell have evolved and 
diversified as they multiplied and metastasized to different sites. 


One remarkable finding has been that many of the driver mutations in 
individual tumors affect genes that fall into a small number of key reg- 
ulatory pathways: those that govern cell proliferation, cell growth, cell 
survival, and the cell's response to DNA damage and stress. For example, 
in almost every case of glioblastoma—the most common type of human 
brain cancer—multations disrupt all four of these fundamental pathways, 
and the same pathways are subverted, in one way or another, in almost 
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Figure 20-48 Several kinds of genetic 
events can eliminate the activity of 

a tumor suppressor gene. Note that 
both copies of such a gene must be 

lost to eliminate its function. (A) A cell 

in which the maternal copy of the tumor 
suppressor gene is inactive because of a 
loss-of-function mutation; this cell is one 
step away from a complete loss of this 
tumor-suppressor function. (B) Cells in 
which the paternal copy of the gene is also 
inactivated in different ways, as shown. 
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Figure 20-49 A small number of key 
regulatory pathways are perturbed in 
almost all human cancers. These pathways 
regulate cell proliferation, cell growth, cell 
survival, and the cell's response to DNA 
damage or stress. 
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all human cancers (Figure 20-49). In any given patient, only a single gene 
tends to be mutated in each pathway, but not always the same gene: it 
is the under- or overactivity of the pathway as a whole that matters for 
cancer development, not the way in which this malfunction is achieved. 
Because the same fundamental control systems are the targets of muta- 
tion in so wide a variety of cancers, it seems that their misregulation 
must be crucial to most cancers’ success. 


Colorectal Cancer Illustrates How Loss of a Tumor 
Suppressor Gene Can Lead to Cancer 


Colorectal cancer provides one well-studied example of how a tumor 
suppressor gene can be identified and its role in tumor growth deter- 
mined. Colorectal cancer arises from the epithelium lining the colon and 
rectum; most cases are seen in old people and do not have any discern- 
ible hereditary cause. A small proportion of cases, however, occurs in 
families that are exceptionally prone to the disease and show an unusu- 
ally early onset. In one set of such “predisposed” families, the affected 
individuals develop colorectal cancer in early adult life, and the onset of 
their disease is foreshadowed by the development of hundreds or thou- 
sands of little tumors, called polyps, in the epithelial lining of the colon 
and rectum. 


By studying these families, investigators traced the development of the 
polyps to a deletion or inactivation of a tumor suppressor gene called 
APC—for Adenomatous Polyposis Coli. (Note that the protein encoded by 
this gene is different from the anaphase-promoting complex, abbreviated 
APC/C, discussed in Chapter 18.) Affected individuals inherit one mutant 
copy of the gene and one normal copy. Although one normal gene copy 
is enough for normal cell behavior, all the cells of these individuals are 
only one mutational step away from total loss of the gene’s function (as 
compared to two steps away for a person who inherits two normal copies 
of the gene). The individual tumors arise from cells that have undergone 
a somatic mutation that inactivates the remaining good copy of APC (see 
Figure 20-46B). Not surprisingly, the disease strikes these individuals at 
an earlier age than it does in individuals with two good copies of APC. 


But what about the great majority of colorectal cancer patients, who do 
not have the hereditary condition or any significant family history of can- 
cer? When their tumors are analyzed, it turns out that in more than 60% 
of cases, although both copies of APC are present in the adjacent normal 
tissue, the tumor cells themselves have lost or inactivated both copies of 
this gene (see Figure 20-48B). 


All these findings clearly identify APC as a tumor suppressor gene and, 
knowing its sequence and mutant phenotype, one can begin to decipher 
how its loss helps to initiate the development of cancer. As explained 
in How We Know (pp. 730-731), the APC gene was found to encode an 
inhibitory protein that normally restricts the activation of the Wnt sign- 
aling pathway, which is involved in stimulating cell proliferation in the 
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Figure 20-50 Colorectal cancer often begins with the 
inactivation of both copies of the tumor suppressor gene APC, 
leading to growth of a polyp. (A) Thousands of small polyps, and 
a few much larger ones, are seen in the lining of the colon of a 
patient with an inherited APC mutation (whereas individuals 
without an APC mutation might have one or two polyps). 

Such polyps arise from cells in which both copies of APC are 
inactivated. Through additional mutations in other tumor 
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crypts of the gut lining, as described earlier (see Figure 20-39). When APC 
function is lost, the pathway is hyperactive and epithelial cells proliferate 
to excess, generating a polyp (Figure 20-50). Within this growing mass 
of tissue, further driver mutations occur, sometimes resulting in invasive 
cancer (Figure 20-51). 


The effect of mutations in a variety of tumor suppressor genes and proto- 
oncogenes, including those involved in colorectal cancer, is presented in 
Table 20-2. 


An Understanding of Cancer Cell Biology Opens the 
Way to New Treatments 


The nature of the defects that promote the survival, proliferation, and 
spread of cancer cells makes the development of effective treatment 
strategies particularly challenging. Because cancer cells are highly muta- 
ble, they can rapidly evolve resistance to treatments used to exterminate 
them. Moreover, because mutations arise randomly, every case of cancer 
is likely to have its own unique combination of genes mutated. Even within 
an individual patient, tumor cells do not all contain the same genetic 
lesions. Thus, no single treatment is likely to work in every patient, or 
even for every cancer cell within the same patient. Finally, the fact that 
cancers generally are not detected until the primary tumor has reached a 
diameter of 1 cm or more—by which time it consists of hundreds of mil- 
lions of cells that are already genetically diverse and often have already 
begun to metastasize (Figure 20-52)—makes treatment even harder still. 


Figure 20-51 A polyp in the epithelial lining of the colon or rectum, 
caused by loss of both copies of the APC gene, can progress 

to cancer by accumulation of additional driver mutations. The 
diagram shows a sequence of random driver mutations that might 
underlie a typical case of colorectal cancer. After the initial mutation, 

all subsequent driver mutations arise in a single cell that has already 
acquired the previous driver mutations. A sequence of events such 

as that shown here would usually be spread over 10 to 20 years or 
more. Though most colorectal cancers are thought to begin with 

the sequential loss or inactivation of both copies of the APC tumor 
suppressor gene, the subsequent sequence of driver mutations is quite 
variable; indeed, most polyps never progress to cancer. 
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suppressor genes or proto-oncogenes, some of the larger 
polyps will progress to become invasive cancers, unless the 
tissue is removed surgically. (B) Cross section of one such polyp; 
note the excessive quantities of deeply infolded epithelium, 
corresponding to crypts full of abnormal, proliferating cells 
(Movie 20.10). (A, courtesy of Kevin Monahan; B, David Litman/ 
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TABLE 20-2 EXAMPLES OF CANCER-CRITICAL GENES 


Ras Proto-oncogene 


B-catenin Proto-oncogene 


p53 Tumor suppressor gene 


APC Tumor suppressor gene 


Brcal and Brca2 
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Figure 20-52 A tumor is generally not 
diagnosed until it has grown to contain 
hundreds of millions of cells. Here, the 
growth of a typical tumor is plotted on a 
logarithmic scale. Years may elapse before 
the tumor becomes noticeable. The time 
it takes for the number of cells in a typical 
breast tumor to double, for example, is 
about 100 days. 


Tumor suppressor genes 


Activating mutations in Ras render the Ras protein continuously active, promoting 
cell proliferation (discussed in Chapter 16) 


Activating mutations in J-catenin make the B-catenin protein resistant to 
degradation, promoting cell proliferation (see How We Know, pp. 730-731) 


Inactivation of both copies of p53 allows cancer cells to continue to survive and 
divide, even in the presence of damaged DNA (discussed in Chapter 18) 


Inactivation of both copies of APC promotes excessive proliferation of cells in the 
intestinal crypt (see Figure 20-52 and How We Know, pp. 730-731) 


Inactivation of both copies of Brca1 or Brca2 allows cancer cells to continue to 
survive and divide in the presence of massively damaged DNA (discussed below). 


Yet, in spite of these difficulties, an increasing number of cancers are 
being treated effectively. Surgery remains a highly effective tactic, and 
surgical techniques are continually improving: in many cases, if a cancer 
has not spread far, it can often be cured by simply removing it. Where 
surgery fails, the intrinsic peculiarities of cancer cells can be used against 
them. Lack of normal cell-cycle control mechanisms, for example, may 
help make cancer cells particularly vulnerable to DNA damage: whereas a 
normal cell will halt its proliferation until such damage is repaired, a can- 
cer cell may charge ahead regardless, producing daughter cells that may 
die because they inherit too many unrepaired breakages in their chro- 
mosomes. Presumably for this reason, cancer cells can often be killed by 
doses of radiotherapy or DNA-damaging chemotherapy that leave most 
normal cells relatively unharmed. 


Surgery, radiation, and chemotherapy are long-established treatments, 
but many novel approaches are also being enthusiastically pursued. In 
some cases, as with loss of a normal response to DNA damage, the very 
feature that helps to make the cancer cell dangerous also makes it vulner- 
able, enabling doctors to kill it with a properly targeted treatment. Some 
cancers of the breast and ovary, for example, owe their genetic instabil- 
ity to the lack of a tumor suppressor protein (either Brcal or Brca2) that 
aids in the accurate repair of double-strand breaks in DNA (discussed in 
Chapter 6); the cancer cells survive by relying on alternative types of DNA 
repair mechanisms. Drugs that inhibit one of these alternative DNA repair 
mechanisms specifically destroy the cancer cells by raising their genetic 
instability to such a level that the cells die from chromosome fragmenta- 
tion when they attempt to divide. Normal cells, which possess an intact 
double-strand break repair mechanism, remain relatively unaffected. 


Another set of strategies aims to use the immune system to kill the tumor 
cells. Antibodies that recognize tumor-specific cell-surface molecules can 
be produced in vitro and injected into the patient to mark the tumor cells 
for destruction. Other antibodies, aimed at the patient’s immune cells 
rather than the cancer cells, can promote the elimination of cancer cells 
by neutralizing the inhibitory cell-surface proteins that keep the killer 
lymphocytes of the immune system (discussed in Chapter 18) in check. 
Such “checkpoint inhibitor” antibodies, which unleash a killer cell attack 
on cancer cells, are proving to be remarkably effective in the treatment of 
certain cancers, such as melanomas, even after they have metastasized 
(Figure 20-53). 


In some cancers, the products of specific oncogenes can be targeted 
directly so as to block their action, causing the cancer cells to die. In 


Figure 20-53 Anti-immune-checkpoint antibodies release a killer 
cell attack on cancer cells. The antibodies disinhibit the killer cells 
that recognize the novel parts of the proteins encoded by the mutant 
oncogenes of the cancer cells. A patient with advanced metastatic 
melanoma received treatment with such an antibody called ipilimumab. 
After 16 weeks of treatment, the tumors were noticeably smaller, and by 
week 72 they had essentially been eliminated. (From A. Hoos et al., 

J. Natl Cancer Inst. 102:1388-—1397, 2010.) 


chronic myeloid leukemia (CML), the misbehavior of the cancer cells 
depends on a mutant intracellular signaling protein (a tyrosine kinase) 
that causes the cells to proliferate and survive when they should not. 
A small drug molecule, called imatinib (trade name Gleevec), blocks 
the activity of this hyperactive mutant kinase. The results have been a 
dramatic success: in many patients, the abnormal proliferation and sur- 
vival of the leukemic cells are strongly inhibited, providing many years of 
symptom-free patient survival. The same drug is also effective in some 
other cancers that depend on similar oncogenes. 


With these examples before us, we can hope that our modern under- 
standing of the molecular biology of cancer will soon allow us to devise 
effective rational treatments for even more forms of cancer. At the same 
time, cancer research has taught us many important lessons about basic 
cell biology. The applications of that knowledge go far beyond the treat- 
ment of cancer, giving us insight into the way that cells and organisms 
develop and operate. 


ESSENTIAL CONCEPTS 


e Tissues are composed of cells and extracellular matrix. 


e In plants, each cell surrounds itself with extracellular matrix in the 
form of a cell wall, which is made chiefly of cellulose and other 
polysaccharides. 


e An osmotic swelling pressure on plant cell walls keeps plant tissue 
turgid. 


e Cellulose microfibrils in the plant cell wall confer tensile strength, 
while other cell-wall polysaccharides resist compression. 


e The orientation in which the cellulose microfibrils are deposited in 
the cell wall controls the orientation of plant cell growth. 


e Animal connective tissues provide mechanical support to organs and 
limbs; these tissues consist mainly of extracellular matrix, which is 
secreted by a sparse scattering of embedded cells. 


e In the extracellular matrix of animals, tensile strength is provided 
by the fibrous collagen proteins, while glycosaminoglycans (GAGs), 
covalently linked to proteins to form proteoglycans, act as space- 
fillers and provide resistance to compression. 


e Transmembrane integrin proteins link extracellular matrix proteins 
such as collagen and fibronectin to the intracellular cytoskeleton of 
cells that contact the matrix. 


e Cells are connected to one another via cell junctions in epithelial 
sheets that line all external and internal surfaces of the animal body. 


e Cell adhesion proteins of the cadherin family span the epithelial cell 
plasma membrane and bind to identical cadherins in adjacent epithe- 
lial cells. 


e At an adherens junction, the cadherins are linked to intracellular 
actin filaments; at a desmosome junction, they are linked to intracel- 
lular keratin intermediate filaments. 


Essential Concepts 


week 12 


week 16 


week 72 
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™ HOW WE KNOW 


MAKING SENSE OF THE GENES THAT ARE CRITICAL FOR CANCER 


The search for genes that are critical for cancer some- 
times begins with a family that shows an inherited 
predisposition to a particular form of the disease. 
APC—a tumor suppressor gene that is frequently deleted 
or inactivated in colorectal cancer—was tracked down 
by searching for genetic defects in such families prone to 
the disease. But identifying the gene is only half the bat- 
tle. The next step is determining what the normal gene 
does in a normal cell—and why alterations in the gene 
promote cancer. 


Guilt by association 


Determining what a gene—or its encoded product— 
does inside a cell is not a simple task. Imagine isolating 
an uncharacterized protein and being told that it acts 
as a protein kinase. That information does not reveal 
how the protein functions in the context of a living cell. 
What proteins does the kinase phosphorylate? In which 
tissues is it active? What role does it have in the growth, 
development, or physiology of the organism? A great 
deal of additional information is required to understand 
the biological context in which the kinase acts. 


Most proteins do not function in isolation: they interact 
with other proteins in the cell. Thus one way to begin to 
decipher a protein’s biological role is to identify its bind- 
ing partners. If an uncharacterized protein interacts with 
a protein whose role in the cell is understood, the func- 
tion of the unknown protein is likely to be in some way 
related. The simplest method for identifying proteins 
that bind tightly to one another is co-immunoprecipi- 
tation (see Panel 4-2, pp. 140-141). In this technique, 
an antibody is used to capture and precipitate a specific 
target protein from an extract prepared by breaking 
open cells; if this target protein is associated tightly with 
another protein, the partner protein will precipitate as 
well. This is the approach that was taken to characterize 
the Adenomatous Polyposis Coli gene product, APC. 


Two groups of researchers used antibodies against APC 
to isolate the protein from extracts prepared from cul- 
tured human cells. The antibodies captured APC along 
with a second protein. When the researchers examined 
the amino acid sequence of this partner, they recognized 
the protein as B-catenin. 


The discovery that APC interacts with B-catenin initially 
led to some wrong guesses about the role of APC in 
colorectal cancer. In mammals, B-catenin was known 
primarily for its role at adherens junctions, where it 
serves as a linker to connect membrane-spanning 
cadherin proteins to the intracellular actin cytoskel- 
eton (see, for example, Figure 20-23). Thus, for some 
time, scientists thought that APC might be involved in 
cell adhesion. But within a few years, it emerged that 


B-catenin also has another, completely different func- 
tion. It is this unexpected function that turned out to be 
the one that is relevant for understanding APC’s role in 
cancer. 


Wingless flies 


Not long before the discovery that APC binds to B-catenin, 
developmental biologists working on the fruit fly 
Drosophila had noticed that the human B-catenin protein 
is very similar in amino acid sequence to a Drosophila 
protein called Armadillo. Armadillo was known to be a 
key protein in a signaling pathway that has important 
roles in normal development in flies. The pathway is acti- 
vated by the Wnt family of extracellular signal proteins, 
the founding member of which was called Wingless, 
after its mutant phenotype in flies. Wnt proteins bind to 
receptors on the surface of a cell, switching on an intra- 
cellular signaling pathway that ultimately leads to the 
activation of a set of genes that influence cell growth, 
division, and differentiation. Mutations in any of the pro- 
teins in this pathway lead to developmental errors that 
disrupt the basic body plan of the fly. The least devastat- 
ing mutations cause flies to develop without wings; most 
mutations, however, result in the death of the embryo. In 
either case, the damage is done through effects on gene 
expression. This strongly suggested that Armadillo, and 
hence its vertebrate homolog B-catenin, were not just 
involved in cell adhesion, but somehow mediated the 
control of gene expression through the Wnt signaling 
pathway. 


Although the Wnt signaling pathway was discovered 
and studied intensively in fruit flies, it was later found 
to control many aspects of development in vertebrates, 
including mice and humans. Indeed, some of the proteins 
in the Wnt pathway function almost interchangeably 
in Drosophila and vertebrates. The direct link between 
B-catenin and gene expression became clear from work 
in mammalian cells. Just as APC could be used as “bait” 
to catch its partner B-catenin by immunoprecipitation, 
so B-catenin could be used as bait to catch the next pro- 
tein in the signaling pathway. This was found to be a 
transcription regulator called LEF-1/TCF, or TCF for 
short. It too was found to have a Drosophila counterpart 
in the Wnt pathway, and a combination of Drosophila 
genetics and mammalian cell biology revealed how the 
gene control mechanism works. 


Wnt transmits its signal by promoting the accumulation 
of “free” B-catenin (or, in flies, Armadillo)—that is, of 
B-catenin that is not locked up in cell junctions. This free 
protein migrates from the cytoplasm into the nucleus. 
There it binds to the TCF transcription regulator, cre- 
ating a complex that activates transcription of various 
Wnt-responsive genes, including genes whose products 
stimulate cell proliferation (Figure 20-54). 


It turns out that APC regulates the activity of this path- 
way by facilitating degradation of B-catenin and thereby 
preventing it from activating TCF in cells where no Wnt 
signal has been received (see Figure 20-54A). Loss of 
APC allows the concentration of B-catenin to rise, so 
that TCF is activated and Wnt-responsive genes are 
turned on even in the absence of a Wnt signal. But how 
does this promote the development of colorectal can- 
cer? To find out, researchers turned to mice that lack 
TCF4, a member of the TCF gene family that is specifi- 
cally expressed in the gut epithelial lining. 


Tales from the crypt 


Although it may seem counterintuitive, one of the most 
direct ways of finding out what a gene normally does is 
to see what happens to the organism when that gene is 
missing. If one can pinpoint the processes that are dis- 
rupted or compromised, one can begin to decipher the 
gene's function. 


With this in mind, researchers generated “knockout” 
mice in which the gene encoding TCF4 was disrupted. 
The mutation is lethal: mice lacking TCF4 die shortly 
after birth. But the dying animals showed an interest- 
ing abnormality in their intestines. The intestinal crypts, 
which contain the stem cells responsible for the renewal 
of the gut lining (see Figure 20-35), had completely 
failed to develop. The researchers concluded that TCF4 
is normally required for maintaining the pool of prolifer- 
ating gut stem cells. 


(A) WITHOUT Wnt SIGNAL 


(B) WITH Wnt SIGNAL 
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When APC is missing, we see the other side of the coin: 
without APC to promote its degradation, B-catenin 
accumulates in excessive quantities, binds to the TCF4 
transcription regulator, and thereby overactivates the 
TCF4-responsive genes. This drives the formation of 
polyps by promoting the inappropriate proliferation of 
gut stem cells and precursor cells. Differentiated prog- 
eny cells continue to be produced and discarded into the 
gut lumen, but the crypt cell population grows too fast 
for this disposal mechanism to keep pace. The result is 
crypt enlargement and a steady increase in the number 
of crypts. The growing mass of tissue bulges out into the 
gut lumen as a polyp (see Figure 20-50 and Movie 20.9). 
A number of additional mutations are needed, however, 
to convert this benign tumor into an invasive cancer 
(see Figure 20-51). 


More than 60% of human colorectal tumors harbor 
mutations in the APC gene. Interestingly, among the 
minority class of tumors that retain functional APC, 
about a quarter have activating mutations in B-catenin 
instead. These mutations tend to make the f-catenin 
protein more resistant to degradation and thus produce 
the same effect as loss of APC. In fact, mutations that 
enhance the activity of B-catenin have been found in a 
wide variety of other tumor types, including melanomas, 
stomach cancers, and liver cancers. Thus, the genes that 
encode proteins that act in the Wnt signaling pathway 
provide multiple targets for mutations that can spur the 
development of cancer. 
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molecule B-catenin. (B) In the presence of 
Wnt (or in the absence of active APC), free 
B-catenin becomes plentiful and combines 
with the transcription regulator TCF to 
drive transcription of Wnt-responsive 
genes and, ultimately, the proliferation 
By of stem cells and precursor cells in the 
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intestinal crypt (see Figure 20-39). In the 
colon, mutations that inactivate APC 
initiate tumors by causing excessive 
activation of the Wnt signaling pathway. 
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e During development, the actin bundles at the adherens junctions that 
connect cells in an epithelial sheet can contract, helping the epithe- 
lium to bend and pinch off, forming an epithelial tube or vesicle. 


e Hemidesmosomes attach the basal face of an epithelial cell to the 
basal lamina, a specialized sheet of extracellular matrix; the attach- 
ment is mediated by transmembrane integrin proteins, which are 
linked to intracellular keratin filaments. 


e Tight junctions seal one epithelial cell to the next, barring the diffu- 
sion of water-soluble molecules across the epithelium. 


e Gap junctions form channels that allow the direct passage of inor- 
ganic ions and small, hydrophilic molecules from cell to cell; in plants, 
plasmodesmata form a different type of channel, which traverses the 
cell walls, is lined by plasma membrane, and allows both small and 
large molecules to pass from cell to cell. 


e Most tissues in vertebrates are complex mixtures of cell types that 
are subject to continual turnover. 


e Most tissues of an adult animal are maintained and renewed by 
the same basic cell processes that generated them in the embryo: 
cell proliferation, movement, differentiation, and death. As in the 
embryo, these processes are controlled by intercellular communica- 
tion, selective cell-cell adhesion, and cell memory. 


e In many tissues, nondividing, terminally differentiated cells are gen- 
erated from stem cells, usually via proliferating precursor cells. 


e Embryonic stem cells (ES cells) can proliferate indefinitely in culture 
and remain capable of differentiating into any cell type in the body— 
that is, they are pluripotent. 


e Induced pluripotent stem cells (iPS cells), which resemble ES cells, 
can be generated from the cells of adult mammalian tissues, includ- 
ing those of human, through the artificial expression of a small set of 
transcription regulators. 


e Pluripotent stem cells can be induced to form specific cell types and 
even small organs (organoids) in suitable culture conditions, provid- 
ing powerful models for studying human development and genetic 
diseases. 


e Cancer cells fail to obey the social constraints that normally ensure 
that cells survive and proliferate only when and where they should, 
and do not invade regions where they do not belong. 


e Cancers arise from the accumulation of many mutations in a single 
somatic cell lineage; they are genetically unstable, having increased 
mutation rates and, often, major chromosomal abnormalities. 


e Unlike most normal human cells, cancer cells typically express tel- 
omerase, enabling them to proliferate indefinitely without losing DNA 
at their chromosome ends. 


e Most human cancer cells harbor mutations in the p53 gene, allowing 
them to survive and divide even when their DNA is damaged. 


e The mutations that promote cancer can do so either by converting 
one copy of a proto-oncogene into a hyperactive (or overexpressed) 
oncogene or by inactivating both copies of a tumor suppressor gene. 


e Sequencing of cancer genomes reveals that most cancers have 
mutations that subvert the same key pathways controlling cell prolif- 
eration, cell growth, cell survival, and the response to DNA damage. 
In different cases of cancer, these pathways are subverted in different 
ways. 


e Knowing the molecular abnormalities that underlie a particular can- 
cer, one can begin to design treatments targeted specifically to those 
abnormalities. 


KEY TERMS 
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embryonic stem (ES) cell stem cell 

epithelium (plural epithelia) tight junction 
extracellular matrix tissue 

fibroblast tumor suppressor gene 
fibronectin Wnt protein 

gap junction 


QUESTIONS 


QUESTION 20-9 


Which of the following statements are correct? Explain your 
answers. 


A. Gap junctions connect the cytoskeleton of one cell to 
that of a neighboring cell or to the extracellular matrix. 


B. A wilted plant leaf can be likened to a deflated bicycle 
tire. 


C. Because of their rigid structure, proteoglycans can 
withstand a large amount of compressive force. 


D. The basal lamina is a specialized layer of extracellular 
matrix to which sheets of epithelial cells are attached. 


E. Epidermal cells in the skin are continually shed and are 
renewed every few weeks; for a tattoo to be long-lasting, 
it is therefore necessary to deposit pigment below the 
epidermis. 


F. Although stem cells are not differentiated, they are 
specialized in the sense that they give rise only to specific 
cell types. 


QUESTION 20-10 


Which of the following substances would you expect to 
spread from one cell to the next through (a) gap junctions 
and (b) plasmodesmata: glutamic acid, mRNA, cyclic AMP, 
Ca”, proteins, and plasma membrane phospholipids? 


QUESTION 20-11 


Discuss the following statement: “If plant cells contained 
intermediate filaments to provide the cells with tensile 
strength, their cell walls would be dispensable.” 


QUESTION 20-12 


Through the exchange of small metabolites and ions, gap 
junctions provide metabolic and electrical coupling between 
cells. Why, then, do you suppose that neurons communicate 
primarily through chemical synapses (as shown in Figure 
12-40) rather than through gap junctions? 


QUESTION 20-13 


Gelatin is primarily composed of collagen, which is 
responsible for the remarkable tensile strength of 
connective tissue. It is the basic ingredient of jello; yet, 

as you probably experienced many times yourself while 
consuming the strawberry-flavored variety, jello has virtually 
no tensile strength. Why? 


QUESTION 20-14 


“The structure of an organism is determined by the genome 
that the fertilized egg contains.” What is the evidence on 
which this statement is based? Indeed, a friend challenges 
you and suggests that you replace the DNA of a stork’s egg 
with human DNA to see if a human baby results. How would 
you answer him? 


QUESTION 20-15 


Leukemias—that is, cancers arising through mutations that 
cause excessive production of white blood cells—have an 
earlier average age of onset than other cancers. Propose an 
explanation for why this might be the case. 


QUESTION 20-16 
Carefully consider the graph in Figure 20—43, which shows 
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the number of cases of colon cancer diagnosed per 100,000 
women per year as a function of age. Why is this graph 

so steep and curved, if mutations occur with a similar 
frequency throughout a person’s life-span? 


QUESTION 20-17 


Heavy smokers or industrial workers exposed for a limited 
time to a chemical carcinogen that induces mutations in 
DNA do not usually begin to develop cancers characteristic 
of their habit or occupation until 10, 20, or even more years 
after the exposure. Suggest an explanation for this long 
delay. 


QUESTION 20-18 


High levels of the female sex hormone estrogen increase 
the risk of some forms of cancer. Thus, some early types 
of contraceptive pills containing high concentrations of 
estrogen were eventually withdrawn from use because this 
was found to increase the risk of cancer of the lining of the 
uterus. Male transsexuals who use estrogen preparations 
to give themselves a female appearance have an increased 
risk of breast cancer. High levels of androgens (male sex 
hormones) increase the risk of some other forms of cancer, 
such as cancer of the prostate. Can one infer that estrogens 
and androgens are mutagenic? 


QUESTION 20-19 


Is cancer hereditary? 


Answers 


Chapter 1 


ANSWER 1-1 Trying to define life in terms of properties 
is an elusive business, as suggested by this scoring exercise 
(Table A1-1). Vacuum cleaners are highly organized objects, 
and take matter and energy from the environment and 
transform the energy into motion, responding to stimuli 
from the operator as they do so. On the other hand, they 
cannot reproduce themselves, or grow and develop—but 
then neither can old animals. Potatoes are not particularly 
responsive to stimuli, and so on. It is curious that standard 
definitions of life usually do not mention that living 
organisms on Earth are largely made of organic molecules— 
that is, life is carbon based. As we now know, the key 

types of “informational macromolecules"—DNA, RNA, and 
protein—are the same in every living species. 


TABLE A1-1 PLAUSIBLE “LIFE” SCORES FOR A 
VACUUM CLEANER, A POTATO, AND A HUMAN 


Characteristic Freee ewe ae 
Cleaner 
1. Organization Yes Yes Yes 
2. Homeostasis Yes Yes Yes 
3. Reproduction No Yes Yes 
4. Development No Yes Yes 
5. Energy Yes Yes Yes 
6. Responsiveness Yes No Yes 
7. Adaptation No Yes Yes 
ANSWER 1-2 Most random changes to the shoe design 


would result in objectionable defects: shoes with multiple 
heels, with no soles, or with awkward sizes would obviously 
not sell and would therefore be selected against by market 
forces. Other changes would be neutral, such as minor 
variations in color or in size. A minority of changes, however, 
might result in more desirable shoes: deep scratches in a 
previously flat sole, for example, might create shoes that 
would perform better in wet conditions; the loss of high 
heels might produce shoes that are more comfortable 

(and less dangerous). The example illustrates that random 
changes can lead to significant improvements if the 
number of trials is large enough and selective pressures 

are imposed. 


ANSWER 1-3 It is extremely unlikely that you created 

a new organism in this experiment. Far more probably, 

a spore from the air landed in your broth, germinated, 

and gave rise to the cells you observed. In the middle of 
the nineteenth century, Louis Pasteur invented a clever 
apparatus to disprove the then widely accepted belief that 
life could arise spontaneously. He showed that sealed flasks 


containing a nutrient broth that could support microbial 
growth never grew anything if properly heat-sterilized first. 
He overcame the objections of those who pointed out the 
lack of oxygen, or who suggested that his heat sterilization 
killed the life-generating principle, by using a special flask 
with a slender “swan's neck,” which was designed to 
prevent spores carried in the air from contaminating the 
culture (Figure A1-3). The heat-sterilized nutrient broth in 
these flasks never showed any signs of life; however, it was 
capable of supporting life, as could be demonstrated by 
washing some of the “dust” from the neck of the flask into 
the broth. 


original flask swan’s-neck flask 
Figure A1-3 
ANSWER 1-4 6x 103? (= 6 x 1027 g/10712 g) bacteria 


would have the same mass as the Earth. And 6 x 109? = 
21/20, according to the equation describing exponential 
growth. Solving this equation for t results in t = 2642 
minutes (or 44 hours). This represents only 132 generation 
times(!), whereas 5 x 1014 bacterial generation times have 
passed during the last 3.5 billion years. Obviously, the total 
mass of bacteria on this planet is nowhere close to the mass 
of the Earth. This illustrates that exponential growth can 
occur only for very few generations—that is, for minuscule 
periods of time compared with evolution. In any realistic 
scenario, food supplies very quickly become limiting. 

This simple calculation shows us that the ability to grow 
and divide quickly when food is ample is only one factor 
in the survival of a species. Food is generally scarce, and 
individuals of the same species have to compete with one 
another for the limited resources. Natural selection favors 
mutants that either win the competition or find ways to 
exploit food sources that their neighbors are unable to use. 


ANSWER 1-5 By engulfing substances such as food 
particles, eukaryotic cells can sequester them and feed 
on them efficiently. Bacteria, in contrast, have no way of 
capturing lumps of food; they can export substances that 
help break down food substances in the environment, but 
the products of this labor must then be shared with other 
organisms in the same neighborhood. 


ANSWER 1-6 Conventional light microscopy is much 
easier to use and requires much simpler instruments. 
Objects that are 1 um in size can easily be resolved; the 
lower limit of resolution is 0.2 um, which is a theoretical limit 
imposed by the wavelength of visible light. Visible light is 


A:2 Answers 


nondestructive and passes readily through water, making 

it possible to observe living cells. Electron microscopy, on 
the other hand, is much more complicated, both in the 
nature of the instrument and in the preparation of the 
sample (which needs to be extremely thinly sliced, stained 
with an electron-dense heavy metal, and completely 
dehydrated). Living cells cannot be observed in an electron 
microscope. The resolution of electron microscopy is much 
higher, however, and biological objects as small as 1 nm 
can be resolved. To see any structural detail, microtubules, 
mitochondria, and bacteria would need to be analyzed 
either by electron microscopy or by using specific dyes 

to make them visible by confocal or super-resolution 
fluorescence microscopy (although no form of fluorescence 
microscopy can match the resolution of an electron 
microscope). 


ANSWER 1-7 Because the basic workings of all cells 
are so similar, a great deal has been learned from studying 
model systems. Brewer's yeast is a good model for 
eukaryotic cells because yeast cells are much simpler than 
human cancer cells. We can grow them inexpensively and 
in vast quantities, and we can manipulate them genetically 
and biochemically much more easily than human cells. 
This allows us to use yeast to decipher the ground rules 
governing how cells grow and divide. Cancer cells grow 
and divide when they should not (and therefore give rise 
to tumors), and a basic understanding of how cell growth 
and division are normally controlled is therefore directly 
relevant to the cancer problem. Indeed, the National Cancer 
Institute, the American Cancer Society, and many other 
institutions that are devoted to finding a cure for cancer 
strongly support basic research on various aspects of cell 
growth and division in different model systems, including 
yeast. 


ANSWER 1-8 
Panel 1-2 (p. 25). 


ANSWER 1-9 

A. False. The hereditary information is encoded in the cell’s 
DNA, which in turn specifies its proteins (via RNA). 

B. True. Bacteria do not have a nucleus. 

C. False. Plants, like animals, are composed of eukaryotic 
cells, but unlike animal cells, they contain chloroplasts as 
cytoplasmic organelles. The chloroplasts are thought to 
be evolutionarily derived from engulfed photosynthetic 
bacteria. 

D. True. The number of chromosomes varies from one 
organism to another, but is constant in all nucleated 
cells (except germ cells) within the same multicellular 
organism. 

E. False. The cytosol is the cytoplasm excluding all 
membrane-enclosed organelles. 

F. True. The nuclear envelope is a double membrane, and 
mitochondria are surrounded by both an inner and an 
outer membrane. 

G. False. Protozoans are single-celled organisms and 
therefore do not have different tissues or cell types. 
They have a complex structure, however, that has highly 
specialized parts. 

H. Somewhat true. Peroxisomes and lysosomes contain 
enzymes that catalyze the breakdown of substances 
produced in the cytosol or taken up by the cell. One 
can argue, however, that many of these substances are 
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degraded to generate food molecules, and as such are 
certainly not “unwanted.” 


ANSWER 1-10 In this plant cell, A is the nucleus, B is a 
vacuole, C is the cell wall, and D is a chloroplast. The scale 
bar is about 10 um, the width of the nucleus. 


ANSWER 1-11 The three major filaments are actin 
filaments, intermediate filaments, and microtubules. 

Actin filaments are involved in rapid cell movement, 

and are the most abundant filaments in a muscle cell; 
intermediate filaments provide mechanical stability and 

are the most abundant filaments in epidermal cells of the 
skin; and microtubules function as “railroad tracks” for 
many intracellular movements and are responsible for the 
separation of chromosomes during cell division. Other 
functions of all these filaments are discussed in Chapter 17. 


ANSWER 1-12 It takes only 20 hours (i.e., less than a day) 
before mutant cells become more abundant in the culture. 
Using the equation provided in the question, we see that 
the number of the original (“wild-type”) bacterial cells at 
time t minutes after the mutation occurred is 106 x 2#/20, 
The number of mutant cells at time tis 1 x 2/15. To find out 
when the mutant cells “overtake” the wild-type cells, we 
simply have to make these two numbers equal to each other 
(i.e., 106 x 2/20 = 2t/15). Taking the logarithm to base 10 of 
both sides of this equation and solving it for t results in 

t = 1200 minutes (or 20 hours). At this time, the culture 
contains 2 x 1024 cells (10° x 260 + 1 x 280), Incidentally, 

2 x 104 bacterial cells, each weighing 10712 g, would weigh 
2 x 1012 g (= 2 x 10? kg, or 2 million tons!). This can only 
have been a thought experiment. 


ANSWER 1-13 Bacteria continually acquire mutations in 
their DNA. In the population of cells exposed to the poison, 
one or a few cells may already harbor a mutation that makes 
them resistant to the action of the poison. Antibiotics that 
are poisonous to bacteria because they bind to certain 
bacterial proteins, for example, would not work if the 
proteins have a slightly changed surface so that binding 
occurs more weakly or not at all. These mutant bacteria 
would continue dividing rapidly while their cousins are 
slowed down. The antibiotic-resistant bacteria would soon 
become the predominant species in the culture. 


ANSWER 1-14 1013 = 2/1). Therefore, it would take only 
43 days [t = 13/log(2)]. This explains why some cancers 
can progress extremely rapidly. Many cancer cells divide 
much more slowly, however, and many die because of 
their internal abnormalities or because they do not have 

a sufficient blood supply, and so the actual progression of 
cancer is usually slower. 


ANSWER 1-15 Living cells evolved from nonliving 
matter, but they grow and replicate. Like the material they 
originated from, they are governed by the laws of physics, 
thermodynamics, and chemistry. Thus, for example, they 
cannot create energy de novo or build ordered structures 
without the expenditure of free energy. We can understand 
virtually all cellular events, such as metabolism, catalysis, 
membrane assembly, and DNA replication, as complicated 
chemical reactions that can be experimentally reproduced, 
manipulated, and studied in test tubes. 

Despite this fundamental reducibility, a living cell is 
more than the sum of its parts. We cannot randomly mix 


proteins, nucleic acids, and other chemicals together in a 
test tube, for example, and make a cell. The cell functions 
by virtue of its organized structure, and this is a product of 
its evolutionary history. Cells always come from preexisting 
cells, and the division of a mother cell passes both chemical 
constituents and structures to its daughters. The plasma 
membrane, for example, never has to form de novo, but 
grows by expansion of a preexisting membrane; there will 
always be a ribosome, in part made up of proteins, whose 
function it is to make more proteins, including those that 
build more ribosomes. 


ANSWER 1-16 In a multicellular organism, different cells 
take on specialized functions and cooperate with one 
another, so that any one cell type does not have to perform 
all activities for itself. Through such division of labor, 
multicellular organisms are able to exploit food sources 
that are inaccessible to single-celled organisms. A plant, for 
example, can reach the soil with its roots to take up water 
and nutrients, while at the same time, its leaves above 
ground can harvest light energy and COz from the air. By 
protecting its reproductive cells with other specialized cells, 
the multicellular organism can develop new ways to survive 
in harsh environments or to fight off predators. When food 
runs out, it may be able to preserve its reproductive cells 
by allowing them to draw upon resources stored by their 
companions—or even to cannibalize relatives (a common 
process, in fact). 


ANSWER 1-17. The volume and the surface area are 

5.24 x 10-19 m3 and 3.14 x 10712 m? for the bacterial cell, 
and 1.77 x 10-15 m3 and 7.07 x 10-710 m2 for the animal cell, 
respectively. From these numbers, the surface-to-volume 
ratios are 6 x 10° m~! and 4 x 10° m1, respectively. In 
other words, although the animal cell has a 3375-fold larger 
volume, its membrane surface is increased only 225-fold. If 
internal membranes are included in the calculation, however, 
the surface-to-volume ratios of both cells are about equal. 
Thus, because of their internal membranes, eukaryotic cells 
can grow bigger and still maintain a sufficiently large area 
of membrane, which—as we discuss in more detail in later 
chapters—is required for many essential cell functions. 


ANSWER 1-18 There are many lines of evidence for a 
common ancestor cell. Analyses of modern-day living 
cells show an amazing degree of similarity in the basic 
components that make up the inner workings of otherwise 
vastly different cells. Many metabolic pathways, for 
example, are conserved from one cell type to another, 
and the organic compounds that make up polynucleotides 
(DNA and RNA) and proteins are the same in all living 
cells, even though it is easy to imagine that a different 
choice of compounds (e.g., amino acids with different side 
chains) would have worked just as well. Similarly, it is not 
uncommon to find that important proteins have closely 
similar detailed structures in prokaryotic and eukaryotic 
cells. Theoretically, there would be many different ways 
to build proteins that could perform the same functions. 
The evidence overwhelmingly shows that most important 
processes were “invented” only once and then became 
fine-tuned during evolution to suit the particular needs of 
specialized cells and specific organisms. 

It seems highly unlikely, however, that the first cell 
survived to become the primordial founder cell of today’s 
living world. As evolution is not a directed process with 
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purposeful progression, it is more likely that there were a 
vast number of unsuccessful trial cells that replicated for 

a while and then became extinct because they could not 
adapt to changes in the environment or could not survive 

in competition with other trial cells. We can therefore 
speculate that the primordial ancestor cell was a “lucky” cell 
that ended up in a relatively stable environment in which it 
had a chance to replicate and evolve. 


ANSWER 1-19 A quick inspection might reveal the 
characteristic beating of cilia on the cell surface; their 
presence would tell you that the cell was eukaryotic 
(prokaryote flagella have entirely different structures and 
motions compared to eukaryote cilia and flagella). If you 
don’t see them—and you are quite likely not to—you will 
have to look for other distinguishing features. If you are 
lucky, you might see the cell divide. Watch it then with 

the right optics, and you might be able to see condensed 
mitotic chromosomes, which again would tell you that it was 
a eukaryote. Fix the cell and stain it with a dye for DNA: if 
the DNA is contained in a well-defined nucleus, the cell is a 
eukaryote; if you cannot see a well-defined nucleus, the cell 
may be a prokaryote. Alternatively, stain it with fluorescent 
antibodies that bind actin or tubulin (proteins that are highly 
conserved in eukaryotes but absent in bacteria). Embed it, 
section it, and look with an electron microscope: can you 
see organelles such as mitochondria inside your cell? Try 
staining it with Gram stain, which is specific for molecules in 
the cell wall of some classes of bacteria. But all these tests 
might fail, leaving you still uncertain. For a definitive answer, 
you could attempt to analyze the sequences of the DNA 
and RNA molecules that it contains, using the sophisticated 
methods we describe more fully in Chapter 10. If the nucleic 
acid sequences encode molecules that are highly conserved 
in eukaryotes, such as those that form the core components 
of the nuclear pore, you can be sure your cell is a eukaryote. 
If there are no eukaryote-specific sequences, you should 
still be able to distinguish whether you are looking at a 
bacterium or an archaeon. If you can’t detect any DNA or 
RNA, you are probably looking not at a cell but at a 

piece of dirt. 


Chapter 2 


ANSWER 2-1 The chances are excellent because of 

the enormous size of Avogadro’s number. The original 

cup contained one mole of water, or 6 1023 molecules, 
and the volume of the world’s oceans, converted to cubic 
centimeters, is 1.5 x 1024 cm?. After mixing, there should be 
on average 0.4 of a “Greek” water molecule per cm? 

(6 x 1023/1.5 x 1024), or 7.2 molecules in 18 g of Pacific 
Ocean. 


ANSWER 2-2 

A. The atomic number is 6; the atomic weight is 12 
(= 6 protons + 6 neutrons). 

B. The number of electrons is 6 (= the number of protons). 

C. The first shell can accommodate two and the second 
shell eight electrons. Carbon therefore needs four 
additional electrons (or would have to give up four 
electrons) to obtain a full outermost shell. Carbon is 
most stable when it shares four additional electrons with 
other atoms (including other carbon atoms) by forming 
four covalent bonds. 


A:4 Answers 


D. Carbon-14 has two additional neutrons in its nucleus. 
Because the chemical properties of an atom are 
determined by its electrons, carbon-14 is chemically 
identical to carbon-12. 


ANSWER 2-3 The statement is correct. Both ionic 

and covalent bonds are based on the same principles: 

an exchange of electrons. In polar covalent bonds, the 
electrons are shared unequally between the interacting 
atoms. In ionic bonds, the electrons are completely lost by 
one atom and gained by the other. And at the other end of 
the spectrum, electrons can be shared equally between two 
interacting atoms to form a nonpolar covalent bond. There 
are bonds of every conceivable intermediate state, and for 
borderline cases it becomes arbitrary whether a bond is 
described as a very polar covalent bond or an ionic bond. 


ANSWER 2-4 The statement is correct. The hydrogen- 
oxygen bond in water molecules is polar, so the oxygen 
atom carries a more negative charge than the hydrogen 
atoms. These partial negative charges are attracted to the 
positively charged sodium ions, but are repelled from the 
negatively charged chloride ions. 


ANSWER 2-5 

A. Hydronium (H30*) ions result from water dissociating 
into protons and hydroxyl ions, each proton binding to a 
water molecule to form a hydronium ion 
(2H20 — H20 + Ht + OH- — H30* + OHJ). At neutral 
pH—that is, in the absence of an acid providing more 
H30* ions or a base providing more OH- ions—the 
concentrations of H30* ions and OH" ions are equal. We 
know that at neutrality the pH = 7.0, and therefore the 
H* concentration is 107 M. The H+ concentration equals 
the H30* concentration. 

B. To calculate the ratio of H3O* ions to H20 molecules, we 
need to know the concentration of water molecules. The 
molecular weight of water is 18 (i.e., 18 g/mole), and 
1 liter of water weighs 1 kg. Therefore, the concen- 
tration of water is 55.6 M (= 1000 [g/L]/[18 g/mole]), 
and the ratio of H3O* ions to H2O molecules is 
1.8 x 107? (= 10-7/55.6); that is, only two water molecules 
in a billion are dissociated at neutral pH. 


ANSWER 2-6 The synthesis of a macromolecule with 

a unique structure requires that in each position only one 
stereoisomer is used. Changing one amino acid from its 

L- to its p-form would result in a different protein. Thus, if 
for each amino acid a random mixture of the D- and L-forms 
were used to build a protein, its amino acid sequence could 
not specify a single structure, but many different structures 
(2N different structures) would be formed (where N is the 
number of amino acids in the protein). 

Why L-amino acids were selected in evolution as the 
exclusive building blocks of proteins is a mystery; we could 
easily imagine a cell in which certain (or even all) amino acids 
were used in the p-forms to build proteins, as long as these 
particular stereoisomers were used exclusively. 


ANSWER 2-7 The term “polarity” has two different 
meanings. In one meaning, polarity refers to a directional 
asymmetry—for example, in linear polymers such as 
polypeptides, which have an N-terminus and a C-terminus; 
or nucleic acids, which have a 3’ and a5’ end. Because 
bonds form only between the amino and the carboxyl 
groups of the amino acids in a polypeptide, and between 


the 3’ and the 5’ ends of nucleotides, nucleic acids and 
polypeptides always have two different ends, which give the 
chains a defined chemical polarity. 

In the other meaning, polarity refers to a separation of 
electric charge in a bond or molecule. This kind of polarity 
promotes hydrogen-bonding to water molecules, and 
because the water solubility, or hydrophilicity, of a molecule 
depends upon its being polar in this sense, the term “polar” 
also indicates water solubility. 


ANSWER 2-8 A major advantage of condensation 
reactions is that they are readily reversible by hydrolysis 
(and water is readily available in the cell). This allows cells 
to break down their macromolecules (or macromolecules of 
other organisms that were ingested as food) and to recover 
the subunits intact so that they can be “recycled;” that is, 
used to build new macromolecules. 


ANSWER 2-9 Many of the functions that macromolecules 
perform rely on their ability to associate and dissociate 
readily. This chemical flexibility allows cells, for example, 

to remodel their interior when they move or divide, and 

to transport components from one organelle to another. 
Covalent bonds would be too strong and too permanent for 
such a purpose, requiring a specific enzyme to break each 
kind of bond. 


ANSWER 2-10 

A. True. All nuclei are made of positively charged protons 
and uncharged neutrons; the only exception is the 
hydrogen nucleus, which consists of only one proton. 

B. False. Atoms are electrically neutral. The number of 
positively charged protons is always balanced by an 
equal number of negatively charged electrons. 

C. True—but only for the cell nucleus (see Chapter 1), not 
for the atomic nucleus discussed in this chapter. 

D. False. Elements can have different isotopes, which differ 
only in their number of neutrons. 

E. True. In certain isotopes, the large number of neutrons 
destabilizes the nucleus, which decomposes in a process 
called radioactive decay. 

F. True. Examples include granules of glycogen, a polymer 
of glucose, found in liver cells; and fat droplets, made of 
aggregated triacylglycerols, found in fat cells. 

G. True. Individually, these bonds are weak and readily 
broken by thermal motion, but because interactions 
between two macromolecules involve a large number of 
such bonds, the overall binding can be quite strong; and 
because hydrogen bonds form only between correctly 
positioned groups on the interacting macromolecules, 
they are very specific. 


ANSWER 2-11 

A. One cellulose molecule has a molecular weight of 
n x (12[C] + 2 x 1[H] + 16[O]). We do not know n, but 
we can determine the ratio with which the individual 
elements contribute to the weight of cellulose. The 
contribution of carbon atoms is 40% 
[= 12/(12 + 2 + 16) x 100%]. Therefore, 2 g (40% 
of 5 g) of carbon atoms are contained in the cellulose 
that makes up this page. The atomic weight of carbon 
is 12 g/mole, and there are 6 x 1023 atoms or 
molecules in a mole. Therefore, 1023 carbon atoms 
[= (2 g/12 [g/mole]) x 6 x 1023 (molecules/mole)] make 
up this page. 


B. The volume of the page is 4 x 10% m3 (= 21.2 cm x 
27.6 cm x 0.07 mm), which is the same as the volume 
of a cube with a side length of 1.6 cm (= 3V4 x 10% mì). 
Because we know from part A that the page contains 
1023 carbon atoms, geometry tells us that there could be 
about 4.6 x 107 carbon atoms (= 3/1023) lined up along 
each side of this cube. Therefore, in cellulose, about 
200,000 carbon atoms [= (4.6 x 107) x (0.07 x 10-3 m)/ 
1.6 x 1072? m] span the thickness of the page. 

C. If tightly stacked, 350,000 carbon atoms with a 0.2-nm 
diameter would span the 0.07-mm thickness of the page. 

D. The 1.7-fold difference in the two calculations reflects 
(1) that carbon is not the only atom in cellulose and 
(2) that paper is not an atomic lattice of precisely 
arranged molecules (as a diamond would be for precisely 
arranged carbon atoms), but a random meshwork of 
fibers containing many voids. 


ANSWER 2-12 
A. The occupancies of the three innermost electron levels 
are 2, 8, 8. 


B. helium already has full outer electron shell 
oxygen gain 2 
carbon gain 4 or lose 4 
sodium lose 1 
chlorine gain 1 


C. Helium, with its fully occupied outer electron shell, is 
chemically unreactive. Sodium and chlorine, on the other 
hand, are extremely reactive and readily form stable Nat 
and CI- ions that form ionic bonds, as in table salt. 


ANSWER 2-13 Whether a substance is a liquid or a gas 
at a given temperature depends on the attractive forces 
between its molecules. H2S is a gas at room temperature 
and H20 is a liquid because the hydrogen bonds that 
hold H2O molecules together do not form between H2S 
molecules. A sulfur atom is much larger than an oxygen 
atom, and because of its larger size, the outermost 
electrons are not as strongly attracted to the nucleus of the 
sulfur atom as they are in an oxygen atom. Consequently, 
the hydrogen-sulfur bond is much less polar than the 
hydrogen-oxygen bond. Because of the reduced polarity, 
the sulfur in an H2S molecule is not strongly attracted to 
the hydrogen atoms in an adjacent H2S molecule, and the 
hydrogen bonds that are so predominant in water do not 
form. 


ANSWER 2-14 The reactions are diagrammed in Figure 
A2-14, where R4 and R2 are amino acid side chains. 


R; Ry 


| 
HN E —COOH + HN A — COOH 
H H 
H,O H20 
RD o R3 
| | 
H,N — C — C —N — C —COOH 
| | | 


H H H 
Figure A2-14 
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ANSWER 2-15 

A. False. The properties of a protein depend on both the 
amino acids it contains and the order in which they are 
linked together. The diversity of proteins is due to the 
almost unlimited number of ways in which 20 different 
amino acids can be combined in a linear sequence. 

B. False. Lipids assemble into bilayers by noncovalent 
bonds. A membrane is therefore not a macromolecule. 

C. True. The backbone of nucleic acids is made up 
of alternating ribose (or deoxyribose in DNA) and 
phosphate groups. Ribose and deoxyribose are sugars. 

D. True. About half of the 20 naturally occurring amino 
acids have hydrophobic side chains. In folded proteins, 
many of these side chains face toward the inside of 
the folded-up proteins, because they are repelled from 
water. 

E. True. Hydrophobic hydrocarbon tails contain only 
nonpolar covalent bonds. Thus, they cannot participate 
in hydrogen-bonding and are repelled from water. We 
consider the underlying principles in more detail in 
Chapter 11. 

F. False. RNA contains the four listed bases, but DNA 
contains T instead of U. T and U are very much alike, 
however, and differ only by a single methyl group. 


ANSWER 2-16 

A. (a) 400 (= 202); (b) 8000 (= 203); (c) 160,000 (= 204). 

B. A protein with a molecular mass of 4800 daltons is 
made of about 40 amino acids; thus there are 1.1 x 1052 
(= 2040) different ways to make such a protein. Each 
individual protein molecule weighs 8 x 10-21 g 
(= 4800/6 x 1023); thus a mixture of one molecule each 
weighs 9 x 1031 g(=8x 10-21 gx 1.1x 10°2), which 
is 15,000 times the total weight of the planet Earth, 
weighing 6 x 1074 kg. You would need a very large 
container indeed. 

C. Given that most cellular proteins are even larger than the 
one used in this example, it is clear that only a minuscule 
fraction of the total possible amino acid sequences is 
used in living cells. 


ANSWER 2-17 Because all living cells are made up of 
chemicals, and because all chemical reactions (whether 

in living cells or in test tubes) follow the same rules, an 
understanding of basic chemical principles is fundamentally 
important to the understanding of biology. In the course of 
this book, we will frequently refer back to these principles, 
on which all of the more complicated pathways and 
reactions that occur in cells are based. 


ANSWER 2-18 

A. Hydrogen bonds require specific groups to interact; one 
is always a hydrogen atom linked by a polar covalent 
bond to an oxygen or a nitrogen, and the other is usually 
a nitrogen or an oxygen atom. Van der Waals attractions 
are weaker and occur between any two atoms that are 
in close enough proximity. Both hydrogen bonds and 
van der Waals attractions are short-range interactions 
that come into play only when two molecules are already 
close. Both types of bonds can therefore be thought 
of as ameans of “fine-tuning” an interaction; that is, 
helping to position two molecules correctly with respect 
to each other once they have been brought together by 
diffusion. 

B. Van der Waals attractions would form in all three 
examples. Hydrogen bonds would form in (c) only. 
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ANSWER 2-19 Noncovalent bonds form between the 
subunits of macromolecules—e.g., the side chains of amino 
acids in a polypeptide chain—and cause the polymer chain 
to assume a unique shape. These interactions include 
hydrogen bonds, ionic bonds, van der Waals attractions, 
and hydrophobic forces. Because these interactions are 
weak, they can be broken with relative ease; thus, most 
macromolecules can be unfolded by heating, which 
increases thermal motion. 


ANSWER 2-20 Amphipathic molecules have both a 
hydrophilic and a hydrophobic end. Their hydrophilic ends 
can hydrogen-bond to water, but their hydrophobic ends 
are repelled from water because they interfere with the 
water structure. Consequently, the hydrophobic ends of 
amphipathic molecules tend to be exposed to air at air- 
water interfaces, or will always cluster together to minimize 
their contact with water molecules—both at this interface 
and in the interior of an aqueous solution. (See Figure 
A2-20.) 


] hydrophilic 


hydrophobic 


AIR 


water 
molecules 


Figure A2-20 
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A, B. (A) and (B) are both correct formulas of the amino acid 
phenylalanine. In formula (B), phenylalanine is shown in 
the ionized form that exists in solution in water, where 
the basic amino group is protonated and the acidic 
carboxylic group is deprotonated. 

C. Incorrect. This structure of a peptide bond is missing a 
hydrogen atom bound to the nitrogen. 

D. Incorrect. This formula of an adenine base features one 
double bond too many, creating a five-valent carbon 
atom and a four-valent nitrogen atom. 

E. Incorrect. In this formula of a nucleoside triphosphate, 
there should be two additional oxygen atoms, one 
between each of the phosphorus atoms. 

F. This is the correct formula of ethanol. 

G. Incorrect. Water does not hydrogen-bond to hydrogens 
bonded to carbon. The lack of the capacity to hydrogen- 


bond makes hydrocarbon chains hydrophobic; that is, 
water-hating. 

H. Incorrect. Na and Cl form an ionic bond, Na*Cl, but a 
covalent bond is drawn. 

I. Incorrect. The oxygen atom attracts electrons more than 
the carbon atom; the polarity of the two bonds should 
therefore be reversed. 

J. This structure of glucose is correct. 

K. Almost correct. It is more accurate to show that only 
one hydrogen is lost from the -NH2 group and the -OH 
group is lost from the -COOH group. 


Chapter 3 


ANSWER 3-1 The equation represents the “bottom 
line" of photosynthesis, which occurs as a large set of 
individual reactions that are catalyzed by many individual 
enzymes. Because sugars are more complicated molecules 
than CO2 and H20, the reaction generates a more 
ordered state inside the cell. As demanded by the second 
law of thermodynamics, this increase in order must be 
accompanied by a greater increase in disorder, which 
occurs because heat is generated at many steps on the 
long pathway leading to the products summarized in this 
equation. 


ANSWER 3-2 Oxidation is defined as removal of 
electrons, and reduction represents a gain of electrons. 
Therefore, (A) is an oxidation and (B) is a reduction. The 

red carbon atom in (C) remains largely unchanged; the 
neighboring carbon atom, however, loses a hydrogen atom 
(i.e., an electron and a proton) and hence becomes oxidized. 
The red carbon atom in (D) becomes oxidized because it 
loses a hydrogen atom, whereas the red carbon atom in (E) 
becomes reduced because it gains a hydrogen atom. 


ANSWER 3-3 

A. Both states of the coin, H and T, have an equal 
probability. There is therefore no driving force—that 
is, no energy difference—that would favor H turning 
to T, or vice versa. Therefore, AG” = 0 for this reaction. 
However, a reaction proceeds if H and T coins are not 
present in the box in equal numbers. In this case, the 
concentration difference between H and T creates a 
driving force and AG 0; when the reaction reaches 
equilibrium—that is, when there are equal numbers of H 
and T—AG = 0. 

B. The amount of shaking corresponds to the temperature, 
as it results in the “thermal” motion of the coins. The 
activation energy of the reaction is the energy that 
needs to be expended to flip the coin, that is, to stand it 
on its rim, from where it can fall back facing either side 
up. Jigglase would speed up the flipping by lowering 
the energy required for this; it could, for example, be 
a magnet that is suspended above the box and helps 
lift the coins. Jigglase would not affect where the 
equilibrium lies (at an equal number of H and T), but it 
would speed up the process of reaching the equilibrium, 
because in the presence of jigglase more coins would flip 
back and forth. 


ANSWER 3-4 See Figure A3-4. Note that AG°x — yis 
positive, whereas AG°y _, z and AG°y — z are negative. The 
graph also shows that AG®x — z = AG®x _,y + AG°y >z. 


X=Y SZ 


Figure A3-4 


We do not know from the information given in Figure 
3-12 how high the activation-energy barriers are; they are 
therefore drawn to an arbitrary height (solid lines). The 
activation energies would be lowered by enzymes that 
catalyze these reactions, thereby speeding up the reaction 
rates (dotted lines), but the enzymes would not change the 
AG? values. 


ANSWER 3-5 The reaction rates might be limited by: 

(1) the concentration of the substrate—that is, how often a 
molecule of CO2 collides with the active site on the enzyme; 
(2) how many of these collisions are energetic enough to 
lead to a reaction; and (3) how fast the enzyme can release 
the products of the reaction and therefore be free to bind 
more CO2. The diagram in Figure A3-5 shows that the 


activation energy 
for enzyme reaction 


activation energy 
for uncatalyzed reaction 


number of molecules —> 


B A 
energy per molecule —> 


Figure A3-5 
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enzyme lowers the activation-energy barrier, so that more 
CO2 molecules have sufficient energy to undergo the 
reaction. The area under the curve from point A to infinite 
energy or from point B to infinite energy indicates the total 
number of molecules that will react without or with the 
enzyme, respectively. Although not drawn to scale, the ratio 
of these two areas should be 107. 


ANSWER 3-6 All reactions are reversible. If the 
compound AB can dissociate to produce A and B, then it 
must also be possible for A and B to associate to form AB. 
Which of the two reactions predominates depends on the 
equilibrium constant of the reaction and the concentrations 
of A, B, and AB (as discussed in Figure 3—19). Presumably, 
when this enzyme was isolated, its activity was detected by 
supplying A and B in relatively large amounts and measuring 
the amount of AB generated. But suppose, however, that 
in the cell there is a large concentration of AB, in which 
case the enzyme would actually catalyze AB — A + B. (This 
question is based on an actual example in which an enzyme 
was isolated and named according to the reaction in one 
direction, but was later shown to catalyze the reverse 
reaction in living cells.) 


ANSWER 3-7 

A. The rocks in Figure 3—29B provide the energy to lift the 
bucket of water. (i) In the reaction X + ATP > 
Y + ADP + Pj, ATP hydrolysis is driving the reaction; 
thus ATP corresponds to the rocks on top of the cliff. 
(ii) The broken debris in Figure 3—29B corresponds 
to ADP and Pj, the products of ATP hydrolysis. 
(iii) and (iv) In the reaction, ATP hydrolysis is coupled 
to the conversion of X to Y. X, therefore, is the starting 
material, the bucket on the ground, which is converted 
to Y, the bucket at its highest point. 

B. (i) The rocks hitting the ground would be the futile 
hydrolysis of ATP—for example, in the absence of 
an enzyme that uses the energy released by the ATP 
hydrolysis to drive an otherwise unfavorable reaction; 
in this case, the energy stored in the phosphoanhydride 
bond of ATP would be lost as heat. (ii) The energy 
stored in Y could be used to drive another reaction. If 
Y represented the activated form of amino acid X, for 
example, it could undergo a condensation reaction to 
form a peptide bond during protein synthesis. 


ANSWER 3-8 The free energy AG derived from ATP 
hydrolysis depends on both the AG’? and the concentrations 
of the substrate and products. For example, for a particular 
set of concentrations, one might have 


AG =-50 kJ/mole = -30.5 kJ/mole + 2.58 In [ADP] x [Pj] 
[ATP] 


AG is smaller than AG”, largely because the ATP 
concentration in cells is high (in the millimolar range) and 
the ADP concentration is low (in the 10 uM range). The 
concentration term of this equation is therefore smaller than 
1 and its logarithm is a negative number. 

AG” is a constant for the reaction and will not vary with 
reaction conditions. AG, in contrast, depends on the 
concentrations of ATP, ADP, and phosphate, which can be 
somewhat different between cells. 


ANSWER 3-9 Reactions B, D, and E all require coupling 
to other, energetically favorable reactions. In each 
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case, higher-order structures are formed that are more 
complicated and have higher-energy bonds than the 
starting materials. In contrast, reaction A is a catabolic 
reaction that leads to compounds in a lower energy state 
and will occur spontaneously. The nucleoside triphosphates 
in reaction C contain enough energy to drive DNA synthesis 
(see Figure 3—42). 


ANSWER 3-10 

A. Nearly true, but strictly speaking, false. Because 
enzymes enhance the rate but do not change the 
equilibrium point of a reaction, a reaction will always 
occur in the absence of the enzyme, though often at a 
minuscule rate. Moreover, competing reactions may use 
up the substrate more quickly, thus further impeding 
the desired reaction. Thus, in practical terms, without 
an enzyme, some reactions may never occur to an 
appreciable extent. 

B. False. High-energy electrons are more easily transferred; 
that is, they are more loosely bound to the donor 
molecule. This does not mean that they move any faster. 

C. True. Hydrolysis of an ATP molecule to form AMP also 
produces a pyrophosphate (PP;) molecule, which in turn 
is hydrolyzed into two phosphate molecules. This second 
reaction releases almost the same amount of energy 
as the initial hydrolysis of ATP, thereby approximately 
doubling the energy total yield. 

D. True. Oxidation is the removal of electrons, which 
reduces the diameter of the carbon atom. 

E. True. ATP, for example, can donate both chemical-bond 
energy and a phosphate group. 

F. False. Living cells have a particular kind of chemistry 
in which most oxidations are energy-releasing events; 
under different conditions, however, such as in a 
hydrogen-containing atmosphere, reductions would be 
energy-releasing events. 

G. False. All cells, including those of cold- and warm- 
blooded animals, radiate comparable amounts of heat 
as a consequence of their metabolic reactions. For 
bacterial cells, for example, this becomes apparent when 
a compost pile heats up. 

H. False. The equilibrium constant of the reaction 
X «> Y remains unchanged. If Y is removed by a second 
reaction, more X is converted to Y so that the ratio of X 
to Y remains constant. 


ANSWER 3-11 The free-energy difference (AG°) between 
Y and X due to three hydrogen bonds is -12.6 kJ/mole. 
(Note that the free energy of Y is lower than that of X, 
because energy would need to be expended to break the 
bonds to convert Y to X. The value for AG? for the transition 
X — Y is therefore negative.) The equilibrium constant for 
the reaction is therefore about 100 (from Table 3-1, p. 96); 
that is, there are about 100 times more molecules of Y than 
of X at equilibrium. An additional three hydrogen bonds 
would increase AG? to -25.2 kcal/mole and increase the 
equilibrium constant about another 100-fold to 104. 

Thus, relatively small differences in energy can have a 
major effect on equilibria. 


ANSWER 3-12 

A. The equilibrium constant is defined as 
K = [AB]/([A] x [B]). The square brackets indicate the 
concentration. Thus, if A, B, and AB are each 1 uM 
(10% M), K will be 10° liters/mole [= 10-6/(10-* x 10%)]. 


B. Similarly, if A, B, and AB are each 1 nM (10-? M), then 
K will be 10° liters/mole. 

C. This example illustrates that interacting proteins that 
are present in cells in lower concentrations need to bind 
to each other with higher affinities so that a significant 
fraction of the molecules are bound at equilibrium. 

In this particular case, lowering the concentration by 
1000-fold (from uM to nM) requires an increase in the 
equilibrium constant by 1000-fold to maintain the AB 
protein complex in the same proportion (corresponding 
to -17.8 kJ/mole of free energy; see Table 3-1). This 
corresponds to about four or five extra hydrogen bonds. 


ANSWER 3-13 The statement is correct. The criterion for 
whether a reaction proceeds spontaneously is AG, not AG®, 
and takes the concentrations of the reacting components 
into account. A reaction with a negative AG®, for example, 
would not proceed spontaneously under conditions where 
there is a large enough excess of products; that is, more 
than at equilibrium. Conversely, a reaction with a positive 
AG*° might spontaneously go forward under conditions 
where there is a huge excess of substrate. 


ANSWER 3-14 

A. A maximum of 57 ATP molecules (= 2867/50) 
corresponds to the total energy released by the 
complete oxidation of glucose to CO2 and H20. 

B. The overall efficiency of ATP production would be about 
53%, calculated as the number of actually produced ATP 
molecules (30) divided by the number of ATP molecules 
that could be obtained if all the energy stored in a 
glucose molecule could be harvested as chemical energy 
in ATP (57). 

C. During the oxidation of 1 mole of glucose, 1347 kJ (the 
remaining 47% of the available 2867 kJ in one mole of 
glucose that is not stored as chemical energy in ATP) 
would be released as heat. This amount of energy would 
heat your body by 4.3°C (1347 kJ x 0.24 = 323 kcal; 

323 kcal/75 kg = 4.3). This is a significant amount of 
heat, considering that 4°C of elevated temperature 
would be a quite incapacitating fever and that 1 mole 
(180 g) of glucose is no more than two cups of sugar. 

D. If the energy yield were only 20%, then instead of 47% 
in part (C) above, 80% of the available energy would be 
released as heat and would need to be dissipated by 
your body. The heat production would be more than 1.7- 
fold higher than normal, and your body would certainly 
overheat. 

E. The chemical formula of ATP is C19H12013N5P3, and 
its molecular weight is therefore 503 g/mole. Your 
resting body therefore hydrolyzes about 80 moles 
(= 40 kg/0.503 kg/mole) of ATP in 24 hours (this 
corresponds to about 4200 kJ of liberated chemical 
energy). Because every mole of glucose yields 30 
moles of ATP, this amount of energy could be produced 
by oxidation of 480 g glucose (= 180 g/mole x 80 
moles/30). 


ANSWER 3-15 This scientist is definitely a fake. The 57 
ATP molecules would store about 2850 kJ (= 57 x 50 kJ) 

of chemical energy, which implies that the efficiency of 

ATP production from glucose would have been greater 

than 99%. This impossible degree of efficiency would leave 
virtually no energy to be released as heat, and this release is 
required according to the laws of thermodynamics. 


ANSWER 3-16 

A. From Table 3-1 (p. 96) we know that a free-energy 
difference of 17.8 kJ/mole corresponds to an 
equilibrium constant of 10-3; that is, [A*]/[A] = 107%. The 
concentration of A* is therefore 1000-fold lower than 
that of A at equilibrium. 

B. The ratio of A to A* would be unchanged. Lowering 
the activation-energy barrier with an enzyme would 
accelerate the rate of the reaction; that is, it would allow 
more molecules in a given time period to convert from 
A — A* and from A* — A, but it would not affect the 
ratio of A to A* at equilibrium. 


ANSWER 3-17 

A. The mutant mushroom would probably be safe to eat. 
ATP hydrolysis can provide approximately -50 kJ/mole 
of energy. This amount of energy shifts the equilibrium 
point of a reaction by an enormous factor: about 
108-fold. (From Table 3-1, p. 96, we see that 
-23.8 kJ/mole corresponds to an equilibrium constant 
of 104; thus, -50 kJ/mole corresponds to about 108. 
Note that, for coupled reactions, energies are additive, 
whereas equilibrium constants are multiplied.) Therefore, 
if the energy of ATP hydrolysis cannot be utilized by 
the enzyme, 108-fold less poison is made. This example 
illustrates that coupling a reaction to the hydrolysis of an 
activated carrier molecule can shift the equilibrium point 
drastically. 

B. It would be risky to consume this mutant mushroom. 
Slowing down the reaction rate would not affect its 
equilibrium point, and if the reaction were allowed to 
proceed for a long enough time, the mushroom would 
likely be loaded with poison. It is possible that the 
reaction would not reach equilibrium, but it would not 
be advisable to take a chance. 


ANSWER 3-18 Enzyme A is beneficial. It allows the 
interconversion of two energy-carrier molecules, both 
of which are required as the triphosphate form for many 
metabolic reactions. Any ADP that is formed is quickly 
converted to ATP, and thus the cell maintains a high 
ATP/ADP ratio. Because of enzyme A, called nucleotide 
phosphokinase, some of the ATP is used to keep the 
GTP/GDP ratio similarly high. 

Enzyme B would be highly detrimental to the cell. Cells 
use NAD* as an electron acceptor in catabolic reactions 
and must maintain high concentrations of this form of the 
carrier, as it is used in reactions that break down glucose to 
make ATP. In contrast, NADPH is used as an electron donor 
in biosynthetic reactions and is kept at a high concentration 
in the cells so as to allow the synthesis of nucleotides, 
fatty acids, and other essential molecules. Because enzyme 
B would deplete the cell's reserves of both NAD* and 
NADPH, it would decrease the rates of both catabolic and 
biosynthetic reactions. 


ANSWER 3-19 Because enzymes are catalysts, enzyme 
reactions have to be thermodynamically feasible; the 
enzyme only lowers the activation-energy barrier that 
otherwise slows the rate with which the reaction occurs. 
Heat confers more kinetic energy to substrates so that a 
higher fraction of them can surmount the normal activation- 
energy barrier. Many substrates, however, have many 
different ways in which they could react, and all of these 
potential pathways will be enhanced by heat. An enzyme, 
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by contrast, acts selectively to facilitate only one particular 
pathway that, in evolution, was selected to be useful for the 
cell. Heat, therefore, cannot substitute for enzyme function, 
and chicken soup must exert its claimed beneficial effects by 
other mechanisms, which remain to be discovered. 


Chapter 4 


ANSWER 4-1 Urea is a very small organic molecule 

that functions both as an efficient hydrogen-bond donor 
(through its NH2 groups) and as an efficient hydrogen-bond 
acceptor (through its C=O group). As such, it can squeeze 
between hydrogen bonds that stabilize protein molecules 
and thus destabilize protein structures. In addition, the 
nonpolar side chains of a protein are held together in 

the interior of the folded structure because they would 
disrupt the structure of water if they were exposed. At 
high concentrations of urea, the hydrogen-bonded network 
of water molecules becomes disrupted so that these 
hydrophobic forces are significantly diminished. Proteins 
unfold in urea as a consequence of its effect on these two 
forces. 


ANSWER 4-2 The amino acid sequence consists of 
alternating nonpolar and charged or polar amino acids. 
The resulting strand in a B sheet would therefore be polar 
on one side and hydrophobic on the other. Such a strand 
would probably be surrounded on either side by similar 
strands that together form a B sheet with a hydrophobic 
face and a polar face. In a protein, such a P sheet (called 
“amphipathic,” from the Greek amphi, “of both kinds,” 
and pathos, “passion,” because of its two surfaces with 
such different properties) would be positioned so that the 
hydrophobic side would face the protein's interior and the 
polar side would be on its surface, exposed to the water 
outside. 


ANSWER 4-3 Mutations that are beneficial to an 
organism are selected in evolution because they confer 

a reproductive or survival advantage to the organism. 
Examples might be a more efficient utilization of a food 
source, enhanced resistance to environmental insults, or an 
improved ability to attract a mate for sexual reproduction. 
In contrast, useless proteins are detrimental to organisms, 
as the metabolic energy required to make them is a wasted 
cost. If such mutant proteins were made in excess, the 
synthesis of normal proteins would suffer because the 
synthetic capacity of the cell is limited. In more severe 
cases, a mutant protein could interfere with the normal 
workings of the cell; a mutant enzyme that still binds an 
activated carrier molecule but does not catalyze a reaction, 
for example, may compete for a limited amount of this 
carrier and therefore inhibit normal processes. Natural 
selection therefore provides a strong driving force that 
eliminates both useless and harmful proteins. 


ANSWER 4-4 Strong reducing agents that break all of 
the S-S bonds would cause all of the keratin filaments to 
separate. Individual hairs would be weakened and fragment. 
Indeed, strong reducing agents are used commercially in 
hair-removal creams sold by your local pharmacist. However, 
mild reducing agents are used in treatments that either 
straighten or curl hair, the latter requiring hair curlers. (See 
Figure A4-4.) 
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ANSWER 4-6 


A. Feedback inhibition from Z that affects the reaction 
B — C would increase the flow through the B > X > 
Y — Z pathway, because the conversion of B to C is 
inhibited. Thus, the more Z there is, the more production 
of Z would be stimulated. This is likely to result in an 
uncontrolled “runaway” amplification of this pathway. 

B. Feedback inhibition from Z affecting Y —> Z would only 
inhibit the production of Z. In this scheme, however, X 
and Y would still be made at normal rates, even though 
both of these intermediates are no longer needed at this 
level. This pathway is therefore less efficient than the 
one shown in Figure 4—42. 

C. If Z is a positive regulator of the step B — X, then the 
more Z there is, the more B will be converted to X and 
therefore shunted into the pathway producing more Z. 


This would result in a runaway amplification similar to 
that described for (A). 

D. If Zis a positive regulator of the step B — C, then 
accumulation of Z leads to a redirection of the pathway 
to make more C. This is a second possible way, in 
addition to that shown in the figure, to balance the 
distribution of compounds into the two branches of the 
pathway. 


ANSWER 4-7 Both nucleotide binding and 
phosphorylation can induce allosteric changes in proteins. 
These can have a multitude of consequences, such as 
altered enzyme activity, drastic shape changes, and changes 
in affinity for other proteins or small molecules. Both 
mechanisms are quite versatile. An advantage of nucleotide 
binding is the fast rate with which a small nucleotide 

can diffuse to the protein; the shape changes that 
accompany the function of motor proteins, for example, 
require quick nucleotide replenishment. If the different 
conformational states of a motor protein were controlled 

by phosphorylation, for example, a protein kinase would 
either need to diffuse into position at each step, a much 
slower process, or be associated permanently with each 
motor protein. One advantage of phosphorylation is that it 
requires only a single amino acid on the protein's surface, 
rather than a specific binding site. Phosphates can therefore 
be added to many different side chains on the same protein 
(as long as protein kinases with the proper specificities 
exist), thereby vastly increasing the complexity of regulation 
that can be achieved for a single protein. 


ANSWER 4-8 In working together in a complex, all three 
proteins contribute to the specificity (by binding to the safe 
and key directly). They help position one another correctly, 
and provide the mechanical bracing that allows them to 
perform a task that they could not perform individually 

(the key is grasped by two of the proteins, for example). 
Moreover, their functions are generally coordinated in time 
(for instance, the binding of ATP to one subunit is likely to 
require that ATP has already been hydrolyzed to ADP by 
another). 


ANSWER 4-9 The a helix is right-handed. The three 
strands that form the large P sheet are antiparallel. There 
are no knots in the polypeptide chain, presumably because 
a knot would interfere with the folding of the protein into its 
three-dimensional conformation after protein synthesis. 


ANSWER 4-10 

A. True. Only a few amino acid side chains contribute to the 
active site. The rest of the protein is required to maintain 
the polypeptide chain in the correct conformation, 
provide additional binding sites for regulatory purposes, 
and localize the protein in the cell. 

B. True. Some enzymes form covalent intermediates with 
their substrates (see middle panels of Figure 4-39); 
however, in all cases, the enzyme is restored to its 
original structure after the reaction. 

C. False. P sheets can, in principle, contain any number of 
strands because the two strands that form the rims of 
the sheet are available for hydrogen-bonding to other 
strands. (B sheets in known proteins contain from 2 to 16 
strands.) 

D. False. It is true that the specificity of an antibody 
molecule is exclusively contained in polypeptide loops 


on its surface; however, these loops are contributed 
by both the folded light and heavy chains (see Figure 
4-33). 

E. False. The possible linear arrangements of amino acids 
that lead to a stably folded protein domain are so few 
that most new proteins evolve by alteration of old ones. 

F. True. Allosteric enzymes generally bind one or more 
molecules that function as regulators at sites that are 
distinct from the active site. 

G. False. Although single noncovalent bonds are weak, 
many such bonds acting together are major contributors 
to the three-dimensional structure of macromolecules. 

H. False. Affinity chromatography separates specific 
macromolecules because of their interactions with 
specific ligands, not because of their charge. 

|. False. The larger an organelle is, the more centrifugal 
force it experiences and the faster it sediments, despite 
an increased frictional resistance from the fluid through 
which it moves. 


ANSWER 4-1] In an a helix and in the central strands of a 
B sheet, all of the N-H and C=O groups in the polypeptide 
backbone are engaged in hydrogen bonds. This gives 
considerable stability to these secondary structural 
elements, and it allows them to form in many different 
proteins. 


ANSWER 4-12 No. It would not have the same or even a 
similar structure, because the peptide bond has a polarity. 
Looking at two sequential amino acids in a polypeptide 
chain, the amino acid that is closer to the N-terminal end 
contributes the carboxyl group and the other amino acid 
contributes the amino group to the peptide bond that 
links the two amino acids. Changing their order would 

put the side chains into different positions with respect to 
the peptide backbone and therefore change the way the 
polypeptide folds. 


ANSWER 4-13 As it takes 3.6 amino acids to complete a 
turn of an a helix, this sequence of 14 amino acids would 
make close to 4 full turns. It is remarkable because its 
polar and hydrophobic amino acids are spaced so that all 
the polar ones are on one side of the a helix and all the 
hydrophobic ones are on the other. It is therefore likely 
that such an amphipathic a helix is exposed on the protein 
surface with its hydrophobic side facing the protein's 
interior. In addition, two such helices might wrap around 
each other as shown in Figure 4—16. 


ANSWER 4-14 

A. ES represents the enzyme-substrate complex. 

B. Enzyme and substrate are in equilibrium between their 
free and bound states; once bound to the enzyme, a 
substrate molecule may either dissociate again (hence 
the bidirectional arrows) or be converted to product. 

As the substrate is converted to product (with the 
concomitant release of free energy), however, a reaction 
usually proceeds strongly in the forward direction, as 
indicated by the unidirectional arrow. 

C. The enzyme is a catalyst and is therefore liberated in an 
unchanged form after the reaction; thus, E appears at 
both ends of the equation. 

D. Often, the product of a reaction resembles the substrate 
sufficiently that it can also bind to the enzyme. Any 
enzyme molecules that are bound to the product 
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(i.e., are part of an EP complex) are unavailable for 
catalysis; excess P therefore can inhibit the reaction by 
lowering the concentration of free E. 

E. Compound X would act as an inhibitor of the reaction 
and work similarly by forming an EX complex. However, 
since P has to be made before it can inhibit the reaction, 
it takes longer to act than X, which is present from the 
beginning of the reaction. 


ANSWER 4-15 The polar amino acids Ser, Ser-P, Lys, Gln, 
His, and Glu are more likely to be found on a protein's 
surface, and the hydrophobic amino acids Leu, Phe, Val, 
lle, and Met are more likely to be found in its interior. The 
oxidation of two cysteine side chains to form a disulfide 
bond eliminates their potential to form hydrogen bonds 
and therefore makes them even more hydrophobic; thus 
disulfide bonds are usually found in the interior of proteins. 
Irrespective of the nature of their side chains, the most 
N-terminal amino acid and the most C-terminal amino acid 
each contain a charged group (the amino and carboxyl 
groups, respectively, that mark the ends of the polypeptide 
chain) and hence are usually found on the protein's surface. 


ANSWER 4-16 Many secondary structural elements are 
not stable in isolation but are stabilized by other parts of 
the polypeptide chain. Hydrophobic regions of fragments, 
which would normally be hidden in the inside of a folded 
protein, would be exposed to water molecules in an 
aqueous solution; such fragments would tend to aggregate 
nonspecifically, and not have a defined structure, and they 
would be inactive for ligand binding, even if they contained 
all of the amino acids that would normally contribute to 
the ligand-binding site. A protein domain, in contrast, is 
considered a folding unit, and fragments of a polypeptide 
chain that correspond to intact domains are often able to 
fold correctly. Thus, separated protein domains often retain 
their activities, such as ligand binding, if the binding site is 
contained entirely within the domain. Thus the most likely 
place in which the polypeptide chain of the protein in Figure 
4-20 could be severed to give rise to stable fragments 

is at the boundary between the two domains (i.e., at the 
loop between the two a helices at the bottom right of the 
structure shown). 


ANSWER 4-17 Because of the lack of secondary structure, 
the C-terminal region of neurofilament proteins undergoes 
continual Brownian motion. The high density of negatively 
charged phosphate groups means that the C-terminals 
also experience repulsive interactions, which cause 

them to stand out from the surface of the neurofilament 
like the bristles of a brush. In electron micrographs of 

a cross section of an axon, the region occupied by the 
extended C-terminals appears as a clear zone around 

each neurofilament, from which organelles and other 
neurofilaments are excluded. 


ANSWER 4-18 The heat-inactivation of the enzyme 
suggests that the mutation causes the enzyme to have a 
less stable structure. For example, a hydrogen bond that 
is normally formed between two amino acid side chains 
might no longer be formed because the mutation replaces 
one of these amino acids with a different one that cannot 
participate in the bond. Lacking such a bond that normally 
helps to keep the polypeptide chain folded properly, the 
protein partially or completely unfolds at a temperature at 
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which it would normally be stable. Polypeptide chains that 
denature when the temperature is raised often aggregate, 
and they rarely refold into active proteins when the 
temperature is decreased. 


ANSWER 4-19 The motor protein in the illustration can 
move just as easily to the left as to the right and so will not 
move steadily in one direction. However, if just one of the 
steps is coupled to ATP hydrolysis (for example, by making 
detachment of one foot dependent on binding of ATP and 
coupling the reattachment to hydrolysis of the bound ATP), 
then the protein will show unidirectional movement that 
requires the continued consumption of ATP. Note that, in 
principle, it does not matter which step is coupled to ATP 
hydrolysis (Figure A4—19). 
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Figure A4-19 


ANSWER 4-20 The slower migration of small molecules 
through a gel-filtration column occurs because smaller 
molecules have access to many more spaces in the porous 
beads that are packed into the column than do larger 
molecules. However, it is important that the flow rate 
through the column is slow enough to give the smaller 
molecules sufficient time to diffuse into the spaces inside 
the beads. At very rapid flow rates, all molecules will move 
rapidly around the beads, so that large and small molecules 
will now tend to exit together from the column. 


ANSWER 4-2] The a helix in the figure is right-handed, 
whereas the coiled-coil is left-handed. The reversal occurs 
because of the staggered positions of hydrophobic side 
chains in the a helix. 


ANSWER 4-22 The atoms at the binding sites of proteins 
must be precisely located to fit the molecules that they 
bind. Their location in turn requires the precise positioning 
of many of the amino acids and their side chains in the core 
of the protein, distant from the binding site itself. Thus, 
even a small change in this core can disrupt protein function 
by altering the conformation at a binding site far away. 


ANSWER 4-23 

A. When [S] << Km, the term (Km + [S]) approaches Km. 
Therefore, the equation is simplified to rate = 
VmaxlS]/Km. Therefore, the rate is proportional to [S]. 

B. When [S] = Km, the term [S]/(Km + [S]) equals 1⁄2. 
Therefore, the reaction rate is half of the maximal rate 
Vmax: 

C. If [S] >> Km, the term (Ky + [S]) approaches [S]. 
Therefore, [S]/(Km + [S]) equals 1 and the reaction occurs 
at its maximal rate Vmax- 


ANSWER 4-24 The substrate concentration is 1 mM. 
This value can be obtained by substituting values into the 
equation, but it is simpler to note that the desired rate 
(50 umole/sec) is exactly half of the maximum rate, Vmax 
where the substrate concentration is typically equal to the 
Ku. The two plots requested are shown in Figure A4—24. 
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A plot of 1/rate versus 1/[S] is a straight line because 
rearranging the standard equation yields the equation given 
in Question 4-26B. 


ANSWER 4-25 If [S] is very much smaller than Ky, the 
active site of the enzyme is mostly unoccupied. If [S] is very 
much greater than Ky, the reaction rate is limited by the 
enzyme concentration (because most of the catalytic sites 
are fully occupied). 


ANSWER 4-26 

A,B. The data in the boxes have been used to plot the red 
curve and red line in Figure A4—26. From the plotted 
data, the Ky is 1 uM and the Vmax is 2 Lmole/min. Note 
that the data are much easier to interpret in the linear 
plot, because the curve in (A) approaches, but never 
reaches, Vmax- 

C. It is important that only a small quantity of product is 
made, because otherwise the rate of reaction would 
decrease as the substrate was depleted and product 
accumulated. Thus the measured rates would be lower 
than they should be. 

D. If the Ky increases, then the concentration of substrate 
needed to give a half-maximal rate is increased. As 
more substrate is needed to produce the same rate, 
the enzyme-catalyzed reaction has been inhibited by 
the phosphorylation. The expected data plots for the 
phosphorylated enzyme are the green curve and the 
green line in Figure A4—26. 


Chapter 5 


ANSWER 5-1 

A. False. The polarity of a DNA strand commonly refers to 
the orientation of its sugar-phosphate backbone, one 
end of which contains a phosphate group and the other 
a hydroxyl group. 
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B. True. G-C base pairs are held together by three 
hydrogen bonds, whereas A-T base pairs are held 
together by only two. 


ANSWER 5-2 Histone octamers occupy about 9% of the 
volume of the nucleus. The volume of the nucleus is 


V = 4/3 x 3.14 x (3 x 103 nm)’ 
V= 1.13 x 1011 nm? 
The volume of the histone octamers is 
V = 3.14 x (4.5 nm)? x (5 nm) x (32 x 106) 
V = 1.02 x 1010 nm 


The ratio of the volume of histone octamers to the nuclear 
volume is 0.09; thus, histone octamers occupy about 9% of 
the nuclear volume. Because the DNA also occupies about 
9% of the nuclear volume, together they occupy about 18% 
of the volume of the nucleus. 


ANSWER 5-3 In contrast to most proteins, which 
accumulate amino acid changes over evolutionary time, 

the functions of histone proteins must involve nearly all of 
their amino acids, so that a change in any position would be 
deleterious to the cell. 


Answers 


ANSWER 5-4 Men have only one copy of the X 
chromosome in their cells; a defective gene carried on it 
therefore has no backup copy. Women, on the other hand, 
have two copies of the X chromosome in their cells, one 
inherited from each parent, so a defective copy of the 
gene on one X chromosome can generally be compensated 
for by a normal copy on the other chromosome. This 
is the case with regard to the gene that causes color 
blindness. However, during female development, one X 
chromosome in each cell is inactivated by compaction into 
heterochromatin, shutting down gene expression from that 
chromosome (see Figure 5-28). This inactivation occurs 
at random in each cell to one or the other of the two X 
chromosomes, and therefore some cells of the woman will 
express the mutant copy of the gene, whereas others will 
express the normal copy. This process results in a retina 
in which, on average, only every other cone cell is color- 
sensitive, and women carrying the mutant gene on one X 
chromosome therefore see colored objects with reduced 
resolution. 

A woman who is color-blind must have two defective 
copies of this gene, one inherited from each parent. 
Her father must therefore carry the mutation on his X 
chromosome; because this is his only copy of the gene, he 
would be color-blind. Her mother could carry the defective 
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gene on either or both of her X chromosomes: if she carried 


it on both, she would be color-blind; if she carried it on 
one, she would have color vision but reduced resolution, as 
described above. Several different types of inherited color 
blindness are found in the human population; this question 
applies to only one type. 


ANSWER 5-5 
A. The complementary strand reads 


5'-TGATTGTGGACAAAAATCC-3'. Paired DNA strands have 
opposite polarity, and the convention is to write a single- 


stranded DNA sequence in the 5'-to-3' direction. 

B. The DNA is made of four nucleotides (100% = 13% A + 
x% T + y% G + z% C). Because A pairs with T, the two 
nucleotides are represented in equimolar proportions in 
DNA. Therefore, the bacterial DNA in question contains 
13% thymine. This leaves 74% [= 100% - (13% + 13%)] 
for G and C, which also form base pairs and hence are 
equimolar. Thus y = z = 74/2 = 37% of each. 

C. A single-stranded DNA molecule that is N nucleotides 
long can have any one of 4 possible sequences. 

D. To specify a unique sequence that is N nucleotides long, 
4N has to be larger than 3 x 10°. Thus, 
4N > 3 x 10°, solved for N, gives N > In(3 x 10°)/ 

In(4) = 10.7. Thus, on average, a sequence of only 

11 nucleotides in length is unique in the genome. 
Performing the same calculation for the genome size of 
an animal cell yields a minimal stretch of 16 nucleotides. 
This shows that a relatively short sequence can mark 

a unique position in the genome and is sufficient, for 
example, to serve as an identity tag for one specific 
gene. 


ANSWER 5-6 If the wrong bases were frequently 
incorporated during DNA replication, genetic information 
could not be inherited accurately. Life, as we know it, could 
not exist. Although the bases can form hydrogen-bonded 
pairs as indicated, these do not fit into the structure of the 
double helix. The angle at which the A base is attached to 
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the sugar—-phosphate backbone is vastly different in the 

A-C pair compared with A-T, and the spacing between the 
two sugar-phosphate strands is considerably increased 

in the A-G pair, where two large purine rings interact. 
Consequently, it is energetically unfavorable to incorporate 
a wrong base in DNA, and such errors occur only very rarely. 


ANSWER 5-7 

A. The bases V, W, X, and Y can form a DNA-like double- 
helical molecule with virtually identical properties to 
those of bona fide DNA. V would always pair with X, 
and W with Y. Therefore, the macromolecule could 
be derived from a living organism that uses the same 
principles to replicate its genome as those used by 
organisms on Earth. In principle, different bases, such 
as V, W, X, and Y, could have been selected during 
evolution on Earth as building blocks for DNA. (Similarly, 
there are many more conceivable amino acid side chains 
than the set of 20 selected in evolution that make up all 
proteins.) 

B. None of the bases V, W, X, or Y can replace A, T, G, or 
C. To preserve the distance between the two sugar- 
phosphate strands in a double helix, a pyrimidine always 
has to pair with a purine (see, for example, Figure 5-4). 
Thus, the eight possible combinations would be V-A, 
V-G, W-A, W-G, X-C, X-T, Y-C, and Y-T. Because of the 
positions of hydrogen-bond acceptors and hydrogen- 
bond donor groups, however, no stable base pairs would 
form in any of these combinations, as shown for the 
pairing of V and A in Figure A5-7, where only a single 
hydrogen bond could form. 
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ANSWER 5-8 As the two strands are held together 


by hydrogen bonds between the bases, the stability of a 
DNA double helix is largely dependent on the number of 
hydrogen bonds that can be formed. Thus two parameters 
determine the stability: the number of nucleotide pairs and 
the number of hydrogen bonds that each nucleotide pair 
contributes. As shown in Figure 5-4, an A-T pair contributes 
two hydrogen bonds, whereas a G-C pair contributes three 
hydrogen bonds. Therefore, helix C (containing a total of 

34 hydrogen bonds) would melt at the lowest temperature, 
helix B (containing a total of 65 hydrogen bonds) would melt 


next, and helix A (containing a total of 78 hydrogen bonds) 
would melt last. Helix A is the most stable, largely owing 

to its high GC content. Indeed, the DNA of organisms that 
grow in extreme temperature environments, such as certain 
prokaryotes that grow in geothermal vents, has an unusually 
high GC content. 


ANSWER 5-9 The DNA would be enlarged by a factor 

of 2.5 x 106 (= 5 x 1073/2 x 10-9 m). Thus the extension 
cord would be 2500 km long. This is approximately the 
distance from London to Istanbul, San Francisco to Kansas 
City, Tokyo to the southern tip of Taiwan, and Melbourne to 
Cairns. Adjacent nucleotides would be about 0.85 mm apart 
(which is only about the thickness of a stack of 12 pages of 
this book). A gene that is 1000 nucleotide pairs long would 
be about 85 cm in length. 


ANSWER 5-10 

A. It takes two bits to specify each nucleotide pair (for 
example, 00, 01, 10, and 11 would be the binary codes 
for the four different nucleotides, each paired with its 
appropriate partner). 

B. The entire human genome (3 x 10? nucleotide pairs) 
could be stored on two CDs (3 x 10? x 2 bits/4.8 x 10? 
bits). 


ANSWER 5-11 

A. True. 

B. False. Nucleosome core particles are approximately 
11 nm in diameter. 


ANSWER 5-12 The definitions of the terms can be found 
in the Glossary. DNA assembles with specialized proteins 
to form chromatin. At a first level of packing, histones form 
the core of nucleosomes. A nucleosome includes the DNA 
wrapped around this histone core plus a segment of linker 
DNA. Between nuclear divisions—that is, in interphase—the 
chromatin of the interphase chromosomes is in a relatively 
extended form in the nucleus, although some regions of 

it, the heterochromatin, remain densely packed and are 
transcriptionally inactive. During nuclear division—that is, in 
mitosis—replicated chromosomes become condensed into 
mitotic chromosomes, which are transcriptionally inactive 
and are designed to be readily distributed between the two 
daughter cells. 


ANSWER 5-13 Colonies are clumps of cells that originate 
from a single founder cell and grow outward as the cells 
divide again and again. In the lower colony of Figure Q5-13, 
the Ade2 gene is inactivated when placed near a telomere, 
but apparently it can become spontaneously activated in a 
few cells, which then turn white. Once activated in a cell, the 
Ade2 gene continues to be active in the descendants of that 
cell, resulting in clumps of white cells (the white sectors) 

in the colony. This result shows both that the inactivation 

of a gene positioned close to a telomere can be reversed 
and that this change is passed on to further generations. 
This change in Ade2 expression probably results from a 
spontaneous decondensation of the chromatin structure 
around the gene. 


ANSWER 5-14 In the electron micrographs, one can 
detect chromatin regions of two different densities; the 
densely stained regions correspond to heterochromatin, 
while less condensed chromatin is more lightly stained. 
The chromatin in (A) is mostly in the form of condensed, 


transcriptionally inactive heterochromatin, whereas most 
of the chromatin in (B) is decondensed and therefore 
potentially transcriptionally active. The nucleus in (A) is 
from a reticulocyte, a red blood cell precursor, which is 
largely devoted to making a single protein, hemoglobin. 
The nucleus in (B) is from a lymphocyte, which is active in 
transcribing many different genes. 


ANSWER 5-15 Helix (A) is right-handed. Helix (C) is 
left-handed. Helix (B) has one right-handed strand and 
one left-handed strand. There are several ways to tell the 
handedness of a helix. For a vertically oriented helix, like 
the ones in Figure Q5-15, if the strands in front point up 
to the right, the helix is right-handed; if they point up to 
the left, the helix is left-handed. Once you are comfortable 
identifying the handedness of a helix, you will be amused 
to note that nearly 50% of the “DNA” helices shown 

in advertisements are left-handed, as are a surprisingly 
high number of the ones shown in books. Amazingly, 

a version of Helix (B) was used in advertisements for a 
prominent international conference, celebrating the 30-year 
anniversary of the discovery of the DNA helix. 


ANSWER 5-16 The packing ratio within a nucleosome core 
is 4.5 [(147 bp x 0.34 nm/bp)/(11 nm) = 4.5]. If there is an 
additional 54 bp of linker DNA, then the packing ratio for 
“beads-on-a-string” DNA is 2.3 [(201 bp x 0.34 nm/bp)/ 

(11 nm + {54 bp x 0.34 nm/bp}) = 2.3]. This first level of 
packing represents only 0.023% (2.3/10,000) of the total 
condensation that occurs at mitosis. 


Chapter 6 


ANSWER 6-1 

A. The distance between replication forks 4 and 5 is about 
280 nm, corresponding to 824 nucleotides (= 280/0.34). 
These two replication forks would collide in about 8 
seconds. Forks 7 and 8 move away from each other and 
would therefore never collide. 

B. The total length of DNA shown in the electron 
micrograph is about 1.5 um, corresponding to 4400 
nucleotides. This is only about 0.002% 


[= (4400/1.8 x 108) x 100%] of the total DNA in a fly cell. 


ANSWER 6-2 Although the process may seem wasteful, 
it is not possible to proofread during the initial stages of 
primer synthesis. To start a new primer on a piece of single- 
stranded DNA, one nucleotide needs to be put in place 
and then linked to a second, and then to a third, and so on. 
Even if these first nucleotides were perfectly matched to 
the template strand, they would bind with very low affinity, 
and it would consequently be difficult for a hypothetical 
primase with proofreading activity to distinguish the correct 
from incorrect bases; the enzyme would therefore stall. 

The task of the primase is to “just polymerize nucleotides 
that bind reasonably well to the template without worrying 
too much about accuracy.” Later, these sequences are 
removed and replaced by DNA polymerase, which uses 
newly synthesized, adjacent DNA—which has already been 
proofread—as its primer. 


ANSWER 6-3 

A. Without DNA polymerase, no replication can take place 
at all. RNA primers will be laid down at the origin of 
replication. 
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B. DNA ligase links the DNA fragments that are produced 
on the lagging strand. In the absence of ligase, the 
newly replicated DNA strands will remain as fragments, 
but no nucleotides will be missing. 

C. Without the sliding clamp, the DNA polymerase will 
frequently fall off the DNA template. In principle, it can 
rebind and continue, but the continual falling off and 
rebinding will be so time-consuming that the cell will be 
unable to divide. 

D. Inthe absence of RNA-excision enzymes, the RNA 
fragments will remain covalently attached to the newly 
replicated DNA fragments. No ligation will take place, 
because the DNA ligase will not link DNA to RNA. 

The lagging strand will therefore consist of fragments 
composed of both RNA and DNA. 

E. Without DNA helicase, the DNA polymerase will 
stall because it cannot separate the strands of the 
template DNA ahead of it. Little or no new DNA will be 
synthesized. 

F. In the absence of primase, RNA primers cannot be 
made on either the leading or the lagging strand. DNA 
replication therefore cannot begin. 


ANSWER 6-4 DNA damage by deamination and 
depurination reactions occurs spontaneously. This type 
of damage is not the result of replication errors and is 
therefore equally likely to occur on either strand. If DNA 
repair enzymes recognized such damage only on newly 
synthesized DNA strands, half of the defects would go 
uncorrected. The statement is therefore incorrect. 


ANSWER 6-5 Ifthe old strand were “repaired” using the 
new strand that contains a replication error as the template, 
then the error would become a permanent mutation in 

the genome. The old information would be erased in the 
process. Therefore, if repair enzymes did not distinguish 
between the two strands, there would be only a 50% chance 
that any given replication error would be corrected. 


ANSWER 6-6 You cannot transform an individual from 
one species into another species simply by introducing 
random changes into the DNA. It is exceedingly unlikely 
that the 5000 mutations that would accumulate every day in 
the absence of the DNA repair enzyme would be in the very 
positions where human and chimpanzee DNA sequences 
are different. It is very likely that, at such a high mutation 
frequency, many essential genes would be inactivated, 
leading to cell death. Furthermore, your body is made up 
of about 101? cells. For you to turn into an ape, not just 

one but many of these cells would need to be changed. 
And even then, many of these changes would have to occur 
during development to effect changes in your body plan 
(making your arms longer than your legs, for example). 


ANSWER 6-7 

A. False. Identical DNA polymerase molecules catalyze 
DNA synthesis on the leading and lagging strands 
of a bacterial replication fork. The replication fork 
is asymmetrical because the lagging strand is made 
in pieces while the leading strand is synthesized 
continuously. 

B. False. Okazaki fragments initially contain both RNA 
primers and DNA, but only the RNA primers are 
removed by RNA nucleases. 
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C. True. With proofreading, DNA polymerase has an error 
rate of one mistake in 10” nucleotides polymerized; 
99% of its errors are corrected by DNA mismatch repair 
enzymes, bringing the final error rate to one in 107. 

D. True. Mutations would accumulate rapidly, inactivating 
many genes. 

E. True. If a damaged nucleotide also occurred naturally 
in DNA, the repair enzyme would have no way of 
identifying the damage. It would therefore have only a 
50% chance of fixing the right strand. 

F. True. Usually, multiple mutations of specific types need 
to accumulate in a somatic cell lineage to produce a 
cancer. A mutation in a gene that codes for a DNA repair 
enzyme can make a cell more liable to accumulate these 
mutations, thereby accelerating the onset of cancer. 


ANSWER 6-8 With a single origin of replication, which 
launches two DNA polymerases in opposite directions on 
the DNA, each moving at 100 nucleotides per second, the 
number of nucleotides replicated in 24 hours will be 

1.73 x 107 (= 2 x 100 x 24 x 60 x 60). To replicate all the 
6 x 10? nucleotides of DNA in the cell in this time, 
therefore, will require at least 348 (= 6 x 109/1.73 x 107) 
origins of replication. The estimated 10,000 origins of 
replication in the human genome are therefore more than 
sufficient to satisfy this minimum requirement. 


ANSWER 6-9 

A. Dideoxycytidine triphosphate (ddCTP) is identical to 
dCTP, except it lacks the 3’-hydroxyl group on the 
sugar ring. ddCTP is recognized by DNA polymerase 
as dCTP and becomes incorporated into DNA; because 
it lacks the crucial 3'-hydroxyl group, however, its 
addition to a growing DNA strand creates a dead end 
to which no further nucleotides can be added. Thus, 
if ddCTP is added in large excess, new DNA strands 
will be synthesized until the first G (the nucleotide 
complementary to C) is encountered on the template 
strand. ddCTP will then be incorporated instead of 
C, and no further extension of this strand will occur. 
This strategy is exploited by a drug, 3’-azido-3’- 
deoxythymidine (AZT), that is now commonly used in 
HIV-infected patients to treat AIDS. AZT is converted 
in cells to the triphosphate form and is incorporated 
into the growing viral DNA. Because the drug lacks a 
3'-hydroxyl group, it blocks further DNA synthesis and 
replication of the virus. AZT inhibits viral replication 
preferentially because reverse transcriptase has a higher 
affinity for the drug than for thymidine triphosphate; 
human cellular DNA polymerases do not show this 
preference and therefore still function in the presence of 
the drug. 

B. If ddCTP is added at about 10% of the concentration 
of the available dCTP, there is a 1 in 10 chance of its 
being incorporated whenever a G is encountered 
on the template strand. Thus a population of DNA 
fragments will be synthesized, and from their lengths 
one can deduce where the G nucleotides are located 
on the template strand. This strategy forms the basis of 
methods used to determine the sequence of nucleotides 
in a stretch of DNA (discussed in Chapter 10). 

C. Dideoxycytidine monophosphate (ddCMP) lacks the 
5’-triphosphate group as well as the 3’-hydroxyl group 
of the sugar ring. It therefore cannot provide the energy 


that drives the polymerization reaction of nucleotides 
into DNA and therefore will not be incorporated into the 
replicating DNA. Addition of this compound should thus 
not affect DNA replication. 


ANSWER 6-10 See Figure A6-10. 
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Figure A6-10 


ANSWER 6-11 The two strands of the bacterial 
chromosome contain 6 x 10° nucleotides in total. During the 
polymerization of nucleoside triphosphates into DNA, 

two phosphoanhydride bonds are broken for each 
nucleotide added: the nucleoside triphosphate is hydrolyzed 
to produce the nucleoside monophosphate added to the 
growing DNA strand, and the released pyrophosphate is 
hydrolyzed to phosphate. Therefore, 1.2 x 107 high-energy 
bonds are hydrolyzed during each round of bacterial DNA 
replication. This requires 4 x 105 (= 1.2 x 107/30) glucose 
molecules, which weigh 1.2 x 10716 g [= (4 x 10° molecules) 
x (180 g/mole)/(6 x 1023 molecules/mole)], which is 0.01% 
of the total weight of the cell. 


ANSWER 6-12 The statement is correct. If the DNA 

in somatic cells is not sufficiently stable (that is, if it 
accumulates mutations too rapidly), the organism dies (of 
cancer, for example), and because this may often happen 
before the organism can reproduce, the species will die out. 
If the DNA in reproductive cells is not sufficiently stable, 
many mutations will accumulate and be passed on to future 
generations, so that the species will not be maintained. 


ANSWER 6-13 As shown in Figure A6—13, thymine 

and uracil lack amino groups and therefore cannot be 
deaminated. Deamination of adenine and guanine produces 
purine rings that are not found in conventional nucleic 
acids. In contrast, deamination of cytosine produces uracil. 
Therefore, if uracil were a naturally occurring base in DNA 
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(as it is in RNA), repair enzymes could not distinguish 
whether a uracil is the appropriate base or whether it arose 
through spontaneous deamination of cytosine. This dilemma 
is not encountered, however, because thymine, rather than 
uracil, is used in DNA. Therefore, if a uracil base is found in 
DNA, it can be automatically recognized as a damaged base 
and then excised and replaced by cytosine. 


ANSWER 6-14 

A. DNA polymerase requires a 3'-OH to synthesize DNA; 
without telomeres and telomerase, the ends of linear 
chromosomes would shrink during each round of DNA 
replication. For bacterial chromosomes, which have no 
ends, the problem does not arise; there will always be 
a 3'-OH group available to prime the DNA polymerase 
that replaces the RNA primer with DNA (Figure A6é-14). 
Telomeres and telomerase prevent the shrinking of 
chromosomes because they extend the 3’ end of the 
template DNA strand (see Figure 6-23). This extension 
of the lagging-strand template provides the “space” to 
begin the final Okazaki fragments. 

B. As shown in Figure A6é—14A, telomeres and telomerase 
are still needed even if the last fragment of the lagging 
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strand were initiated by primase at the very 3’ end of 
chromosomal DNA, inasmuch as the RNA primer must 
be removed. 


ANSWER 6-15 

A. If the single origin of replication were located exactly in 
the center of the chromosome, it would take more than 
8 days to replicate the DNA 
[= 75 x 10° nucleotides/(100 nucleotides/sec)]. The rate 
of replication would therefore severely limit the rate of 
cell division. If the origin were located at one end, the 
time required to replicate the chromosome would be 
approximately double this. 

B. Achromosome end that is not “capped” with a telomere 
would lose nucleotides during each round of DNA 
replication and would gradually shrink. Eventually, 
essential genes would be lost, and the chromosome’s 
ends might be recognized by the DNA damage-response 
mechanisms, which would stop cell division or induce cell 
death. 

C. Without centromeres, which attach mitotic chromosomes 
to the mitotic spindle, the two new chromosomes that 
result from chromosome duplication would not be 
partitioned accurately between the two daughter cells. 
Therefore, many daughter cells would die, because they 
would not receive a full set of chromosomes. 


Chapter 7 


ANSWER 7-1 Perhaps the best answer was given by 
Francis Crick himself, who coined the term in the mid-1950s: 
“| called this idea the central dogma for two reasons, | 
suspect. | had already used the obvious word hypothesis 

in the sequence hypothesis, which proposes that genetic 
information is encoded in the sequence of the DNA 

bases, and in addition | wanted to suggest that this new 
assumption was more central and more powerful... As it 
turned out, the use of the word dogma caused more trouble 
than it was worth. Many years later Jacques Monod pointed 
out to me that | did not appear to understand the correct 
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use of the word dogma, which is a belief that cannot be 
doubted. | did appreciate this in a vague sort of way but 
since | thought that all religious beliefs were without serious 
foundation, | used the word in the way | myself thought 
about it, not as the world does, and simply applied it to a 
grand hypothesis that, however plausible, had little direct 
experimental support at the time.” (Francis Crick, What Mad 
Pursuit: A Personal View of Scientific Discovery. Basic Books, 
1988.) 


ANSWER 7-2 Actually, the RNA polymerases are not 
moving at all in the micrograph, because they have been 
fixed and coated with metal to prepare the sample for 
viewing in the electron microscope. However, before they 
were fixed, they were moving from left to right, as indicated 
by the gradual lengthening of the RNA transcripts. The RNA 
transcripts are not fully extended because they begin to 
fold up and interact with proteins as they are synthesized; 
this is why they are shorter than the corresponding DNA 
segments. 


ANSWER 7-3 At first glance, the catalytic activities of 

an RNA polymerase used for transcription could replace 
the primase that operates during DNA replication. Upon 
further reflection, however, there would be some serious 
problems. (1) The RNA polymerase used to make primers 
would need to initiate every few hundred bases, which is 
much more often than promoters are spaced on the DNA. 
Initiation would therefore need to occur in a promoter- 
independent fashion or many more promoters would 

have to be present in the DNA, both of which would be 
problematic for the synthesis of mRNA. In addition, RNA 
polymerase normally begins transcription on double- 
stranded DNA, whereas the DNA replication primers are 
synthesized using single-stranded DNA. (2) Similarly, the 
RNA primers used in DNA replication are much shorter than 
mRNAs. The RNA polymerase would therefore need to 
terminate much more frequently than during transcription. 
Termination would need to occur spontaneously (i.e., 
without requiring a terminator sequence in the DNA) or else 
many more terminators would need to be present. Again, 
both of these scenarios would be problematic for mRNA 
production. Although it might be possible to overcome this 
problem if special control proteins became attached to RNA 
polymerase during replication, the problem has been solved 
by the evolution of separate enzymes with specialized 
properties. Some small DNA viruses, however, do utilize 
the host RNA polymerase to make RNA primers for their 
replication. 


ANSWER 7-4 This experiment demonstrates that, once 
an amino acid has been coupled to a tRNA, the ribosome 
will trust the tRNA and “blindly” incorporate that amino 
acid into the position according to the match between 
the codon and anticodon. We can therefore conclude that 
a significant part of the correct reading of the genetic 
code—that is, the matching of a codon in an mRNA with 
the correct amino acid—is performed by the synthetase 
enzymes that correctly match tRNAs and amino acids. 


ANSWER 7-5 The mRNA will have a 5’-to-3’ polarity, 
opposite to that of the DNA strand that serves as 

the template. Thus the mRNA sequence will read 
5'-GAAAAAAGCCGUUAA-3’. The N-terminal amino acid 
coded for by GAA is glutamic acid. UAA specifies a stop 


codon, so the C-terminal amino acid is coded for by CGU 
and is an arginine. Note that the usual convention in 
describing the sequence of a gene is to give the sequence 
of the DNA strand that is not used as a template for RNA 
synthesis; this sequence is the same as that of the RNA 
transcript, with T written in place of U. 


ANSWER 7-6 The first statement is probably correct: 
RNA is thought to have been the first self-replicating 
catalyst and, in modern cells, is no longer self-replicating. 
We can debate, however, whether this represents a “loss.” 
RNA now serves many roles in the cell: as messengers, 

as adaptors for protein synthesis, as primers for DNA 
replication, as regulators of gene expression, and as 
catalysts for some of the most important reactions, 
including RNA splicing and protein synthesis. 


ANSWER 7-7 

A. False. Ribosomes can make any protein that is specified 
by the particular mRNA that they are translating. After 
translation, ribosomes are released from the mRNA and 
can then start translating a different mRNA. It is true, 
however, that a ribosome can only make one type of 
protein at a time. 

B. False. mRNAs are translated as linear polymers; there 
is no requirement that they have any particular folded 
structure. In fact, such structures that are formed by 
mRNA can inhibit its translation, because the ribosome 
has to unfold the mRNA in order to read the message it 
contains. 

C. False. Ribosomal subunits can exchange partners after 
each round of translation. After a ribosome is released 
from an mRNA, its two subunits dissociate and enter a 
pool of free small and large subunits from which new 
ribosomes assemble around a new mRNA. 

D. False. Ribosomes are not individually enclosed in a 
membrane. 

E. False. The position of the promoter determines the 
direction in which transcription proceeds and therefore 
which of the two DNA strands is used as the template. 
Transcription of the other strand would produce an 
mRNA with a completely different (and in most cases 
meaningless) sequence. 

F. False. RNA contains uracil but not thymine. 

G. False. The level of a protein depends on its rate of 
synthesis and degradation but not on its catalytic 
activity. 


ANSWER 7-8 Because the deletion in the Lacheinmal 
mRNA is internal, it likely arose from incorrect splicing 
of the pre-mRNA. The simplest interpretation is that the 
Lacheinmal gene contains a 173-nucleotide-long exon 
(labeled “E2” in Figure A7—8), and that this exon is lost 
(“skipped”) during the processing of the mutant precursor 
mRNA (pre-mRNA). This could occur, for example, if the 
mutation changed the 3’ splice site in the preceding intron 
(“11”) so that it was no longer recognized by the splicing 
machinery (a change in the CAG sequence shown in Figure 
7-20 could do this). The snRNP would search for the next 
available 3’ splice site, which is found at the 3’ end of the 
next intron (“12”), and the splicing reaction would therefore 
remove E2 together with 11 and 12, resulting in a shortened 
mRNA. The mRNA is then translated into a defective 
protein, resulting in the Lacheinmal deficiency. 

Because 173 nucleotides do not amount to an integral 
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number of codons, the lack of this exon in the mRNA will 
shift the reading frame at the splice junction. Therefore, the 
Lacheinmal protein would be made correctly only through 
exon E1. As the ribosome begins translating sequences 

in exon E3, it will be in the wrong reading frame and will 
therefore will produce a protein sequence that is unrelated 
to the Lacheinmal sequence normally encoded by exon E3. 
Most likely, the ribosome will soon encounter a stop codon, 
which would be expected to occur on average about once in 
every 21 codons (there are 3 stop codons in the 64 codons 
of the genetic code). 


ANSWER 7-9 Sequence 1 and sequence 4 both code 
for the peptide Arg-Gly-Asp. Because the genetic code is 
redundant, different nucleotide sequences can encode the 
same amino acid sequence. 


ANSWER 7-10 

A. Incorrect. The bonds are not covalent, and their 
formation does not require an input of energy. 

B. Correct. The aminoacyl-tRNA enters the ribosome at the 
A site and forms hydrogen bonds with the codon in the 
mRNA. 

C. Correct. As the ribosome moves along the mRNA, 
the tRNAs that have donated their amino acid to 
the growing polypeptide chain are ejected from the 
ribosome and the mRNA. The ejection takes place two 
cycles after the tRNA first enters the ribosome (see 
Figure 7-37). 


ANSWER 7-11 Replication. Dictionary definition: the 
creation of an exact copy; molecular biology definition: the 
act of copying a DNA sequence. Transcription. Dictionary 
definition: the act of writing out a copy, especially from one 
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physical form to another; molecular biology definition: the 
act of copying the information stored in DNA into RNA. 
Translation. Dictionary definition: the act of putting words 
into a different language; molecular biology definition: 

the act of polymerizing amino acids into a defined linear 
sequence using the information provided by the linear 
sequence of nucleotides in mRNA. (Note that “translation” 
is also used in a quite different sense, both in ordinary 
language and in scientific contexts, to mean a movement 
from one place to another.) 


ANSWER 7-12 With four different nucleotides to choose 
from, a code of two nucleotides could specify 16 different 
amino acids (= 42), anda triplet code in which the position 
of the nucleotides is not important could specify 20 different 
amino acids (= 4 possibilities of 3 of the same bases + 

12 possibilities of 2 bases the same and one different + 

4 possibilities of 3 different bases). In both cases, these 
maximal amino acid numbers would need to be reduced by 
at least 1 because of the need to specify translation stop 
codons. It is relatively easy to envision how a doublet code 
could be translated by a mechanism similar to that used in 
our world by providing tRNAs with only two relevant bases 
in the anticodon loop. It is more difficult to envision how 
the nucleotide composition of a stretch of three nucleotides 
could be translated without regard to their order, because 
base-pairing can then no longer be used: AUG, for example, 
will not base-pair with the same anticodon as UGA. 


ANSWER 7-13 It is likely that in early cells the matching 
between codons and amino acids was less accurate than 

it is in present-day cells. The feature of the genetic code 
described in the question may have allowed early cells to 
tolerate this inaccuracy by allowing a blurred relationship 
between sets of roughly similar codons and roughly similar 
amino acids. One can easily imagine how the matching 
between codons and amino acids could have become more 
accurate, step by step, as the translation machinery evolved 
into that found in modern cells. 


ANSWER 7-14 The codon for Trp is 5'-UGG-3’. Thus 

a normal tRNA"? contains the sequence 5'-CCA-3’ as 

its anticodon (see Figure 7-33). If this tRNA contains a 
mutation so that its anticodon is changed to UCA, it will 
recognize a UGA codon and lead to the incorporation of 

a tryptophan instead of causing translation to stop. Many 
other protein-encoding sequences, however, contain UGA 
codons as their natural stop sites, and these stops would 
also be affected by the mutant tRNA. Depending on the 
competition between the altered tRNA and the normal 
translation release factors (Figure 7-41), some of these 
proteins would be made with additional amino acids at their 
C-terminal end. The additional lengths would depend on the 
number of codons before the ribosomes encounter a non- 
UGA stop codon in the mRNA in the reading frame in which 
the protein is translated. 


ANSWER 7-15 One effective way of driving a reaction to 
completion is to remove one of the products, so that the 
reverse reaction cannot occur. ATP contains two high-energy 
bonds that link the three phosphate groups. In the reaction 
shown, PP; is released, consisting of two phosphate groups 
linked by one of these high-energy bonds. Thus PP; can be 
hydrolyzed with a considerable gain of free energy, and 
thereby can be efficiently removed. This happens rapidly in 
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cells, and reactions that produce and further hydrolyze PP; 
are therefore virtually irreversible (see Figure 3—41). 


ANSWER 7-16 

A. A titin molecule is made of 25,000 (3,000,000/120) amino 
acids. It therefore takes about 3.5 hours 
[(25,000/2 ) x (1/60) x (1/60)] to synthesize a single 
molecule of titin in muscle cells. 

B. Because of its large size, the probability of making a titin 
molecule without any mistakes is only 0.08 
[= (1 — 10~4)25.000); that is, only 8 in 100 titin molecules 
synthesized are free of mistakes. In contrast, over 97% 
of newly synthesized proteins of average size are made 
correctly. 

C. The error rate limits the sizes of proteins that can be 
synthesized accurately. If a eukaryotic ribosomal protein 
were synthesized as a single molecule, a large portion 
(87%) of this hypothetical giant ribosomal protein 
would be expected to contain at least one mistake. 

It is therefore more advantageous to make ribosomal 
proteins individually, because in this way only a small 
proportion of each type of protein will be defective, and 
these few bad molecules can be individually eliminated 
by proteolysis to ensure that there are no defects in the 
ribosome as a whole. 

D. To calculate the time it takes to transcribe a titin mRNA, 
you would need to know the size of its gene, which is 
likely to contain many introns. Transcription of the exons 
alone (25,000 x 3 = 75,000 nucleotides) requires about 
42 minutes [(75,000/30) x (1/60)]. Because introns can 
be quite large, the time required to transcribe the entire 
gene is likely to be considerably longer. 


ANSWER 7-17 Mutations of the type described in (B) and 
(D) are often the most harmful. In both cases, the reading 
frame would be changed, and because this frameshift 
occurs near the beginning or in the middle of the coding 
sequence, much of the protein will contain a nonsensical 
and/or truncated sequence of amino acids. In contrast, 

a reading-frame shift that occurs toward the end of the 
coding sequence, as described in (A), will result in a largely 
correct protein that may be functional. Deletion of three 
consecutive nucleotides, as described in (C), leads to the 
deletion of an amino acid but does not alter the reading 
frame. The deleted amino acid may or may not be important 
for the folding or activity of the protein; in many cases, 
such mutations are silent—that is, they have no or only 
minor consequences for the organism. Substitution of 

one nucleotide for another, as in (E), is often completely 
harmless. In some cases, it will not change the amino acid 
sequence of the protein; in other cases, it will change a 
single amino acid; at worst, it may create a new stop codon, 
giving rise to a truncated protein. 


3’ end of template DNA strand 


Figure A7-18 


ANSWER 7-18 The RNA transcripts that are growing from 
the DNA template like bristles on a bottlebrush tend to be 
shorter at the left-hand side of each gene and longer on 
the right-hand side. Because RNA polymerase synthesizes 
in the 5'-to-3' direction it must move along the DNA 
template strand in the 3'-to-5' direction (see Figure 7-7). 
The longest RNAs, therefore, should appear at the 5’ end of 
the template strand—when transcription is nearly complete. 
Hence the 3’ end of the template strand is toward the left of 
the image (Figure A7—18). The RNA transcripts, meanwhile, 
are synthesized in the 5’-to-3’ direction. Thus, the 5’ end of 
each transcript can be found at the end of each bristle (see 
Figure A7—18); the 3’ end of each transcript can be found 
within the RNA polymerase molecules that dot the spine of 
the DNA template molecule. 


Chapter 8 


ANSWER 8-1 

A. Transcription of the tryptophan operon would no longer 
be regulated by the absence or presence of tryptophan; 
the enzymes would be permanently turned on in 
scenarios (1) and (2) and permanently shut off in scenario 
(3). 

B. In scenarios (1) and (2), the normal tryptophan 
repressor molecules would restore the regulation of 
the tryptophan biosynthesis enzymes. In contrast, 
expression of the normal protein would have no effect 
in scenario (3), because the tryptophan operator would 
remain occupied by the mutant protein, even in the 
presence of tryptophan. 


ANSWER 8-2 Contacts can form between the protein 
and the edges of the base pairs that are exposed in the 
major groove of the DNA (Figure A8-2). These sequence- 
specific contacts can include hydrogen bonds with the 
highlighted oxygen, nitrogen, and hydrogen atoms, as 

well as hydrophobic interactions with the methyl group on 
thymine (yellow). Note that the arrangement of hydrogen- 
bond donors (blue) and hydrogen-bond acceptors (red) of 
a T-A pair is different from that of a C-G pair. Similarly, the 
arrangements of hydrogen-bond donors and hydrogen- 
bond acceptors of A-T and G-C pairs are different from one 
another and from the two pairs shown in the figure. These 
differences allow recognition of specific DNA sequences 
via the major groove. In addition to the contacts shown in 
the figure, electrostatic attractions between the positively 
charged amino acid side chains of the protein and the 
negatively charged phosphate groups in the DNA backbone 
usually stabilize DNA-protein interactions. Finally, some 
DNA-binding proteins also contact bases from the minor 
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groove (see Figure 8-4). The minor groove, however, 
contains fewer features that distinguish one base from 
another than does the major groove. 


ANSWER 8-3 Bending proteins can help to bring distant 
DNA regions together that normally would contact each 
other only inefficiently (Figure A8-3). Such proteins are 
found in both prokaryotes and eukaryotes and are involved 
in many examples of transcriptional regulation. 
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ANSWER 8-4 

A. UV light throws the switch from the prophage to the 
lytic state: when cl protein is destroyed, Cro is made and 
turns off the further production of cl. The virus produces 
coat proteins, and new virus particles are made. 

B. When the UV light is switched off, the virus remains 
in the lytic state. Thus, cl and Cro form a transcription 
switch that “memorizes” its previous setting. 

C. This switch makes sense in the viral life cycle: UV light 
tends to damage the bacterial DNA (see Figure 6—25), 
thereby rendering the bacterium an unreliable host for 
the virus. A prophage will therefore switch to the lytic 
state and leave the “sinking ship” in search of new host 
cells to infect. 


ANSWER 8-5 

A. True. Prokaryotic mRNAs are often transcripts of entire 
operons. Ribosomes can initiate translation at the 
internal AUG start sites of these “polycistronic” mRNAs 
(see Figures 7—40 and 8-6). 

B. True. The major groove of double-stranded DNA is 
sufficiently wide to allow a protein surface, such as 
one face of an a helix, access to the base pairs. The 
sequence of H-bond donors and acceptors in the major 
groove can then be “read” by the protein to determine 
the sequence of the DNA (see Figure A8-2). 

C. True. It is advantageous to exert control at the earliest 
possible point in a pathway. This conserves metabolic 
energy because unnecessary products are not made. 


ANSWER 8-6 From our knowledge of enhancers, one 
would expect their function to be relatively independent of 
their distance from the promoter—possibly weakening as 
this distance increases. The surprising feature of the data 
(which have been adapted from an actual experiment) is 
the periodicity: the enhancer is maximally active at certain 
distances from the promoter (50, 60, or 70 nucleotides), 

but almost inactive at intermediate distances (55 or 65 
nucleotides). The periodicity of 10 suggests that the mystery 
can be explained by considering the structure of double- 
helical DNA, which has 10 base pairs per turn. Thus, placing 
an enhancer on the side of the DNA opposite to that of the 
promoter (Figure A8-6) would make it more difficult for the 
activator that binds to it to interact with the proteins bound 
at the promoter. At longer distances, there is more DNA to 
absorb the twist, and the effect is diminished. 
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ANSWER 8-7 The affinity of the dimeric 4 repressor for 
its binding site depends on the interactions made by each 
of the two DNA-binding domains. A single DNA-binding 
domain can make only half the contacts and therefore 
provide just half the binding energy compared with the 
dimer. Although cleavage of the repressor does not change 
the concentration of binding domains, the affinity that each 
repressor monomer has for DNA is sufficiently weak that the 
repressors do not remain bound. As a result, the genes for 
lytic growth are turned on. 


ANSWER 8-8 The function of these Arg genes is to 
encode the enzymes that synthesize arginine. When arginine 
is abundant, expression of these genes should be turned 
off. If ArgR acts as a gene repressor (which it does in 

reality), then binding of arginine should increase its affinity 
for its regulatory sites, allowing it to bind and shut off gene 
expression. If ArgR acted as a gene activator instead, then 
the binding of arginine would be predicted to reduce its 
affinity for its regulatory DNA, preventing its binding and 
thereby shutting off expression of the Arg genes. 


ANSWER 8-9 The results of this experiment favor DNA 
looping, which should not be affected by the protein bridge 
(so long as it allowed the DNA to bend, which it does). The 
scanning or entry-site model, however, is predicted to be 
affected by the nature of the linkage between the enhancer 
and the promoter. If the proteins enter at the enhancer 

and scan to the promoter, they would have to traverse the 
protein linkage. If such proteins are geared to scan on DNA, 
they would likely have difficulty scanning across such a 
barrier. 


ANSWER 8-10 The most definitive result is one showing 
that a single differentiated cell taken from a specialized 
tissue can re-create a whole organism. This proves that 

the cell must contain all the information required to produce 
a whole organism, including all of its specialized cell 

types. Experiments of this type are summarized in 

Figure 8-2. 


ANSWER 8-11 In principle, you could create 16 different 
cell types with 4 different transcription regulators (all the 

8 cell types shown in Figure 8-17, plus another 8 created 
by adding an additional transcription regulator). MyoD by 
itself is sufficient to induce muscle-specific gene expression 
only in certain cell types, such as some kinds of fibroblasts. 
The action of MyoD is therefore consistent with the model 
shown in Figure 8-17: if muscle cells were specified, for 
example, by the combination of transcription regulators 

1, 3, and MyoD, then the addition of MyoD would convert 
only two of the cell types of Figure 8-17 (cells F and H) to 
muscle. 


ANSWER 8-12 The induction of a transcriptional activator 
protein that stimulates its own synthesis creates a positive 
feedback loop that can produce cell memory. The continued 
self-stimulated synthesis of activator A can in principle last 
for many cell generations, serving as a constant reminder 
of an event that took place in the past. By contrast, the 
induction of a transcriptional repressor that inhibits its own 
synthesis creates a negative feedback loop that ensures 
that the response to the transient stimulus will be similarly 
transient. Because repressor R shuts off its own synthesis, 
the cell will quickly return to the state that existed before 
the signal. 


ANSWER 8-13 Many transcription regulators are 
continually made in the cell; that is, their expression is 
constitutive and the activity of the protein is controlled 

by signals from inside or outside the cell (e.g., the 
availability of nutrients, as for the tryptophan repressor, 

or by hormones, as for the glucocorticoid receptor). 

In this way, the transcriptional program is adjusted to 

the physiological needs of the cell. Moreover, a given 
transcription regulator usually controls the expression of 
many different genes. Transcription regulators are often 
used in various combinations and can affect each other's 
activity, thereby further increasing the possibilities for 
regulation with a limited set of proteins. Nevertheless, most 
cells devote a large fraction of their genomes to the control 
of transcription: about 10% of protein-coding genes in 
eukaryotic cells code for transcription regulators. 


Chapter 9 


ANSWER 9-1 When it comes to genetic information, a 
balance must be struck between stability and change. If the 
mutation rate were too high, a species would eventually die 
out because all its individuals would accumulate mutations 
in genes essential for survival. And for a species to be 
successful—in evolutionary terms—individual members must 
have a good genetic memory; that is, there must be high 
fidelity in DNA replication. At the same time, occasional 
changes are needed if the species is to adapt to changing 
conditions. If the change leads to an improvement, it 

will persist by selection; if it is neutral, it may or may 

not accumulate; but if the change proves disastrous, the 
individual organism that was the unfortunate subject of 
nature's experiment will die, but the species will survive. 


ANSWER 9-2 In single-celled organisms, the genome is 
the germ line and any modification is passed on to the next 
generation. By contrast, in multicellular organisms, most 

of the cells are somatic cells and make no contribution to 
the next generation; thus, modification of those cells by 
horizontal gene transfer would have no consequence for the 
next generation. The germ-line cells are usually sequestered 
in the interior of multicellular organisms, minimizing their 
contact with foreign cells, viruses, and DNA, thus insulating 
the species from the effects of horizontal gene transfer. 
Nevertheless, horizontal gene transfer is possible for 
multicellular organisms. For example, the genomes of some 
insect species contain DNA that was horizontally transferred 
from bacteria that infect them. 


ANSWER 9-3 Itis extremely unlikely that any gene 
came into existence perfectly optimized for its function. 
Ribosomal RNA sequences have been highly conserved 
because this molecule plays such an important role in 
protein synthesis in the cell. Nonetheless, the environment 
an organism finds itself in is changeable, so no gene can 
be optimal indefinitely. Thus we find there are indeed 
significant differences in ribosomal RNAs among species. 


ANSWER 9-4 Each time another copy of a transposon 

is inserted into a chromosome, the change can be either 
neutral, beneficial, or detrimental for the organism. Because 
individuals that accumulate detrimental insertions would 

be selected against, the proliferation of transposons is 
controlled by natural selection. If a transposon arose that 


proliferated uncontrollably, it is unlikely that a viable host 
organism could be maintained. For this reason, most 
transposons move only rarely. Many transposons, for 
example, synthesize only infrequent bursts of very small 
amounts of the transposase that is required for their 
movement. 


ANSWER 9-5 Viruses cannot exist as free-living 
organisms: they have no internal metabolism, and cannot 
reproduce themselves. They thus have none of the 
attributes that one normally associates with life. Indeed, 
they can even be crystallized. Only inside cells can they 
redirect normal cellular biosynthetic activities to the task of 
making more copies of themselves. Thus, the only aspect of 
“living” that viruses display is their capacity to direct their 
own reproduction once inside a cell. 


ANSWER 9-6 Although they can harm individuals, 
mobile genetic elements do provide opportunities for 
homologous recombination events, thereby causing 
genomic rearrangements. They could insert into genes, 
possibly obliterating splicing signals and thereby changing 
the protein produced by the gene. They could also insert 
into the regulatory DNA sequences of a gene, where 
insertion between an enhancer and a transcription start 
site could block the function of the enhancer and therefore 
reduce the level of expression of a gene. In addition, the 
mobile genetic element could itself contain an enhancer and 
thereby change the time and place in the organism where 
the gene is expressed. 


ANSWER 9-7 With their ability to facilitate genetic 
recombination, mobile genetic elements have almost 
certainly played an important part in the evolution of 
modern-day organisms. They can facilitate gene duplication 
and the creation of new genes via exon shuffling, and they 
can change the way in which existing genes are expressed. 
Although the transposition of a mobile genetic element 
can be harmful for an individual organism—if, for example, 
it disrupts the activity of a critical gene—these agents of 
genetic change may well be beneficial to the species as a 
whole. 


ANSWER 9-8 About 7.6% of each gene is converted 

to mRNA [(5.4 exons/gene x 266 nucleotide pairs/exon)/ 
(19,000 nucleotide pairs/gene) = 7.6%]. Protein-coding 
genes occupy about 28% of Chromosome 22 [(700 genes x 
19,000 nucleotide pairs/gene)/(48 x 10° nucleotide pairs) = 
27.7%]. However, over 90% of this DNA is made of introns. 


(A) POSITIONS OF HUMAN B-GLOBIN EXONS 
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ANSWER 9-9 This statement is probably true. For 
example, nearly half our DNA is composed of defunct 
mobile genetic elements. And only about 10% of the human 
genome appears to be under positive selection. However, 

it is possible that future research will uncover functions for 
some portion of our DNA that now seems unimportant. 


ANSWER 9-10 The HoxD cluster is packed with complex 
and extensive regulatory DNA sequences that direct each 
of its genes to be expressed at the correct time and place 
during development. Insertions of mobile genetic elements 
into the HoxD cluster were probably selected against 
because they would disrupt proper regulation of these 
genes. 


ANSWER 9-11 

A. The exons in the human -globin gene correspond to 
the positions of sequence similarity (in this case identity) 
with the cDNA, which is a direct copy of the mRNA and 
thus contains no introns. The introns correspond to the 
regions between the exons. The positions of the introns 
and exons in the human B-globin gene are indicated 
in Figure A9-11A. Also shown (in open bars) are 
sequences present in the mature B-globin mRNA (and in 
the gene) that are not translated into protein. 

B. From the positions of the exons, as defined in Figure 
A9-11A, it is clear that the first two exons of the 
human -globin gene have counterparts, with similar 
sequence, in the mouse -globin gene (Figure A9-11B). 
However, only the first half of the third exon of the 
human B-globin gene is similar to the mouse B-globin 
gene. The similar portion of the third exon contains 
sequences that encode protein, whereas the portion that 
is different represents the 3’ untranslated region of the 
gene. Because this portion of the gene does not encode 
protein (nor does it contain extensive regulatory DNA 
sequences), its sequence is probably not constrained 
and the mouse and human sequences have drifted apart. 

C. The human and mouse -globin genes are also similar at 
their 5’ ends, as indicated by the cluster of points along 
the same diagonal as the first exon (Figure A9-11B). 
These sequences correspond to the regulatory DNA 
sequences upstream of the start sites for transcription. 
Functional sequences, which are under selective 
pressure, diverge much more slowly than sequences 
without function. 

D. The diagon plot shows that the first intron, although it is 
not conserved in sequence, it is nearly the same length 
in the human and mouse genes; however, the length of 
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the second intron is noticeably different (Figure A9-11B). 
If the introns were the same length, the line segments 
that represent sequence similarity would fall on the same 
diagonal. The easiest way to test for the colinearity of 
the line segments is to tilt the page and sight along the 
diagonal. It is impossible to tell from this comparison if 
the change in length is due to a shortening of the mouse 
intron or to a lengthening of the human intron, or some 
combination of those possibilities. 


ANSWER 9-12 Computer algorithms that search for exons 
are complex, as you might imagine. To identify unknown 
genes, these programs combine statistical information 
derived from known genes, such as: 

1. An exon that encodes protein will have an open reading 
frame. If the amino acid sequence specified by this 
open reading frame matches a protein sequence in any 
database, there is a high likelihood that it is an authentic 
exon. 

2. The reading frames of adjacent exons in the same gene 
will match up when the intron sequences are omitted. 

3. Internal exons (excluding the first and the last) will have 
splicing signals at each end; most of the time (~98%) 
these will be AG at the 5’ ends of the exons and GT at 
the 3’ ends. 

4. The multiple codons for most individual amino acids are 
not used with equal frequency. This so-called coding 
bias, which varies from one species to the next, can be 
factored in to aid in the recognition of true exons. 

5. Exons and introns have characteristic length 
distributions. The median length of exons in human 
genes is about 120 nucleotide pairs. Introns tend to be 
much larger: a median length of about 2 kb in genomic 
regions of 30-40% GC content, and a median length of 
about 500 nucleotide pairs in regions above 50% GC. 

6. The initiation codon for protein synthesis (nearly always 
an ATG) has a statistical association with adjacent 
nucleotides that seem to enhance its recognition by 
translation factors. 

7. The terminal exon will have a signal (most commonly 
AATAAA) for cleavage and polyadenylation close to its 
3’ end. 


The statistical nature of these features, coupled with the 
low frequency of coding information in the genome (1.5% 
for humans) and the high frequency of alternative splicing 
(estimated to occur in 95% of human genes), makes it 
difficult for an algorithm to correctly identify all exons. As 
shown in Figure 9-36, these bioinformatic approaches are 
usually coupled with direct experimental data, such as those 
obtained from full-genome RNA sequencing (RNA-Seq). 


ANSWER 9-13 It is often not a simple matter to determine 
the function of a gene, nor is there a universal recipe for 
doing so. Nevertheless, there are a variety of standard 
questions whose answers help to narrow down the 
possibilities. Below we list some of these questions. 

In what tissues is the gene expressed? If the gene is 
expressed in all tissues, it is likely to have a general function. 
If it is expressed in one or a few tissues, its function is 
likely to be more specialized, perhaps related to the 
specific functions of the tissues. If the gene is expressed 
in the embryo but not the adult, it probably functions in 
development. 

In what compartment of the cell is the protein found? 
Knowing the subcellular localization of the protein—nucleus, 


plasma membrane, mitochondria, etc.—can also help to rule 
out or support potential functions. For example, a protein 
that is localized to the plasma membrane is likely to be a 
transporter, a receptor or other component of a signaling 
pathway, a cell adhesion molecule, etc. 

What are the effects of mutations in the gene? Mutations 
that eliminate or modify the function of the gene product 
can provide important clues to function. For example, 
if the gene product is critical at a certain time during 
development, mutant embryos will often die at that stage or 
develop obvious abnormalities. 

With what other proteins does the encoded protein 
interact? In carrying out their function, proteins often 
interact with other proteins involved in the same or closely 
related processes. If an interacting protein can be identified, 
and if its function is already known (through previous 
research or through the searching of databases), the range 
of possible functions can often be narrowed. 

Can mutations in other genes alter effects of mutation in 
the unknown gene? Searching for such mutations can be 
a very powerful approach to investigating gene function, 
especially in organisms such as bacteria and yeast, which 
have simple genetic systems. Although much more 
difficult to perform in the mouse, this type of approach 
can nonetheless be used. The rationale for this strategy 
is analogous to that of looking for interacting proteins: 
genes that interact genetically—so that the double- 
mutant phenotype is more selective than either of the 
individual mutants—are often involved in the same process 
or in closely related processes. Identification of such an 
interacting gene (and knowledge of its function) would 
provide an important clue to the function of the unknown 
gene. 

Addressing each of these questions requires specialized 
experimental expertise and a substantial time commitment 
from the investigator. It is no wonder that progress is made 
much more rapidly when a clue to a gene's function can be 
found simply by identifying a similar gene of known function 
in the database. As more and more genes are studied, this 
strategy will become increasingly successful. 


ANSWER 9-14 In along, random sequence of DNA, each 
of the 64 different codons will occur with equal frequency. 

Because 3 of the 64 are stop codons, they will be expected 
to occur on average every 21 codons (64/3 = 21.3). 


ANSWER 9-15 All of these mechanisms contribute to 

the evolution of new protein-coding genes. A, C, D, and 

E were discussed in the text. Recent studies indicate that 
certain short protein-coding genes arose from previously 
untranslated regions of genomes, so choice B is also correct. 


ANSWER 9-16 

A. Because synonymous changes do not alter the amino 
acid sequence of the protein, they usually do not affect 
the overall fitness of the organism and are therefore not 
selected against. By contrast, nonsynonymous changes, 
which substitute a new amino acid in place of the original 
one, can alter the function of the encoded protein and 
change the fitness of the organism. Since most amino 
acid substitutions probably harm the protein, they tend 
to be selected against. 

B. Virtually all amino acid substitutions in the histone 
H3 protein are deleterious and are therefore selected 
against. The extreme conservation of histone H3 argues 


that its function is very tightly constrained, probably 
because of extensive interactions with other proteins 
and with DNA. 

C. Histone H3 is clearly not in a “privileged” site in the 
genome because it undergoes synonymous nucleotide 
changes at about the same rate as other genes. 


ANSWER 9-17 

A. The data embodied in the phylogenetic tree (Figure 
Q9-17) refutes the hypothesis that plant hemoglobin 
genes were acquired by horizontal transfer from animals. 
Looking at the more familiar parts of the tree, we see 
that the hemoglobins of vertebrates (fish to human) have 
approximately the same phylogenetic relationships as 
do the species themselves. Plant hemoglobins also form 
a distinct group that displays accepted evolutionary 
relationships, with barley, a monocot, diverging before 
bean, alfalfa, and lotus, which are all dicots (and 
legumes). The basic hemoglobin gene, therefore, was 
in place long ago in evolution. The phylogenetic tree of 
Figure Q9-17 indicates that the hemoglobin genes in 
modern plant and animal species were inherited from a 
common ancestor. 

B. Had the plant hemoglobin genes arisen by horizontal 
transfer from a nematode, then the plant sequences 
would have clustered with the nematode sequences in 
the phylogenetic tree in Figure Q9-17. 


ANSWER 9-18 In each human lineage, new mutations will 
be introduced at a rate of 1071? alterations per nucleotide 
per cell generation, and the differences between two human 
lineages will accumulate at twice this rate. To accumulate 
10-3 differences per nucleotide will thus take 10-3/ 

(2 x 10710) cell generations, corresponding to (1/200) x 
10-3/(2 x 10-19) = 25,000 human generations, or 750,000 
years. In reality, we are not descended from one pair of 
genetically identical ancestral humans; rather, it is likely that 
we are descended from a relatively small founder population 
of humans who were already genetically diverse. More 
sophisticated analysis suggests that this founder population 
existed about 200,000 years ago. 


ANSWER 9-19 The virus that causes AIDS in humans, 
HIV, is a retrovirus, and thus synthesizes DNA from an RNA 
template using reverse transcriptase. This leads to frequent 
mutation of the viral genome. In fact, people who are HIV- 
positive often carry many different genetic variants of HIV 
that are distinct from the original virus that infected them. 
This posed a problem in treating the infection: drugs that 
block essential viral enzymes would work only temporarily, 
because new strains of the virus resistant to these drugs 
arose rapidly by mutation. Today’s strategy employs 
multiple drugs simultaneously, which greatly decreases the 
likelihood that a fully resistant mutant virus could arise. 
Like reverse transcriptases, RNA replicases (enzymes that 
synthesize RNA using RNA as a template) do not proofread. 
Thus, RNA viruses that replicate their RNA genomes directly 
(that is, without using DNA as an intermediate) also mutate 
frequently. In such a virus, this tends to produce changes 
in the coat proteins that cause the mutated virus to appear 
“new” to our immune systems; the virus is therefore not 
suppressed by immunity that has arisen to the previous 
version. This is part of the explanation for the new strains 
of the influenza (flu) virus and the common cold virus that 
regularly appear. 
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Chapter 10 


ANSWER 10-1 The presence of a mutation in a gene does 
not necessarily mean that the protein expressed from it is 
defective. For example, the mutation could change one 
codon into another that still specifies the same amino acid, 
and so does not change the amino acid sequence of the 
protein. Or, the mutation may cause a change from one 
amino acid to another in the protein, but in a position that 
is not important for the folding or function of the protein. 
In assessing the likelihood that such a mutation might cause 
a defective protein, information on the known -globin 
mutations that are found in humans is essential. You would 
therefore want to know the precise nucleotide change 

in your mutant gene, and whether this change has any 
known or predictable consequences for the function of the 
encoded protein. If your mate has two normal copies of the 
globin gene, 50% of your children would be carriers of your 
mutant gene. 


ANSWER 10-2 

A. Digestion with EcoRI produces two products: 
S'-AAGAATTGCGG + AATTCGGGCCTTAAGCGCCGCGTCGAGGCCTTAAA-3 
3'-TTCTTAACGCCTTAA  GCCCGGAATTCGCGGCGCAGCTCCGGAATTT-5’ 

B. Digestion with Haelll produces three products: 
5'-AAGAATTGCGGAATTCGGG = CCTTAAGCGCCGCGTCGAGG CCTTAAA-3' 
3'-TTCTTAACGCCTTAAGCCC GGAATTCGCGGCGCAGCTCC GGAATTT-5' 
C. The sequence lacks a HindIll cleavage site. 

D. Digestion with all three enzymes therefore produces: 
5'-AAGAATTGCGG AATTCGGG CCTTAAGCGCCGCGTCGAGG CCTTAAA-3' 
3'-TTCTTAACGCCTTAA GCCC GGAATTCGCGGCGCAGCTCC GGAATTT-5' 


ANSWER 10-3 Protein biochemistry is still very important 
because it provides the link between the amino acid 
sequence (which can be deduced from DNA sequences) 

and the functional properties of the protein. We are still not 
able to infallibly predict the folding of a polypeptide chain 
from its amino acid sequence, and in most cases information 
regarding the function of the protein, such as its catalytic 
activity, cannot be deduced from the gene sequence alone. 
Instead, such information must be obtained experimentally 
by analyzing the properties of proteins biochemically. 
Furthermore, the structural information that can be deduced 
from DNA sequences is necessarily incomplete. We cannot, 
for example, accurately predict covalent modifications of 
the protein, proteolytic processing, the presence of tightly 
bound small molecules, or the association of the protein 
with other subunits. Moreover, we cannot accurately predict 
the effects these modifications might have on the activity of 
the protein. 


ANSWER 10-4 

A. After an additional round of amplification there will be 2 
gray, 4 green, 4 red, and 22 yellow-outlined fragments; 
after a second additional round there will be 2 gray, 5 
green, 5 red, and 52 yellow-outlined fragments. Thus the 
DNA fragments outlined in yellow increase exponentially 
and will eventually overrun the other reaction products. 
Their length is determined by the DNA sequence that 
spans the distance between the two primers plus the 
length of the primers. 

B. The mass of one DNA molecule 500 nucleotide pairs 
long is 5.5 x 1071? g [= 2 x 500 x 330 (g/mole)/6 x 107% 
(molecules/mole)]. Ignoring the complexities of the first 
few steps of the amplification reaction (which produce 
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longer products that eventually make an insignificant 
contribution to the total DNA amplified), this amount of 
product approximately doubles for every amplification 
step. Therefore, 100 x 107? g = 2N x 5.5 x 1071? g, 
where N is the number of amplification steps of the 
reaction. Solving this equation for N = log(1.81 x 1011)/ 
log(2) gives N = 37.4. Thus, only about 40 cycles of 

PCR amplification are sufficient to amplify DNA from a 
single molecule to a quantity that can be readily handled 
and analyzed biochemically. This whole procedure is 
automated and takes only a few hours in the laboratory. 


ANSWER 10-5 If the ratio of dideoxyribonucleoside 
triphosphates to deoxyribonucleoside triphosphates 

is increased, DNA polymerization will be terminated 
more frequently and thus shorter DNA strands will be 
produced. Such conditions are favorable for determining 
nucleotide sequences that are close to the DNA 

primer used in the reaction. In contrast, decreasing 

the ratio of dideoxyribonucleoside triphosphates to 
deoxyribonucleoside triphosphates will produce longer 
DNA fragments, thus allowing one to determine nucleotide 
sequences more distant from the primer. 


ANSWER 10-6 Although several explanations are 
possible, the simplest is that the DNA probe has hybridized 
predominantly with its corresponding mRNA, which is 
typically present in many more copies per cell than is the 
gene. The different extents of hybridization probably reflect 
different levels of gene expression. Perhaps each of the 
different cell types that make up the tissue expresses the 
gene at a different level. 


ANSWER 10-7 Like the vast majority of mammalian genes, 
the attractase gene likely contains introns. Bacteria do not 
have the splicing machinery required to remove introns, 
and therefore the correct protein would not be expressed 
from the gene. For expression of most mammalian genes in 
bacterial cells, a cDNA version of the gene must be used. 


ANSWER 10-8 

A. False. Restriction sites are found at random throughout 
the genome, within as well as between genes. 

B. True. DNA bears a negative charge at each phosphate, 
giving DNA an overall negative charge. 

C. False. Clones isolated from cDNA libraries do not 
contain promoter sequences. These sequences are not 
transcribed and are therefore not part of the mRNAs 
that are used as the templates to make cDNAs. 

D. True. Each polymerization reaction produces double- 
stranded DNA that must, at each cycle, be denatured to 
allow new primers to hybridize so that the DNA strand 
can be copied again. 

E. False. Digestion of genomic DNA with restriction 
enzymes that recognize four-nucleotide sequences 
produces fragments that are on average 256 nucleotides 


long. However, the actual lengths of the fragments 
produced will vary considerably on both sides of the 
average. 

F. True. Reverse transcriptase is first needed to copy the 
mRNA into single-stranded DNA, and DNA polymerase 
is then required to make the second DNA strand. 

G. True. Using a sufficient number of STRs, individuals can 
be uniquely “fingerprinted” (see Figure 10-15). 

H. True. If cells of the tissue do not transcribe the gene 
of interest, it will not be represented in a cDNA 
library prepared from this tissue. However, it will be 
represented in a genomic library prepared from the 
same tissue. 


ANSWER 10-9 

A. The DNA sequence, from its 5’ end to its 3’ end, is read 
starting from the bottom of the gel, where the smallest 
DNA fragments migrate. Each band results from the 
incorporation of the appropriate dideoxyribonucleoside 
triphosphate, and as expected there are no two 
bands that have the same mobility. This allows one to 
determine the DNA sequence by reading off the bands 
in strict order, proceeding upward from the bottom of 
the gel, and assigning the correct nucleotide according 
to which lane the band is in. 

The nucleotide sequence of the top strand (Figure 
A10-9A) was obtained directly from the data of Figure 
Q10-9, and the bottom strand was deduced from the 
complementary base-pairing rules. 

B. The DNA sequence can then be translated into an amino 
acid sequence using the genetic code. However, there 
are two strands of DNA that could be transcribed into 
RNA and three possible reading frames for each strand. 
Thus there are six amino acid sequences that can in 
principle be encoded by this stretch of DNA. Of the 
three reading frames possible from the top strand, only 
one is not interrupted by a stop codon (underlined in the 
DNA sequence and represented by yellow blocks in the 
three amino acid sequences in Figure A10-9B). 

From the bottom strand, two of the three reading 
frames also have stop codons (not shown). The third 
frame gives the following sequence: 
SerAlaLeuGlySerSerGluAsnArgProArgThrProAlaArg 
ThrGlyCysProValTyr 

It is not possible from the information given to tell 
which of the two open reading frames corresponds to 
the actual protein encoded by this stretch of DNA. What 
additional experiment could distinguish between these 
two possibilities? 


ANSWER 10-10 

A. Cleavage of human genomic DNA with Haelll would 
generate about 11 x 10° different fragments 
[= 3 x 109/44] and with EcoRI about 730,000 different 
fragments [= 3 x 109/49]. There will also be some 


(A) 5’ TATAAACTGGACAACCAGTTCGAGCTGGTGTTCGTGGTCGGTTTTCAGAAGATCCTAACGCTGACG 3’ 
3’ ATATTTGACCTGTTGGTCAAGCTCGACCACAAGCACCAGCCAAAAGTCTTCTAGGATTGCGACTGC 5’ 


(B) top strand of DNA 


5’ TATAAACTGGACAACCAGTTCGAGCTGGTGTTCGTGGTCGGTTTTCAGAAGATCCTAACGCTGACG 3’ 
TyrLysLeuAspAsnGlnPheGluLeuVal PheValValGlyPheGlnLysIleLeuThrLeuThr 


Figure A10-9 


1 
2 ITleAsnTrpThrThrSerSerSerTrpCysSerTrpSerVal PheArgArgSer Mi Arg EE Ar 
3 MS ThrGlyGlnProValArgAlaGlyValArgGlyArgPheSerGluAspProAsnAlaAsp 


additional fragments generated because the maternal 
and paternal chromosomes are very similar but not 
identical in DNA sequence. 

B. A set of overlapping DNA fragments will be generated. 
Libraries constructed from sets of overlapping fragments 
are valuable because they can be used to order cloned 
sequences in relation to their original order in the 
genome and thus obtain the DNA sequence of a long 
stretch of DNA (see Figure 10-20). 


ANSWER 10-11 By comparison with the positions of the 
size markers, we find that EcoRI treatment gives two 
fragments of 4 kb and 6 kb; Hindlll treatment gives one 
fragment of 10 kb; and treatment with EcoRI + HindIll gives 
three fragments of 6 kb, 3 kb, and 1 kb. This gives a total 
length of 10 kb calculated as the sum of the fragments in 
each lane. Thus the original DNA molecule must be 

10 kb (10,000 nucleotide pairs) long. Because treatment 
with Hindlll gives a fragment 10 kb long it could be that 
the original DNA is a linear molecule with no cutting site for 
HindIll. But we can rule that out by the results of the EcoRI 
+ Hindlll digestion. We know that EcoRI cleavage alone 
produces two fragments of 6 kb and 4 kb, and in the double 
digest this 4-kb fragment is further cleaved by Hindll into 

a 3-kb and a 1-kb fragment. The DNA therefore contains a 
single Hindlll cleavage site, and thus it must be circular, as 

a single fragment of 10 kb is produced when it is cut with 
Hindili alone. Arranging the cutting sites on a circular DNA 
to give the appropriate sizes of fragments produces the 
map illustrated in Figure A10-11. 


Hindill 


EcoRI 


Figure A10-11 


ANSWER 10-12 

A. Infants 2 and 8 have identical STR patterns and therefore 
must be identical twins. Infants 3 and 6 also have 
identical STR patterns and must also be identical twins. 
The other two sets of twins must be fraternal twins 
because their STR patterns are not identical. Fraternal 
twins, like any pair of siblings born to the same parents, 
will have roughly half their genome in common. Thus, 
roughly half the STR polymorphisms in fraternal twins 
will be identical. Using this criterion, you can identify 
infants 1 and 7 as fraternal twins and infants 4 and 5 as 
fraternal twins. 

B. You can match infants to their parents by using the 
same sort of analysis of STR polymorphisms. Every 
band present in the analysis of an infant should have 
a matching band in one or the other of the parents, 
and, on average, each infant will share half of its 
polymorphisms with each parent. Thus, the degree 
of match between each child and each parent will be 
approximately the same as that between fraternal twins. 


ANSWER 10-13 Mutant bacteria that do not produce 
ice-protein have probably arisen many times in nature. 


Answers 


However, bacteria that produce ice-protein have a slight 
growth advantage over bacteria that do not, so it would be 
difficult to find such mutants in the wild. Recombinant DNA 
technology makes these mutants much easier to obtain. 

In this case, the consequences, both advantageous and 
disadvantageous, of using a genetically modified organism 
are therefore nearly indistinguishable from those of a natural 
mutant. Indeed, bacterial and yeast strains have been 
selected for centuries for desirable genetic traits that make 
them suitable for industrial-scale applications such as cheese 
and wine production. The possibilities of recombinant 

DNA technology are endless, however, and as with any 
technology, there is a risk of unforeseen consequences. 
Recombinant DNA experimentation, therefore, is regulated, 
and the risks of individual projects are carefully assessed 

by review panels before permissions are granted. The 

state of our knowledge is sufficiently advanced that the 
consequences of some changes, such as the disruption of a 
bacterial gene in the example above, can be predicted with 
reasonable certainty. Other applications, such as germ- 

line gene therapy to correct human disease, may have far 
more complex outcomes, and it will take many more years 
of research and ethical debate to determine whether such 
treatments will eventually be used. 


Chapter 11 


ANSWER 11-1 Water is a liquid, and the hydrogen 

bonds that form between water molecules are not static; 
they are continually broken and remade again by thermal 
motion. When a water molecule happens to be next to a 
hydrophobic molecule, it is more restricted in motion and 
has fewer neighbors with which it can interact, because it 
cannot form any hydrogen bonds in the direction of the 
hydrophobic molecule. It will therefore form hydrogen 
bonds to the more limited number of water molecules in 
its proximity. Bonding to fewer partners results in a more 
ordered water structure, which represents the cagelike 
structure in Figure 11-9. This structure has been likened 

to ice, although it is a more transient, less organized, and 
less extensive network than even a tiny ice crystal. The 
formation of any ordered structure decreases the entropy of 
the system and is thus energetically unfavorable (discussed 
in Chapter 3). 


ANSWER 11-2 (B) is the correct analogy for lipid bilayer 
assembly because exclusion from water rather than 
attractive forces between the lipid molecules is involved. 
If the lipid molecules formed bonds with one another, the 
bilayer would be less fluid, and might even become rigid, 
depending on the strength of the interaction. 


ANSWER 11-3 The fluidity of the bilayer is strictly confined 
to one plane: lipid molecules can diffuse laterally in their 
own monolayer but do not readily flip from one monolayer 
to the other. Lipid molecules inserted into one monolayer 
therefore remain in it unless they are actively transferred to 
the other monolayer by a transporter such as a scramblase 
or a flippase. 


ANSWER 11-4 In both an a helix and a B barrel, the 

polar peptide bonds of the polypeptide backbone can be 
completely shielded from the hydrophobic environment of 
the lipid bilayer by the hydrophobic amino acid side chains. 


A:27 


A:28 


Answers 


Internal hydrogen bonds between the peptide bonds 
stabilize the a helix and p barrel. 


ANSWER 11-5 The sulfate group in SDS is charged and 
therefore hydrophilic. The OH group and the C-O-C groups 
in Triton X-100 are polar; they can also form hydrogen 
bonds with water molecules and are therefore hydrophilic. 
In contrast, the red portions of the detergents are either 
hydrocarbon chains or aromatic rings, neither of which has 
polar groups that could form hydrogen bonds with water 
molecules; they are therefore hydrophobic. (One example of 
a tripeptide with hydrophobic side chains is shown in Figure 
A11-5.) 


i O-H water 
cH. co EO molecules 
hydrogen \ 7 ° OT Ss i 
bond H = 3 
| H Il | gi 
H E N c OHO 
X, AH S S AHNA SH 
O` HN C C N E 
ae Il d B ll 
O Z% A D 
= Gi T Fe % 
-H 
as CMs i Q LH 
P O 
H 
valine isoleucine alanine 
Figure A11-5 
ANSWER 11-6 Some of the transmembrane proteins are 


anchored to the spectrin filaments of the cell cortex. These 
molecules are not free to rotate or diffuse within the plane 
of the membrane. There is an excess of transmembrane 
proteins over the available attachment sites in the cortex, 
however, so some of the transmembrane protein molecules 
are not anchored. These proteins are free to rotate 

and diffuse within the plane of the membrane. Indeed, 
measurements of protein mobility show that there are two 
populations of each transmembrane protein, corresponding 
to those proteins that are anchored and those that are not. 


ANSWER 11-7 The different ways in which membrane 
proteins can be restricted to different regions of the 
membrane are summarized in Figure 11-31. The mobility 
of the membrane proteins is drastically reduced if they are 
bound to other proteins such as those of the cell cortex 

or the extracellular matrix. Some membrane proteins are 
confined to membrane domains by barriers, such as tight 
junctions. The fluidity of the lipid bilayer is not significantly 
affected by the anchoring of membrane proteins; the sea of 
lipid molecules flows around anchored membrane proteins 
like water around the posts of a pier. 


ANSWER 11-8 All of the statements are correct. 

A. The lipid bilayer is fluid because its lipid can undergo 
these motions. 

B. The lipid bilayer is fluid because its lipid can undergo 
these motions. 

C. Such exchanges require the action of a transporter. 

D. Hydrogen bonds are formed and broken by thermal 
motion. 

E. Glycolipids are mostly restricted to the monolayer 
of membranes that faces away from the cytosol. 
Some special glycolipids, such as phosphatidylinositol 
(discussed in Chapter 16), are found specifically in the 
cytosolic monolayer. 


F. The reduction of double bonds (by hydrogenation) 
allows the resulting saturated lipid molecules to pack 
more tightly against one another and therefore increases 
viscosity—that is, it turns oil into margarine. 

G. Examples include the many membrane enzymes involved 
in signaling (discussed in Chapter 16). 

H. Polysaccharides are the main constituents of mucus and 
slime; the carbohydrate coat of a cell, which is made 
up of polysaccharides and oligosaccharides, is a very 
important lubricant, particularly for cells that line blood 
vessels or circulate in the bloodstream. 


ANSWER 11-9 In a two-dimensional fluid, the molecules 
are free to move only in one plane; the molecules in a 
normal fluid, in contrast, can move in three dimensions. 


ANSWER 11-10 

A. You would have a detergent. The diameter of the lipid 
head would be much larger than that of the hydrocarbon 
tail, so that the shape of the molecule would be a 
cone rather than a cylinder and the molecules would 
aggregate to form micelles rather than bilayers. 

B. The lipid bilayers formed would be much more fluid. 
The bilayers would also be less stable, as the shorter 
hydrocarbon tails would be less hydrophobic, so the 
forces that drive the formation of the bilayer would be 
reduced. 

C. The lipid bilayers formed would be much less fluid. 
Whereas a normal lipid bilayer has the viscosity of olive 
oil, a bilayer made of the same lipids but with saturated 
hydrocarbon tails would have the consistency of bacon 
fat. 

D. The lipid bilayers formed would be much more fluid. 
Also, because the lipids would pack together less well, 
there would be more gaps and the bilayer would be 
more permeable to small, water-soluble molecules. 

E. If we assume that the lipid molecules are completely 
intermixed, the fluidity of the membrane would be 
unchanged. In such bilayers, however, the saturated lipid 
molecules would tend to aggregate with one another 
because they can pack so much more tightly and would 
therefore form patches of much-reduced fluidity. The 
bilayer would not, therefore, have uniform properties 
over its surface. Because in membrane lipid molecules, 
one saturated and one unsaturated hydrocarbon tail 
are typically linked to the same hydrophilic head, such 
segregation does not occur in cell membranes. 

F. The lipid bilayers formed would have virtually unchanged 
properties. Each lipid molecule would now span the 
entire membrane, with one of its two head groups 
exposed at each surface. Such lipid molecules are found 
in the membranes of thermophilic bacteria, which can 
live at temperatures approaching boiling water. Their 
bilayers do not come apart at elevated temperatures, as 
usual bilayers do, because the original two monolayers 
are now covalently linked into a single membrane. 


ANSWER 11-1] Phospholipid molecules are approximately 
cylindrical in shape. Detergent molecules, by contrast, 

are conical or wedge-shaped. A phospholipid molecule 
with only one hydrocarbon tail, for example, would be 

a detergent. To make a phospholipid molecule into a 
detergent, you would have to make its hydrophilic head 
larger or remove one of its tails so that it could form a 
micelle. Detergent molecules also usually have shorter 
hydrocarbon tails than lipid molecules. This makes them 


slightly water-soluble, so that detergent molecules leave 
and reenter micelles frequently in aqueous solution. Because 
of this, some monomeric detergent molecules are always 
present in aqueous solution and therefore can enter the 
lipid bilayer of a cell membrane to solubilize the proteins 
(see Figure 11-27). 


ANSWER 11-12 

A. There are about 4000 lipid molecules, each 0.5 nm wide, 
between one end of the bacterial cell and the other. 

So if a lipid molecule at one end moved directly in a 
straight line it would require only 4 x 1074 sec (= 4000 

x 10-7) to reach the other end. In reality, however, the 
lipid molecule would move in a random path, so that it 
would take considerably longer. We can calculate the 
approximate time required from the equation: t = x2/2D, 
where x is the average distance moved, t is the time 
taken, and Dis a constant called the diffusion coefficient. 
Inserting step values x = 0.5 nm and t = 107 sec, we 
obtain D = 1.25 x 1078 cm?/sec. Using this value in the 
same equation but with distance x = 2 x 10-4 cm (= um) 
gives the time taken t = 0.16 seconds. 

B. Similarly, if a 4-cm-diameter ping-pong ball exchanged 
partners every 1077 seconds and moved in a linear 
fashion, it would reach the opposite wall in 
1.5 x 10° sec, traveling at 1,440,000 km/hr. [But a 
random walk would take longer. Using the equation 
above, we calculate the constant D in this case to be 
8 x 107 cm2/sec and the time required to travel 
6 m about 2 msec (= 6002/1.6 x 108).] 


ANSWER 11-13 Transmembrane proteins anchor the plasma 
membrane to the underlying cell cortex, strengthening the 
membrane so that it can withstand the forces on it when 

the red blood cell is pumped through small blood vessels. 
Transmembrane proteins also transport nutrients and ions 
across the plasma membrane. 


ANSWER 11-14 The hydrophilic faces of the five membrane- 
spanning a helices, each contributed by a different subunit, 
are thought to come together to form a pore across the 
lipid bilayer that is lined with the hydrophilic amino acid 

side chains (Figure A11-14). lons can pass through this 
hydrophilic pore without coming into contact with the lipid 
tails of the bilayer. The hydrophobic side chains interact with 
the hydrophobic lipid tails. 
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Figure A11-14 


ANSWER 11-15 There are about 100 lipid molecules (i.e., 
phospholipid + cholesterol) for every protein molecule 

in the membrane [(2/50,000)/(1/800 + 1/386)]. A similar 
protein/lipid ratio is seen in many cell membranes. 
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ANSWER 1l-16 Membrane fusion does not alter the 
orientation of the membrane proteins with their attached 
color tags: the portion of each transmembrane protein that 
is exposed to the cytosol always remains exposed to the 
cytosol, and the portion exposed to the outside always 
remains exposed to the outside (Figure A11-16). At 0°C, 
the fluidity of the membrane is reduced, and the mixing of 
the membrane proteins is significantly slowed. 
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Figure A11-16 


ANSWER 11-17 The exposure of hydrophobic amino acid 
side chains to water is energetically unfavorable. There are 
two ways that such side chains can be sequestered away 
from water to achieve an energetically more favorable 
state. First, they can form transmembrane segments that 
span a lipid bilayer. This requires about 20 of them to be 
located sequentially in a polypeptide chain. Second, the 
hydrophobic amino acid side chains can be sequestered 

in the interior of the folded polypeptide chain. This is one 
of the major forces that lock the polypeptide chain into 

a unique three-dimensional structure. In either case, the 
hydrophobic forces in the lipid bilayer or in the interior of a 
protein are based on the same principles. 


ANSWER 11-18 (A) Antarctic fish live at subzero 
temperatures and are cold-blooded. To keep their 
membranes fluid at these temperatures, they have a high 
percentage of unsaturated phospholipids. 


ANSWER 11-19 Sequence B is most likely to form 

a transmembrane helix. It is composed primarily of 
hydrophobic amino acids, and therefore can be stably 
integrated into a lipid bilayer. In contrast, sequence A 
contains many polar amino acids (S, T, N, Q), and sequence 
C contains many charged amino acids (K, R, H, E, D), which 
would be energetically disfavored in the hydrophobic 
interior of the lipid bilayer. 


ANSWER 11-20 Triacylglycerol is an entirely hydrophobic 
molecule. Without a hydrophilic portion, it is unable to form 
favorable interactions with water. Thus, triacylglycerol would 
be unlikely to become part of a lipid bilayer. Instead, such 
purely hydrophobic molecules cluster together to limit their 
contact with surrounding water molecules (see Figure 11-9). 
In this way, triacylglycerols—which are major components of 
animal fats and plant oils—coalesce into fat droplets in an 
aqueous environment, including those in fat cells and plant 
seeds. 


Chapter 12 


ANSWER 12-1 

A. The movement of a solute mediated by a transporter 
can be described by a strictly analogous equation: 
equation 1: T + S e TS — T + S* 
where S is the solute, S* is the solute on the other 
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side of the membrane (i.e., although it is still the same 
molecule, it is now located in a different environment), 
and T is the transporter. 

B. This equation is useful because it describes a binding 
step followed by a delivery step. The mathematical 
treatment of this equation would be very similar to 
that described for enzymes (see Figure 4-35); thus 
transporters are characterized by a Km value that 
describes their affinity for a solute and a Vmax value that 
describes their maximal rate of transfer. 

To be more accurate, one could include the 
conformational change of the transporter in the reaction 
scheme: 

equation 2: T + S +> TS + T*S* — T* + S* 

equation 3: T <> T* 
where T* is the transporter after the conformational 
change that exposes its solute-binding site on the other 
side of the membrane. This account requires a second 
equation (3) that allows the transporter to return to its 
starting conformation. 

C. The equations do not describe the behavior of channels 
because solutes passing through channels do not bind to 
them in the way that a substrate binds to an enzyme. 


ANSWER 12-2 If the Na* pump is not working at full 
capacity because it is partially inhibited by ouabain or 
digitalis, the electrochemical gradient of Na* that the 
pump generates is less steep than that in untreated cells. 
Consequently, the Ca?+—Na* antiport works less efficiently, 
and Ca?* is removed from the cell more slowly. When 

the next cycle of muscle contraction begins, there is still 
an elevated level of Ca?* left in the cytosol. The entry 

of the same number of Ca?* ions into the cell therefore 
leads to a higher Ca@* concentration than in untreated 
cells, which in turn leads to a stronger and longer-lasting 
muscle contraction. Because the Na* pump fulfills essential 
functions in all animal cells, both to maintain osmotic 
balance and to generate the Na* gradient used to power 
many transporters, the drugs are deadly poisons if too much 
is taken. 


ANSWER 12-3 

A. Each of the rectangular peaks corresponds to the 
opening of a single channel that allows a small current 
to pass. You note from the recording that the channels 
present in the patch of membrane open and close 
frequently. Each channel remains open for a very short, 
somewhat variable time, averaging about 5 milliseconds. 
When open, the channels allow a small current with a 
unique amplitude (4 pA; one picoampere = 10712 A) to 
pass. In one instance, the current doubles, indicating 
that two channels in the same membrane patch opened 
simultaneously. 

B. If acetylcholine is omitted or is added to the solution 
outside the pipette, you would measure only the 
baseline current. Acetylcholine must bind to the 
extracellular portion of the acetylcholine receptor in 
the membrane patch to allow the channel to open 
frequently enough to detect changes in the currents; 
in the membrane patch shown in Figure 12-25B, only 
the cytoplasmic side of the receptor is exposed to the 
solution outside the microelectrode. 


ANSWER 12-4 The equilibrium potential of K* is -90 mV [= 
62 mV logio (5 mM/140 mM)], and that of Na* is 


+72 mV [= 62 mV logio (145 mM/10 mM)]. The K+ leak 
channels are the main ion channels open in the plasma 
membrane of a resting cell, and they allow Kt to come to 
equilibrium; the membrane potential of the cell is therefore 
close to -90 mV. When Nat channels open, Na* rushes in, 
and, as a result, the membrane potential reverses its polarity 
to a value nearer to +72 mV, the equilibrium value for Na‘. 
Upon closure of the Na* channels, the K+ leak channels 
allow K*, now no longer at equilibrium, to exit from the cell 
until the membrane potential is restored to the equilibrium 
value for K+, about -90 mV. 


ANSWER 12-5 When the resting membrane potential of 
an axon (inside negative) rises to a threshold value, voltage- 
gated Na* channels in the immediate neighborhood open 
and allow an influx of Na*. This depolarizes the membrane 
further, causing more voltage-gated Na* channels to open, 
including those in the adjacent plasma membrane. This 
creates a wave of depolarization that spreads rapidly along 
the axon, called the action potential. Because Na* channels 
become inactivated soon after they open, the outward 
flow of K* through voltage-gated Kt channels and K* 

leak channels is rapidly able to restore the original resting 
membrane potential. (96 words) 


ANSWER 12-6 If the number of functional acetylcholine 
receptors is reduced by the antibodies, the neurotransmitter 
(acetylcholine) that is released from the nerve terminals 
cannot (or can only weakly) stimulate the muscle to contract. 


ANSWER 12-7 Although the concentration of CI- outside 
cells is much higher than inside, when transmitter-gated CI- 
channels open in the plasma membrane of a postsynaptic 
neuron in response to an inhibitory neurotransmitter, very 
little CI- enters the cell. This is because the driving force 
for the influx of CI- across the membrane is close to zero at 
the resting membrane potential, which opposes the influx. 
If, however, the excitatory neurotransmitter opens Na+ 
channels in the postsynaptic membrane at the same time 
that an inhibitory neurotransmitter opens CI- channels, the 
resulting depolarization caused by the Na* influx will cause 
CI- to move into the cell through the open CF channels, 
neutralizing the effect of the Na* influx. In this way, 
inhibitory neurotransmitters suppress the production of an 
action potential by making the target cell membrane much 
harder to depolarize. 


ANSWER 12-8 By analogy to the Nat pump shown 

in Figure 12-12, ATP might be hydrolyzed and donate 

a phosphate group to the transporter when—and only 
when—it has the solute bound on the cytosolic face of the 
membrane (step 1 — 2). The attachment of the phosphate 
would trigger an immediate conformational change 

(step 2 — 3), thereby capturing the solute and exposing 

it to the other side of the membrane. The phosphate 
would be removed from the protein when—and only 
when—the solute had dissociated, and the now empty, 
nonphosphorylated transporter would switch back to the 
starting conformation (step 3 — 4) (Figure A12-8). 


ANSWER 12-9 

A. False. The plasma membrane contains transport proteins 
that confer selective permeability to many but not all 
charged molecules. In contrast, a pure lipid bilayer 
lacking proteins is highly impermeable to all charged 
molecules. 


OUTSIDE 


INSIDE 


Figure A12-8 


B. False. Channels do not have binding pockets for the 
solute that passes through them. Selectivity of a channel 
is achieved by the size of the internal pore and by 
charged regions at the entrance of the pore that attract 
or repel ions of the appropriate charge. 

C. False. Transporters are slower. They have enzyme- 
like properties; that is, they bind solutes and need to 
undergo conformational changes during their functional 
cycle. This limits the maximal rate of transport to about 
1000 solute molecules per second, whereas channels can 
pass up to 1,000,000 solute molecules per second. 

D. True. The bacteriorhodopsin of some photosynthetic 
bacteria pumps H* out of the cell using energy captured 
from visible light. 

E. True. Most animal cells contain K* leak channels in their 
plasma membrane that are predominantly open. The 
K+ concentration inside the cell still remains higher than 
outside because the membrane potential is negative and 
therefore inhibits the positively charged K* from leaking 
out. K+ is also continually pumped into the cell by the 
Na* pump. 

F. False. A symport binds two different solutes on the 
same side of the membrane. Turning it around would 
not change it into an antiport, which must also bind 
two different solutes but on opposing sides of the 
membrane. 

G. False. The peak of an action potential corresponds 
to a transient shift of the membrane potential from a 
negative to a positive value. The influx of Na* causes the 
membrane potential first to move toward zero and then 
to reverse, rendering the cell positively charged on its 
inside. Eventually, the resting potential is restored by an 
efflux of K* through voltage-gated K* channels and K* 
leak channels. 


ANSWER 12-10 The permeabilities are N2 (small and 
nonpolar) > ethanol (small and slightly polar) > water (small 
and polar) > glucose (large and polar) > Ca?+ (small and 
charged) > RNA (very large and charged). 


ANSWER 12-11 

A. Both couple the movement of two different solutes 
across a cell membrane. Symports transport both solutes 
in the same direction, whereas antiports transport the 
solutes in opposite directions. 

B. Both are mediated by membrane transport proteins. 
Passive transport of a solute occurs downhill, in the 
direction of its concentration or electrochemical 
gradient, whereas active transport occurs uphill and 
therefore needs an energy source. Active transport 
can be mediated by transporters but not by channels, 
whereas passive transport can be mediated by either. 

C. Both terms describe gradients across a membrane. The 
membrane potential refers to the voltage gradient; the 


Answers A:31 


nat A- 


electrochemical gradient is a composite of the voltage 
gradient and the concentration gradient of a specific 
charged solute (ion). The membrane potential is defined 
independently of the solute of interest, whereas an 
electrochemical gradient refers to the particular solute. 

D. A pump is a specialized transporter that uses energy 
to transport a solute uphill—against an electrochemical 
gradient for a charged solute or a concentration 
gradient for an uncharged solute. 

E. Both transmit electrical signals, by means of electrons 
in wires and by ion movements across the plasma 
membrane in axons. Wires are made of copper, axons 
are not. The signal passing down an axon does not 
diminish in strength because it is self-amplifying, 
whereas the signal in a wire decreases over distance (by 
leakage of current across the insulating sheath). 

F. Both affect the osmotic pressure in a cell. Anion is a 
solute that bears a charge. 


ANSWER 12-12 A bridge allows vehicles to pass over 
water in a steady stream; the entrance can be designed 
to exclude, for example, oversized trucks, and it can be 
intermittently closed to traffic by a gate. By analogy, 
gated channels allow ions to pass across a cell membrane, 
imposing size and charge restrictions. 

A ferry, in contrast, loads vehicles on one side of 
the body of water, crosses, and unloads on the other 
side—a slower process. During loading, particular vehicles 
could be selected from the waiting line because they fit 
particularly well on the car deck. By analogy, transporters 
bind solutes on one side of the membrane and then, after a 
conformational movement, release them on the other side. 
Specific binding selects the molecules to be transported. 
As in the case of coupled transport, sometimes you have to 
wait until the ferry is full before you can go. 


ANSWER 12-13 Acetylcholine is being transported into 
the vesicles by an H*-acetylcholine antiport in the vesicle 
membrane. The H* gradient that drives the uptake is 
generated by an ATP-driven H* pump in the vesicle 
membrane, which pumps H* into the vesicle (hence the 
dependence of the reaction on ATP). Raising the pH of 
the solution surrounding the vesicles decreases the H+ 
concentration of the solution, thereby increasing the 
outward gradient across the vesicle membrane, explaining 
the enhanced rate of acetylcholine uptake. 


ANSWER 12-14 The voltage gradient across the membrane 
is about 150,000 V/cm (70 x 10-3 V/4.5 x 1077 cm). This 
extremely powerful electric field is close to the limit at 
which insulating materials—such as the lipid bilayer— 
break down and cease to act as insulators. The large field 
indicates what a large amount of energy can be stored in 
electrical gradients across the membrane, as well as the 
extreme electrical forces that proteins can experience 
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in a membrane. A voltage of 150,000 V would instantly 
discharge in an arc across a 1-cm-wide gap (that is, air would 
be an insufficient insulator for this strength of field). 


ANSWER 12-15 

A. Nothing. You require ATP to drive the Na* pump. 

B. The ATP becomes hydrolyzed, and Na* is pumped into 
the vesicles, generating a concentration gradient of Nat 
across the membrane. At the same time, K* is pumped 
out of the vesicles, generating a concentration gradient 
of K+ of opposite polarity. When all the K+ is pumped 
out of the vesicle or the ATP runs out, the pump would 
stop. 

C. The pump would initiate a transport cycle and then 
cease. Because all reaction steps must occur strictly 
sequentially, dephosphorylation and the accompanying 
conformational switch cannot occur in the absence 
of K*. The Na* pump will therefore become stuck in 
the phosphorylated state, waiting indefinitely for a 
potassium ion. The number of sodium ions transported 
would be minuscule, because each pump molecule 
would have functioned only a single time. Similar 
experiments, leaving out individual ions and analyzing 
the consequences, were used to determine the 
sequence of steps by which the Na* pump works. 

D. ATP would become hydrolyzed, and Na* and K* would 
be pumped across the membrane as described in (B). 
However, the pump molecules that sit in the membrane 
in the reverse orientation would be completely inactive 
(i.e., they would not—as one might have erroneously 
assumed—pump ions in the opposite direction) 
because ATP would not have access to the site on 
these molecules where phosphorylation occurs, which is 
normally exposed to the cytosol. ATP is highly charged 
and cannot cross membranes without the help of specific 
transporters. 

E. ATP becomes hydrolyzed, and Na* and K* are pumped 
across the membrane, as described in (B). Kt, however, 
immediately flows back into the vesicles through the 
K+ leak channels. K* moves down the K* concentration 
gradient formed by the action of the Na* pump. 

With each Kt that moves into the vesicle through a 
leak channel, a positive charge is moved across the 
membrane, generating a membrane potential that is 
positive on the inside of the vesicles. Eventually, K* 
will stop flowing through the leak channels when the 
membrane potential balances the K* concentration 
gradient. The scenario described here is a slight 
oversimplification: the Na* pump in mammalian 

cells actually moves three sodium ions out of cells 

for each two potassium ions that it pumps, thereby 
driving an electric current across the membrane and 
making a small additional contribution to the resting 
membrane potential (which therefore corresponds only 
approximately to a state of equilibrium for K* moving via 
Kt leak channels). 


ANSWER 12-16 lon channels can be ligand-gated, voltage- 
gated, or mechanically- (stress-) gated. 


ANSWER 12-17 The cell has a volume of 10-712 liters 

(= 10-15 m3) and thus contains 6 x 104 calcium ions 

(= 6 x 1023 molecules/mole x 100 x 107? moles/liter 

x 10712 liters). Therefore, to raise the intracellular Ca2* 
concentration fiftyfold, another 2,940,000 calcium ions have 


to enter the cell (note that at 5 uM concentration there are 
3 x 10° ions in the cell, of which 60,000 are already present 
before the channels are opened). Because each of the 1000 
channels allows 10° ions to pass per second, each channel 
has to stay open for only 3 milliseconds. 


ANSWER 12-18 Animal cells drive most transport processes 
across the plasma membrane with the electrochemical 
gradient of Na*. ATP is needed to fuel the Na* pump to 
maintain the Na* gradient. 


ANSWER 12-19 

A. If H* is pumped across the membrane into the 
endosomes, an electrochemical gradient of H* results, 
composed of both an H* concentration gradient and 
a membrane potential, with the interior of the vesicle 
positive. Both of these components add to the energy 
that is stored in the gradient and that must be supplied 
to generate it. The electrochemical gradient will limit 
the transfer of more H+. If, however, the membrane also 
contains Cl channels, the negatively charged CI- in the 
cytosol will flow into the endosomes and diminish their 
membrane potential. It therefore becomes energetically 
less expensive to pump more H* across the membrane, 
and the interior of the endosomes can become more 
acidic. 

B. No. As explained in (A), some acidification would still 
occur in their absence. 


ANSWER 12-20 

A. See Figure A12-20. 

B. The transport rates of compound A are proportional 
to its concentration, indicating that compound A can 
diffuse through membranes on its own. Compound A is 
likely to be ethanol, because it is a small and relatively 
nonpolar molecule that can diffuse readily through the 
lipid bilayer (see Figure 12-2). In contrast, the transport 
rates of compound B saturate at high concentrations, 
indicating that compound B is transported across the 
membrane by some sort of membrane transport protein. 
Transport rates cannot increase beyond a maximal rate 
at which this protein can function. Compound B is likely 
to be acetate, because it is a charged molecule that 
could not cross the membrane without the help of a 
membrane transport protein. 
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Figure A12-20 


ANSWER 12-21 The membrane potential and the high 
extracellular Na* concentration provide a large inward 
electrochemical driving force and a large reservoir of Nat 
ions, so that mostly Na* ions enter the cell as acetylcholine 
receptors open. Ca? ions will also enter the cell, but 

their influx is much more limited because of their lower 
extracellular concentration. (Most of the Ca?* that enters 
the cytosol to stimulate muscle contraction is released from 
intracellular stores, as we discuss in Chapter 17). Because 
of the high intracellular K* concentration and the opposing 
direction of the membrane potential, there will be little if 
any movement of K* ions upon opening of a cation channel. 


ANSWER 12-22 The diversity of neurotransmitter-gated ion 
channels raises the hope of developing new drugs specific 
for each channel type. Each of the diverse subtypes of 
these channels is expressed in a narrow subset of neurons. 
This narrow range of expression should make it possible, in 
principle, to discover or design drugs that affect particular 
receptor subtypes present in a selected set of neurons, thus 
targeting particular brain functions with greater specificity. 


Chapter 13 


ANSWER 13-1 To keep glycolysis going, cells need 

to regenerate NAD* from NADH. In the absence of 
oxygen, there is no efficient way to do this without 
fermentation. Without regenerated NAD*, step 6 of 
glycolysis [the oxidation of glyceraldehyde 3-phosphate to 
1,3-bisphosphoglycerate (Panel 13-1, pp. 436-437)] could 
not occur, and the product glyceraldehyde 3-phosphate 
would accumulate. The same thing would happen in cells 
unable to make either lactate or ethanol: neither would 
be able to regenerate NAD+, and so glycolysis would be 
blocked at the same step. 


ANSWER 13-2 Arsenate instead of phosphate becomes 
attached in step 6 of glycolysis to form 1-arseno-3- 
phosphoglycerate (Figure A13-2). Because of its sensitivity 
to hydrolysis in water, the high-energy bond is destroyed 
before the molecule that contains it can diffuse to 

reach the next enzyme. The product of the hydrolysis, 
3-phosphoglycerate, is the same product normally formed 
in step 7 by the action of phosphoglycerate kinase. But 
because hydrolysis occurs nonenzymatically, the energy 
liberated by breaking the high-energy bond cannot be 
captured to generate ATP. In Figure 13-7, therefore, the 
reaction corresponding to the downward-pointing arrow 
in step 7 would still occur, but the wheel that provides 
the coupling to ATP synthesis is missing. Arsenate wastes 
metabolic energy by uncoupling many phosphotransfer 
reactions by the same mechanism, which is why it is so 
poisonous. 
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ANSWER 13-3 The oxidation of fatty acids breaks the 
carbon chain down into two-carbon units at a time (acetyl 
groups that had become attached to CoA). Conversely, 
during biogenesis, fatty acids are constructed by linking 
together acetyl groups. Most fatty acids therefore have an 
even number of carbon atoms. 


ANSWER 13-4 Because the function of the citric acid cycle 
is to harvest the energy released during the oxidation, it 

is advantageous to break the overall reaction into as many 
steps as possible (see Figure 13-1). Using a two-carbon 
compound (acetyl CoA), the available chemistry would be 
much more limited, and it would be impossible to generate 
as many intermediates. 


ANSWER 13-5 Itis true that oxygen atoms are returned 
to the atmosphere as part of CO2 during the oxidative 
degradation of glucose (see Figure 13-3). The CO2 
released from the cells, however, does not contain the 
specific oxygen atoms consumed as part of the oxidative 
phosphorylation process; these are converted into water. 
One can show this directly by incubating living cells in an 
atmosphere that includes molecular oxygen containing the 
180 isotope of oxygen instead of the naturally abundant 
isotope, 160, In such an experiment, one finds that all the 
COz2 released from cells contains only 160. Therefore, the 
oxygen atoms in the released CO2 molecules do not come 
directly from the atmosphere but from organic molecules 
that the cell has first made and then oxidized as fuel (see 
top of first page of Panel 13-2, pp. 442-443). 


ANSWER 13-6 The cycle continues because intermediates 
are replenished as necessary by reactions leading into 

the citric acid cycle (instead of away from it). One of the 
most important reactions of this kind is the conversion 

of pyruvate to oxaloacetate by the enzyme pyruvate 
carboxylase: 


pyruvate + CO2 + ATP + H20 — oxaloacetate + ADP + P; + 2H* 


This reaction feeds oxaloacetate into the citric acid cycle. 
It is one of the many examples of how metabolic pathways 
are carefully coordinated to work together to maintain 
appropriate concentrations of all metabolites required by 
the cell (see Figure A13-6). 
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Figure A13-6 


ANSWER 13-7 The carbon atoms in sugar molecules are 
already partially oxidized. In contrast, only the very first 
carbon atom in the acyl chains of fatty acids is oxidized. 
Thus, two carbon atoms from glucose are lost as CO2 during 
the conversion of pyruvate to acetyl CoA (see Figure 13-3), 
leaving only four carbons to enter the citric acid cycle, 
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where most of the energy is captured. In contrast, all carbon 
atoms of a fatty acid are converted into acetyl CoA (see 
Figure 13-11). 


ANSWER 13-8 

A. False. If this were the case, the reaction would be 
useless for the cell. No chemical energy would be 
harvested in a useful form (e.g., ATP) to be used for 
metabolic processes. (The cells would certainly be warm, 
though!) 

B. False. No energy-conversion process can be 100% 
efficient. Recall that entropy in the universe always 
has to increase, and for most reactions this occurs by 
releasing heat. 

C. True. The carbon atoms in glucose are in a reduced state 
compared with those in COz, in which they are fully 
oxidized. 

D. False. The final steps of oxidative phosphorylation do 
indeed produce some water (see Figure 13-3). But 
water is so abundant in the biosphere that this is no 
more than “a drop in the ocean.” 

E. True. If it had occurred in only one step, then all the 
energy would be released at once and it would be 
impossible to harness it efficiently to drive other 
reactions, such as the synthesis of ATP. 

F. False. Molecular oxygen (O2) is used only in the very last 
step of the reaction (see Figure 13-3). 

G. True. Plants convert COz into sugars by harvesting the 
energy of light in photosynthesis. O2 is produced in the 
process and released into the atmosphere by plant cells. 

H. True. Anaerobically growing cells use glycolysis to 
oxidize sugars to pyruvate: animal cells convert the 
pyruvate into lactate, and no COz is produced; yeast 
cells, however, convert the pyruvate into ethanol and 
COz. It is this CO2 gas released from yeast cells during 
fermentation that makes bread dough rise and that 
carbonates beer and champagne. 


ANSWER 13-9 Darwin exhaled the carbon atom, which 
therefore must be the carbon atom of a CO2 molecule. 
After spending some time in the atmosphere, the CO2 
molecule must have entered a plant cell, where it became 
“fixed” by photosynthesis and converted into part of a 
sugar molecule. While it is certain that these early steps 
must have happened this way, there are many different 
paths from there that the carbon atom could have taken. 
The sugar could have been broken down by the plant cell 
into pyruvate or acetyl CoA, for example, which then could 
have entered biosynthetic reactions to build an amino acid. 
The amino acid might have been incorporated into a plant 
protein. You might have eaten the delicious leaves of the 
plant in your salad, and digested the protein in your gut 

to produce amino acids again. After circulating in your 
bloodstream, the amino acid might have been taken up by 
a developing red blood cell to make its own protein, such 
as the hemoglobin in question. If we wish, of course, we can 
make our food-chain scenario more complicated. The plant, 
for example, might have been eaten by an animal that in 
turn was consumed by you during a lunch break. Moreover, 
because Darwin died more than 100 years ago, the carbon 
atom could have traveled such a route many times. In 

each round, however, it would have started again as fully 
oxidized COz2 gas and entered the living world through 
photosynthesis in a plant. 


ANSWER 13-10 Yeast cells proliferate much better 
aerobically. Under anaerobic conditions they cannot perform 
oxidative phosphorylation and therefore have to produce all 
their ATP by glycolysis, which is less efficient. Whereas one 
glucose molecule yields a net gain of two ATP molecules 

by glycolysis, the additional use of the citric acid cycle and 
oxidative phosphorylation boosts the energy yield up to 
about 30 ATP molecules. The citric acid cycle depends on 
O2 because it needs NAD* to continue running. 


ANSWER 13-11 The amount of free energy stored in 

the phosphate bond in creatine phosphate is larger than 
that of the anhydride bonds in ATP. Hydrolysis of creatine 
phosphate can therefore be directly coupled to the 
production of ATP. 


creatine phosphate + ADP — creatine + ATP 


The AG? for this reaction is -12.6 kJ/mole, indicating that it 
proceeds rapidly to the right, as written. 


ANSWER 13-12 The extreme conservation of glycolysis 

is some of the evidence that all present-day cells are 
derived from a single founder cell, as discussed in Chapter 
1. The elegant reactions of glycolysis would therefore 
have evolved only once, and then they would have been 
inherited as organisms evolved. The later invention of 
oxidative phosphorylation allowed organisms to capture 
15 times more energy from fuel molecules than is possible 
by glycolysis alone. This remarkable efficiency is close 

to the theoretical limit and hence virtually eliminates the 
opportunity for further improvements. Thus, the generation 
of alternative pathways would result in no obvious 
reproductive advantage that would have been selected in 
evolution. 


ANSWER 13-13 If one glucose molecule produces 30 ATPs, 
then to generate 10? ATP molecules will require 1 x 109/30 = 
3.3 x 107 glucose molecules and 6 x 3.3 x 107 = 2 x 108 
molecules of oxygen. Thus, in one minute, the cell will 
consume 2 x 108/(6 x 1023) or 3.3 x 10-16 moles of oxygen, 
which would occupy 3.3 x 10716 22.4 = 7.4 x 10715 liters in 
gaseous form. The volume of the cell is 10715 cubic meters 
[= (10-5)3], which is 10712 liter. The cell therefore consumes 
an amount of O2 gas equivalent to about 0.7% of the cell 
volume every minute, or an amount of O2 gas equivalent to 
the cell volume in 2 hours and 15 minutes. 


ANSWER 13-14 The reactions each have negative AG 
values and are therefore energetically favorable (see 
Figure A13-14 for energy diagrams). 


ANSWER 13-15 

A. Pyruvate is converted to acetyl CoA, and the labeled 14C 
atom is released as '*CO2 gas (see Figure 13-10). 

B. By following the '4C-labeled atom through every 
reaction in the citric acid cycle, shown in Panel 13-2 (pp. 
442-443), you find that the added 14C label would be 
quantitatively recovered in oxaloacetate. The analysis 
also reveals, however, that it is no longer in the keto 
group but in the methylene group of oxaloacetate 
(Figure A13-15). 


ANSWER 13-16 In the presence of molecular oxygen, 
oxidative phosphorylation converts most of the cellular 
NADH to NAD+ (see Figure 13-19). Since fermentation 
requires NADH (see Figure 13-6), it is severely inhibited by 
the availability of oxygen gas. 


STEP STEP STEP STEP 
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Figure A13-14 


Chapter 14 


ANSWER 14-1 By making membranes permeable to 
protons, DNP collapses—or, at very small concentrations, 
diminishes—the proton gradient across the inner 
mitochondrial membrane. Cells continue to oxidize food 
molecules to feed high-energy electrons into the electron- 
transport chain, but H* ions pumped across the membrane 
flow back across that membrane in a futile cycle. As a 
result, the energy of the electrons cannot be tapped 

to drive ATP synthesis, and instead is released as heat. 
Patients who have been given small doses of DNP lose 
weight because their fat reserves are used more rapidly to 
feed the electron-transport chain, and the whole process 
simply “wastes” energy as heat. A similar mechanism of 
heat production is used naturally in a specialized tissue 
composed of brown fat cells, which is abundant in newborn 
humans and in hibernating animals. These cells are packed 
with mitochondria that leak part of their H* gradient 
futilely back across the membrane for the sole purpose of 
warming up the organism. These cells are brown because 
they are packed with mitochondria, which contain high 
concentrations of pigmented proteins such as cytochromes. 


ANSWER 14-2 The inner mitochondrial membrane is the 
site of oxidative phosphorylation, and it produces most 

of the cell's ATP. Cristae are portions of the mitochondrial 
inner membrane that are folded inward. Mitochondria that 
have a higher density of cristae have a larger area of inner 
membrane and therefore a greater capacity to carry out 
oxidative phosphorylation. Heart muscle expends a lot of 
energy during its continuous contractions, whereas skin 
cells have a smaller energy demand. An increased density of 
cristae therefore increases the ATP-production capacity of 
the heart muscle cell. This is a remarkable example of how 
cells adjust the abundance of their individual components 
according to need. 
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ANSWER 14-3 

A. The DNP collapses the electrochemical proton gradient 
completely. H* ions that are pumped to one side of the 
membrane flow back freely, and therefore no energy to 
drive ATP synthesis can be stored across the membrane. 

B. An electrochemical gradient is made up of two 
components: a concentration gradient and an electrical 
potential. If the membrane is made permeable to Kt 
with nigericin, K* will be driven into the matrix by the 
electrical potential of the inner membrane (negative 
inside, positive outside). The influx of positively charged 
K* will abolish the membrane’s electrical potential. 
In contrast, the concentration component of the H+ 
gradient (the pH difference) is unaffected by nigericin. 
Therefore, only part of the driving force that makes it 
energetically favorable for H* ions to flow back into the 
matrix is lost. 


ANSWER 14-4 

A. Such a turbine running in reverse is an electrically 
driven water pump, which is analogous to what the 
ATP synthase becomes when it uses the energy of ATP 
hydrolysis to pump protons against their electrochemical 
gradient across the inner mitochondrial membrane. 

B. The ATP synthase should stall when the energy that it 
can draw from the proton gradient is just equal to the 
AG required to make ATP; at this equilibrium point 
there will be neither net ATP synthesis nor net ATP 
consumption. 

C. As the cell uses up ATP, the ATP/ADP ratio in the matrix 
falls below the equilibrium point just described, and ATP 
synthase uses the energy stored in the proton gradient 
to synthesize ATP in order to restore the original ATP/ 
ADP ratio. Conversely, when the electrochemical proton 
gradient drops below that at the equilibrium point, ATP 
synthase uses ATP in the matrix to restore this gradient. 


ANSWER 14-5 An electron pair, when passing from NADH 
to O2 through the three respiratory complexes, causes 

10 H* to be pumped across the membrane. Four H* are 
needed to make each ATP: three for synthesis from ADP 
and one for ATP export to the cytosol. Therefore, 2.5 ATP 
molecules are synthesized from each NADH molecule. 


ANSWER 14-6 One can describe four essential roles for 
the proteins in the process. First, the chemical environment 
provided by a protein's amino acid side chains sets the 
redox potential of each Fe ion such that electrons can be 
passed in a defined order from one component to the next, 
giving up their energy in small steps and becoming more 
firmly bound as they proceed. Second, the proteins position 
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the Fe ions so that the electrons can move efficiently 
between them. Third, the proteins prevent electrons from 
skipping an intermediate step; thus, as we have learned for 
other enzymes (discussed in Chapter 4), they channel the 
electron flow along a defined path. Fourth, the proteins 
couple the movement of the electrons down their energy 
ladder to the pumping of protons across the membrane, 
thereby harnessing the energy that is released and storing it 
in a proton gradient that is then used for ATP production. 


ANSWER 14-7 It would not be productive to use the 
same carrier in two steps. If ubiquinone, for example, could 
transfer electrons directly to the cytochrome c oxidase, the 
cytochrome c reductase complex would often be skipped 
when electrons are collected from NADH dehydrogenase. 
Given the large difference in redox potential between 
ubiquinone and cytochrome c oxidase, a large amount of 
energy would be released as heat and thus be wasted. 
Electron transfer directly between NADH dehydrogenase 
and cytochrome c would similarly allow the cytochrome c 
reductase complex to be bypassed. 


ANSWER 14-8 Protons pumped across the inner 
mitochondrial membrane into the intermembrane space 
equilibrate with the cytosol, which functions as a huge 

H* sink. Both the mitochondrial matrix and the cytosol 
support many metabolic reactions that require a pH 
around neutrality. The H* concentration difference, ApH, 
that can be achieved between the mitochondrial matrix 
and the cytosol is therefore relatively small (less than one 
pH unit). Much of the energy stored in the mitochondrial 
electrochemical proton gradient is instead due to the 
membrane potential (see Figure 14-15). In contrast, 
chloroplasts have a smaller, dedicated compartment into 
which H* ions are pumped. Much higher concentration 
differences can be achieved (up to a thousandfold, or 3 pH 
units), and much of the energy stored in the thylakoid H+ 
gradient is due to the H* concentration difference between 
the thylakoid space and the stroma. 


ANSWER 14-9 NADH and NADPH differ by the presence 
of a single phosphate group. That phosphate gives NADPH 
a slightly different shape from NADH, which allows these 
molecules to be recognized by different enzymes, and 

thus to deliver their electrons to different destinations. 
Such a division of labor is useful because NADPH tends to 
be involved in biosynthetic reactions, where high-energy 
electrons are used to produce energy-rich biological 
molecules. NADH, on the other hand, is involved in 
reactions that oxidize energy-rich food molecules to 
produce ATP. Inside the cell, the ratio of NAD* to NADH is 
kept high, whereas the ratio of NADP* to NADPH is kept 
low. This provides plenty of NAD* to act as an oxidizing 
agent and plenty of NADPH to act as a reducing agent—as 
required for their special roles in catabolism and anabolism, 
respectively. 


ANSWER 14-10 

A. Photosynthesis produces sugars, most importantly 
sucrose, that are transported from the photosynthetic 
cells through the sap to root cells. There, the sugars are 
oxidized by glycolysis in the root cell cytoplasm and by 
oxidative phosphorylation in the root cell mitochondria 
to produce ATP, as well as being used as the building 
blocks for many other metabolites. 


B. Mitochondria are required even during daylight hours in 
chloroplast-containing cells to supply the cell with ATP 
derived by oxidative phosphorylation. Glyceraldehyde 
3-phosphate made by photosynthesis in chloroplasts 
moves to the cytosol and is eventually used as a source 
of energy to drive ATP production in mitochondria. 


ANSWER 14-1] All statements are correct. 

A. This is a necessary condition. If it were not true, 
electrons could not be removed from water and the 
reaction that splits water molecules (H20 — 2H* + 402 
+ 2e) would not occur. 

B. Only when excited by light energy does chlorophyll have 
a low enough affinity for an electron to pass it to an 
electron carrier with a low electron affinity. This transfer 
allows the energy of the photon to be harnessed as 
energy that can be utilized in chemical conversions. 

C. It can be argued that this is one of the most important 
obstacles that had to be overcome during the 
evolution of photosynthesis: partially reduced oxygen 
molecules, such as the superoxide radical Oz, are 
dangerously reactive and will attack and destroy almost 
any biologically active molecule. These intermediates 
therefore have to remain tightly bound to the metals in 
the active site of the enzyme until all four electrons have 
been removed from two water molecules. This requires 
the sequential capture of four photons by the same 
reaction center. 


ANSWER 14-12 

A. True. NAD* and quinones are examples of compounds 
that do not have metal ions but can participate in 
electron transfer. 

B. False. The potential is due to protons (H*) that are 
pumped across the membrane from the matrix to the 
intermembrane space. Electrons remain bound to 
electron carriers in the inner mitochondrial membrane. 

C. True. Both components add to the driving force that 
makes it energetically favorable for H+ to flow back into 
the matrix. 

. True. Both move rapidly in the plane of the membrane. 
False. Not only do plants need mitochondria to make 
ATP in cells that do not have chloroplasts, such as root 
cells, but mitochondria make most of the cytosolic ATP 
in all plant cells. 

F. True. Chlorophyll’s physiological function requires it 

to absorb light; heme just happens to be a colored 
compound from which blood derives its red color. 

G. False. Chlorophyll absorbs light and transfers energy in 
the form of an energized electron, whereas the iron in 
heme is a simple electron carrier. 

H. False. Most of the dry weight of a tree comes from 
carbon derived from the COz that has been fixed during 
photosynthesis. 


ANSWER 14-13 It takes three protons. The precise value of 
the AG for ATP synthesis depends on the concentrations of 
ATP, ADP, and P; (as described in Chapter 3). The higher the 
ratio of the concentration of ATP to ADP, the more energy it 
takes to make additional ATP. The lower value of 46 kJ/mole 
therefore applies to conditions where cells have expended 
a lot of energy and have therefore decreased the normal 
ATP/ADP ratio. 


ANSWER 14-14 If no Oz is available, all components of 
the mitochondrial electron-transport chain will accumulate 
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in their reduced form. This is the case because electrons 
derived from NADH enter the chain but cannot be 
transferred to O2. The electron-transport chain therefore 
stalls with all of its components in the reduced form. If O2 is 
suddenly added again, the electron carriers in cytochrome 
c oxidase will become oxidized before those in NADH 
dehydrogenase. This is true because, after O2 addition, 
cytochrome c oxidase will donate its electrons directly 

to O2, thereby becoming oxidized. A wave of increasing 
oxidation then passes backward with time from cytochrome 
c oxidase through the components of the electron-transport 
chain, as each component regains the opportunity to pass 
on its electrons to downstream components. 


ANSWER 14-15 As oxidized ubiquinone becomes reduced, 
it picks up two electrons but also two protons from water 
(Figure 14-21). Upon oxidation, these protons are released. 
If reduction occurs on one side of the membrane and 
oxidation at the other side, a proton is pumped across 

the membrane for each electron transported. Electron 
transport by ubiquinone thereby contributes directly to the 
generation of the H* gradient. 


ANSWER 14-16 Photosynthetic bacteria and plant cells 
use the electrons derived in the reaction 2H20 — 4e + 
4H* + O2 to reduce NADP* to NADPH, which is then used 
to produce useful metabolites. If the electrons were used 
instead to produce H2 in addition to Oz, the cells would 
lose any benefit they derive from carrying out the reaction, 
because the electrons could not take part in metabolically 
useful reactions. 


ANSWER 14-17 

A. The switch in solutions creates a pH gradient across 
the thylakoid membrane. The flow of H* ions down the 
electrochemical proton gradient drives ATP synthase, 
which converts ADP to ATP. 

B. No light is needed, because the H* gradient is 
established artificially without a need for the light-driven 
electron-transport chain. 

C. Nothing. The H* gradient would be in the wrong 
direction; ATP synthase would not work. 

D. The experiment provided early supporting evidence for 
the chemiosmotic model by showing that an H* gradient 
alone is sufficient to drive ATP synthesis (see How We 
Know, pp. 476-477). 


ANSWER 14-18 

A. When the vesicles are exposed to light, H* ions 
(derived from H20) pumped into the vesicles by the 
bacteriorhodopsin flow back out through the ATP 
synthase, causing ATP to be made in the solution 
surrounding the vesicles. 

B. If the vesicles are leaky, no H* gradient can form and 
thus ATP synthase cannot work. 

C. Using components from widely divergent organisms 
can be a very powerful experimental tool. Because 
the two proteins come from such different sources, 
it is very unlikely that they form a direct functional 
interaction. The experiment therefore strongly suggests 
that electron transport and ATP synthesis are separate 
events. This approach is therefore a valid one. 


ANSWER 14-19 The redox potential of FADH2 is too low to 
transfer electrons to the NADH dehydrogenase complex, 
but high enough to transfer electrons to ubiquinone (Figure 
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14-22). Therefore, electrons from FADH2 can enter the 
electron-transport chain only at this step (Figure A14-19). 
Because the NADH dehydrogenase complex is bypassed, 
fewer H* ions are pumped across the membrane and less 
ATP is made. This example shows the versatility of the 
electron-transport chain. The ability to use vastly different 
sources of electrons from the environment to feed electron 
transport is thought to have been an essential feature in the 
early evolution of life. 


ANSWER 14-20 If these bacteria used a proton gradient 
to make their ATP in a fashion analogous to that in other 
bacteria (that is, fewer protons inside than outside), they 
would need to raise their cytoplasmic pH even higher than 
that of their environment (pH 10). Cells with a cytoplasmic 
pH greater than 10 would not be viable. These bacteria 
must therefore use gradients of ions other than Ht, such 
as Na* gradients, in the chemiosmotic coupling between 
electron transport and an ATP synthase. 


ANSWER 14-21 Statements A and B are accurate. 
Statement C is incorrect, because the chemical reactions 
that are carried out in each cycle are completely different, 
even though the net effect is the same as that expected for 
simple reversal. 


ANSWER 14-22 This experiment would suggest a two-step 
model for ATP synthase function. According to this model, 
the flow of protons through the base of the synthase drives 
rotation of the head, which in turn causes ATP synthesis. In 
their experiment, the authors have succeeded in uncoupling 
these two steps. If rotating the head mechanically is 
sufficient to produce ATP in the absence of any applied 
proton gradient, the ATP synthase is a protein machine 
that indeed functions like a “molecular turbine.” This would 
be a very exciting experiment indeed, because it would 
directly demonstrate the relationship between mechanical 
movement and enzymatic activity. There is no doubt that it 
should be published and that it would become a “classic.” 


ANSWER 14-23 Only under condition (E) is electron transfer 
observed, with cytochrome c becoming reduced. A portion 
of the electron-transport chain has been reconstituted in 
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this mixture, so that electrons can flow in the energetically 
favored direction from reduced ubiquinone to the 
cytochrome c reductase complex to cytochrome c. Although 
energetically favorable, the transfer in (A) cannot occur 
spontaneously in the absence of the cytochrome c reductase 
complex to catalyze this reaction. No electron flow occurs in 
the other experiments, whether the cytochrome c reductase 
complex is present or not: in experiments (B) and (F), both 
ubiquinone and cytochrome c are oxidized; in experiments 
(C) and (G), both are reduced; and in experiments (D) and 
(H), electron flow is energetically disfavored because an 
electron in reduced cytochrome c has a lower free energy 
than an electron added to oxidized ubiquinone. 


Chapter 15 


ANSWER 15-1 Although the nuclear envelope forms 

one continuous membrane, it has specialized regions 

that contain special proteins and have a characteristic 
appearance. One such specialized region is the inner nuclear 
membrane. Membrane proteins can indeed diffuse between 
the inner and outer nuclear membranes, at the connections 
formed around the nuclear pores. Those proteins with 
particular functions in the inner membrane, however, are 
usually anchored there by their interaction with other 
components such as chromosomes and the nuclear lamina (a 
protein meshwork underlying the inner nuclear membrane 
that helps give structural integrity to the nuclear envelope). 


ANSWER 15-2 Eukaryotic gene expression is more 
complicated than prokaryotic gene expression. In particular, 
prokaryotic cells do not have introns that interrupt the 
coding sequences of their genes, so that an mRNA can 

be translated immediately after it is transcribed, without 

a need for further processing (discussed in Chapter 7). In 
fact, in prokaryotic cells, ribosomes start translating most 
mRNAs before transcription is finished. This would have 
disastrous consequences in eukaryotic cells, because most 
RNA transcripts have to be spliced before they can be 
translated. The nuclear envelope separates the transcription 
and translation processes in space and time: a primary RNA 
transcript is held in the nucleus until it is properly processed 
to form an mRNA, and only then is it allowed to leave the 
nucleus so that ribosomes can translate it. 


ANSWER 15-3 An mRNA molecule is attached to the ER 
membrane by the ribosomes translating it. This ribosome 
population, however, is not static; the mRNA is continuously 
moved through the ribosome. Those ribosomes that 

have finished translation dissociate from the 3' end of the 
mRNA and from the ER membrane, but the mRNA itself 
remains bound by other ribosomes, newly recruited from 
the cytosolic pool, that have attached to the 5’ end of the 
mRNA and are still translating the mRNA. Depending on its 
length, there are about 10-20 ribosomes attached to each 
membrane-bound mRNA molecule. 


ANSWER 15-4 

A. The internal signal sequence functions as a membrane 
anchor, as shown in Figure 15-17. Because there is 
no stop-transfer sequence, however, the C-terminal 
end of the protein continues to be translocated into 
the ER lumen. The resulting protein therefore has its 
N-terminal domain in the cytosol, followed by a single 
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transmembrane segment, and a C-terminal domain in 
the ER lumen (Figure A15-4A). 

B. The N-terminal signal sequence initiates translocation of 
the N-terminal domain of the protein until translocation 
is stopped by the stop-transfer sequence. A cytosolic 
domain is synthesized until the start-transfer sequence 
initiates translocation again. The situation now resembles 
that described in (A), and the C-terminal domain of 
the protein is translocated into the lumen of the ER. 
The resulting protein therefore spans the membrane 
twice. Both its N-terminal and C-terminal domains are 
in the ER lumen, and a loop domain between the two 
transmembrane regions is exposed in the cytosol 
(Figure A15-4B). 

C. It would need a cleaved signal sequence, followed by 
an internal stop-transfer sequence, followed by pairs of 
start- and stop-transfer sequences (Figure A15-4C). 

These examples demonstrate that complex protein 

topologies can be achieved by simple variations and 

combinations of the two basic mechanisms shown in Figures 

15-16 and 15-17. 


ANSWER 15-5 

A. Clathrin coats cannot assemble in the absence of 
adaptins that link the clathrin to the membrane. At high 
clathrin concentrations and under the appropriate ionic 
conditions, clathrin cages assemble in solution, but 
they are empty shells, lacking other proteins, and they 
contain no membrane. This shows that the information 
to form clathrin baskets is contained in the clathrin 
molecules themselves, which are therefore able to self- 
assemble. 

B. Without clathrin, adaptins still bind to receptors in the 
membrane, but no clathrin coat can form and thus no 
clathrin-coated pits or vesicles are produced. 

C. Deeply invaginated clathrin-coated pits form on the 
membrane, but they do not pinch off to form closed 
vesicles (see Figure A15—21B). 


D. Prokaryotic cells do not perform endocytosis. A 
prokaryotic cell therefore does not contain any receptors 
with appropriate cytosolic tails that could mediate 
adaptin binding. Therefore, no clathrin can bind and no 
clathrin coats can assemble. 


ANSWER 15-6 The preassembled sugar chain allows 
better quality control. The assembled oligosaccharide 
chains can be checked for accuracy before they are 

added to the protein; if a mistake were made in adding 
sugars individually to the protein, the whole protein 

would have to be discarded. Because far more energy 

is used in building a protein than in building a short 
oligosaccharide chain, this is a much more economical 
strategy. The difficulty of modifying oligosaccharides 
precisely becomes apparent as the protein moves to the cell 
surface: although sugar chains are continually modified by 
enzymes in various compartments of the secretory pathway, 
these modifications are often incomplete and result in 
considerable heterogeneity of the glycoproteins that leave 
the cell. This heterogeneity is largely due to the restricted 
access that the enzymes have to the sugar trees attached 

to the surface of proteins. The heterogeneity also explains 
why glycoproteins are more difficult to study and purify than 
nonglycosylated proteins. 


ANSWER 15-7 Aggregates of the secretory proteins would 
form in the ER, just as they do in the trans Golgi network. 
As the aggregation is specific for secretory proteins, ER 
proteins would be excluded from the aggregates. The 
aggregates would eventually be degraded. 


ANSWER 15-8 Transferrin without Fe bound does not 
interact with its receptor and circulates in the bloodstream 
until it catches an Fe ion. Once iron is bound, the iron- 
transferrin complex can bind to the transferrin receptor on 
the surface of a cell and be endocytosed. Under the acidic 
conditions of the endosome, the transferrin releases its 
iron, but the transferrin remains bound to the transferrin 
receptor, which is recycled back to the cell surface, where 
it encounters the neutral pH environment of the blood. The 
neutral pH causes the receptor to release the transferrin 
into the circulation, where it can pick up another Fe ion to 
repeat the cycle. The iron released in the endosome, like the 
LDL in Figure 15-33, moves on to lysosomes, from where it 
is transported into the cytosol. 

The system allows cells to take up iron efficiently even 
though the concentration of iron in the blood is extremely 
low. The iron bound to transferrin is concentrated at the 
cell surface by binding to transferrin receptors; it becomes 
further concentrated in clathrin-coated pits, which collect 
the transferrin receptors. In this way, transferrin cycles 
between the blood and endosomes, delivering the iron that 
cells need to grow. 


ANSWER 15-9 

A. True. 

B. False. The signal sequences that direct proteins to 
the ER contain a core of eight or more hydrophobic 
amino acids. The sequence shown here contains many 
hydrophilic amino acid side chains, including the charged 
amino acids His, Arg, Asp, and Lys, and the uncharged 
hydrophilic amino acids Gln and Ser. 

C. True. Otherwise they could not dock at the correct 
target membrane or recruit a fusion complex to a 
docking site. 


Answers 


. True. 

. True. Lysosomal proteins are selected in the trans Golgi 
network and packaged into transport vesicles that 
deliver them to the late endosome. If not selected, they 
would enter by default into transport vesicles that move 
constitutively to the cell surface. 

F. False. Lysosomes also digest internal organelles by 

autophagy. 

G. False. Mitochondria do not participate in vesicular 

transport, and therefore N-linked glycoproteins, 
which are exclusively assembled in the ER, cannot be 
transported to mitochondria. 

H. False. The outer nuclear membrane is continuous with 

the ER and all proteins made by ribosomes bound there 

end up in the ER lumen. 
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ANSWER 15-10 They must contain a nuclear localization 
signal as well. Proteins with nuclear export signals shuttle 
between the nucleus and the cytosol. An example is the A1 
protein, which binds to mRNAs in the nucleus and guides 
them through the nuclear pores. Once in the cytosol, a 
nuclear localization signal ensures that the A1 protein is re- 
imported so that it can participate in the export of further 
mRNAs. 


ANSWER 15-11 Influenza virus enters cells by endocytosis 
and is delivered to endosomes, where it encounters 

an acidic pH that activates its fusion protein. The viral 
membrane then fuses with the membrane of the endosome, 
releasing the viral genome into the cytosol (Figure A15-11). 
NH3 is a small molecule that readily penetrates membranes. 
Thus, it can enter all intracellular compartments, including 
endosomes, by diffusion. Once in a compartment that 

has an acidic pH, NH3 binds H* to form NH4q", which is a 
charged ion and therefore cannot cross the membrane 

by diffusion. NH4* ions therefore accumulate in acidic 
compartments, raising their pH. When the pH of the 
endosome is raised, viruses are still endocytosed, but 
because the viral fusion protein cannot be activated, the 
virus cannot enter the cytosol. Remember this the next time 
you have the flu and have access to a stable. 
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ANSWER 15-12 

A. The problem is that vesicles having two different kinds 
of v-SNAREs in their membrane could dock on either of 
two different membranes. 

B. The answer to this puzzle is currently not known, but 
we can predict that cells must have ways of turning 
the docking ability of SNAREs on and off. This may be 
achieved through other proteins that are, for example, 
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co-packaged in the ER with SNAREs into transport 
vesicles and facilitate the interactions of the correct 
v-SNARE with the t-SNARE in the cis Golgi network. 


ANSWER 15-13 Synaptic transmission involves the release 
of neurotransmitters by exocytosis. During this event, the 
membrane of the synaptic vesicle fuses with the plasma 
membrane of the nerve terminals. To make new synaptic 
vesicles, membrane must be retrieved from the plasma 
membrane by endocytosis. This endocytosis step is blocked 
if dynamin is defective, as the protein is required to pinch 
off the clathrin-coated endocytic vesicles. 


ANSWER 15-14 The first two sentences are correct. The 
third is not. It should read: “Because the contents of the 
lumen of the ER, or any other compartment in the secretory 
or endocytic pathways, never mix with the cytosol, proteins 
that enter these pathways will never need to be imported 
again.” 


ANSWER 15-15 The protein is translocated into the ER. 

Its ER signal sequence is recognized as soon as it emerges 
from the ribosome. The ribosome then becomes bound 

to the ER membrane, and the growing polypeptide chain 

is transferred through the ER translocation channel. The 
nuclear localization sequence is therefore never exposed to 
the cytosol. It will never encounter nuclear import receptors, 
and the protein will not enter the nucleus. 


ANSWER 15-16 (1) Proteins are imported into the 

nucleus after they have been synthesized, folded, and, 

if appropriate, assembled into complexes. In contrast, 
unfolded polypeptide chains are translocated into the ER 
as they are being made by the ribosomes. Ribosomes are 
assembled in the nucleus yet function in the cytosol, and 
the enzyme complexes that catalyze RNA transcription 

and splicing are assembled in the cytosol yet function 

in the nucleus. Thus, both ribosomes and these enzyme 
complexes need to be transported through the nuclear 
pores intact. (2) Nuclear pores are gates, which are always 
open to small molecules; in contrast, translocation channels 
in the ER membrane are normally closed, and open only 
after the ribosome has attached to the membrane and the 
translocating polypeptide chain has sealed the channel from 
the cytosol. It is important that the ER membrane remain 
impermeable to small molecules during the translocation 
process, as the ER is a major store for Ca?2* in the cell, and 
Ca?* release into the cytosol must be tightly controlled 
(discussed in Chapter 16). (3) Nuclear localization signals 
are not cleaved off after protein import into the nucleus; in 
contrast, ER signal peptides are usually cleaved off. Nuclear 
localization signals are needed to repeatedly re-import 
nuclear proteins after they have been released into the 
cytosol during mitosis, when the nuclear envelope breaks 
down. 


ANSWER 15-17 The transient intermixing of nuclear 

and cytosolic contents during mitosis supports the idea 
that the nuclear interior and the cytosol are indeed 
evolutionarily related. In fact, one can consider the nucleus 
as a subcompartment of the cytosol that has become 
surrounded by the nuclear envelope, with access only 
through the nuclear pores. 


ANSWER 15-18 The actual explanation is that the single 
amino acid change causes the protein to misfold slightly 


so that, although it is still active as a protease inhibitor, it 

is prevented by chaperone proteins in the ER from exiting 
this organelle. It therefore accumulates in the ER lumen 
and is eventually degraded. Alternative interpretations 
might have been that (1) the mutation affects the stability 
of the protein in the bloodstream so that it is degraded 
much faster in the blood than the normal protein, or (2) the 
mutation inactivates the ER signal sequence and prevents 
the protein from entering the ER. (3) Another explanation 
could have been that the mutation altered the sequence to 
create an ER retention signal, which would have retained the 
mutant protein in the ER. One could distinguish between 
these possibilities by using fluorescently tagged antibodies 
against the protein or by expressing the protein as a fusion 
with GFP to follow its transport in the cells (see How We 
Know, pp. 520-521). 


ANSWER 15-19 Critique: “Dr. Outonalimb proposes to 
study the biosynthesis of forgettin, a protein of significant 
interest. The main hypothesis on which this proposal is 
based, however, requires further support. In particular, it is 
questionable whether forgettin is indeed a secreted protein, 
as proposed. ER signal sequences are normally found at 

the N-terminus. C-terminal hydrophobic sequences will be 
exposed outside the ribosome only after protein synthesis 
has already terminated and can therefore not be recognized 
by an SRP during translation. It is therefore unlikely that 
forgettin will be translocated by an SRP-dependent 
mechanism; it is more likely that it will remain in the cytosol. 
Dr. Outonalimb should take these considerations into 
account when submitting a revised application.” 


ANSWER 15-20 The Golgi apparatus may have evolved 
from specialized patches of ER membrane. These regions of 
the ER might have pinched off, forming a new compartment 
(Figure A15-20), which still communicates with the 

ER by vesicular transport. For the newly evolved Golgi 
compartment to be useful, transport vesicles would also 
have to have evolved. 


Figure A15-20 


ANSWER 15-21 This is a chicken-and-egg question. In fact, 
the situation never arises in present-day cells, although 
it must have posed a considerable problem for the first 


cells that evolved. New cell membranes are made by 
expansion of existing membranes, and the ER is never 
made de novo. There will always be an existing piece of ER 
with translocation channels to integrate new translocation 
channels. Inheritance is therefore not limited to the 
propagation of the genome; a cell’s organelles must also 
be passed from generation to generation. In fact, the ER 
translocation channels can be traced back to structurally 
related translocation channels in the prokaryotic plasma 
membrane. 


ANSWER 15-22 
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ANSWER 15-23 A single, incomplete round of nuclear 
import would occur. Because nuclear transport is fueled by 
GTP hydrolysis, under conditions of insufficient energy, GTP 
would be used up and no Ran-GTP would be available to 
unload the cargo protein from its nuclear import receptor 
upon arrival in the nucleus (see Figure 15-10). Unable to 
release its cargo, the nuclear import receptor would be 
stuck at the nuclear pore and not return to the cytosol. 
Because the nuclear cargo protein is not released, it would 
not be functional, and no further import could occur. 


anmMmooOm> 


Chapter 16 


ANSWER 16-1 Most paracrine signaling molecules are very 
short-lived after they are released from a signaling cell: they 
are either degraded by extracellular enzymes or are rapidly 
taken up by neighboring target cells. In addition, some 
become attached to the extracellular matrix and are thus 
prevented from diffusing too far. 


ANSWER 16-2 The protein could be an enzyme that 
produces a large number of small intracellular signaling 
molecules such as cyclic AMP or cyclic GMP. Or, it could 
be an enzyme that modifies a large number of intracellular 
target proteins—for example, by phosphorylation. 


ANSWER 16-3 The mutant G protein would be almost 
continuously activated, because GDP would dissociate 
spontaneously, allowing GTP to bind even in the absence 
of an activated GPCR. The consequences for the cell would 
therefore be similar to those caused by cholera toxin, which 
modifies the a subunit of G; so that it cannot hydrolyze 
GTP to shut itself off. In contrast to the cholera toxin case, 
however, the mutant G protein would not stay permanently 
activated: it would switch itself off normally, but then 

it would instantly become activated again as the GDP 
dissociated and GTP re-bound. 


ANSWER 16-4 Rapid breakdown keeps the intracellular 
cyclic AMP concentrations low. The lower the cAMP levels 
are, the larger and faster the increase achieved upon 
activation of adenylyl cyclase, which makes new cyclic AMP. 
If you have $100 in the bank and you deposit another $100, 
you have doubled your wealth; if you have only $10 to start 
with and you deposit $100, you have increased your wealth 


Answers A:41 


tenfold, a much larger proportional increase resulting from 
the same deposit. 


ANSWER 16-5 Recall that the plasma membrane 
constitutes a rather small area compared with the total 
membrane surfaces in a cell (discussed in Chapter 15). The 
endoplasmic reticulum is especially abundant and spans the 
entire volume of the cell as a vast network of membrane 
tubes and sheets. The Ca** stored in the endoplasmic 
reticulum can therefore be released throughout the cytosol. 
This is important because the rapid clearing of Ca?* ions 
from the cytosol by Ca?* pumps prevents Ca** from 
diffusing any significant distance in the cytosol. 


ANSWER 16-6 Each reaction involved in the amplification 
scheme must be turned off to reset the signaling pathway 
to a resting level. Each of these off switches is equally 
important. 


ANSWER 16-7 Because each antibody has two antigen- 
binding sites, it can cross-link the receptors and cause them 
to cluster on the cell surface. This clustering is likely to 
activate RTKs, which are usually activated by dimerization. 
For RTKs, clustering allows the individual kinase domains 

of the receptors to phosphorylate adjacent receptors in 

the cluster. The activation of GPCRs is more complicated, 
because the ligand has to induce a particular conformational 
change; only very special antibodies mimic receptor ligands 
sufficiently well to induce the conformational change that 
activates a GPCR. 


ANSWER 16-8 

A. True. Acetylcholine, for example, slows the beating of 
heart muscle cells by binding to a GPCR, and stimulates 
the contraction of skeletal muscle cells by binding to a 
different acetylcholine receptor, which is an ion-channel- 
coupled receptor. 

B. False. Acetylcholine is short-lived and exerts its effects 

locally. Indeed, the consequences of prolonging its 

lifetime can be disastrous. Compounds that inhibit the 
enzyme acetylcholinesterase, which normally breaks 
down acetylcholine at a nerve—muscle synapse, are 
extremely toxic: for example, the nerve gas sarin, used 
in chemical warfare, is an acetylcholinesterase inhibitor. 

True. Nucleotide-free By complexes can activate ion 

channels, and GTP-bound a subunits can activate 

enzymes. The GDP-bound form of trimeric G proteins is 
the inactive state. 

D. True. The inositol phospholipid that is cleaved to 
produce IP3 contains three phosphate groups, one of 
which links the sugar to the diacylglycerol lipid. IP3 is 
generated by a simple hydrolysis reaction (see Figure 
16-23). 

E. False. Calmodulin senses but does not regulate 
intracellular Ca?* levels. 

F. True. See Figure 16-35. 

G. True. See Figure 16-29. 


ANSWER 16-9 

1. You would expect a high background level of Ras 
activity, because Ras cannot be turned off efficiently. 

2. Because many Ras molecules are already GTP-bound, 
Ras activity in response to an extracellular signal would 
be greater than normal, but this activity would be liable 
to saturate when all Ras molecules are converted to the 
GTP-bound form. 
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3. The response to a signal would be much less rapid, 
because the signal-dependent increase in GTP-bound 
Ras would occur over an elevated background of 
preexisting GTP-bound Ras. 

4. The increase in Ras activity in response to a signal would 
also be prolonged compared to the response in normal 
cells. 


ANSWER 16-10 


A. Both types of signaling can occur over a long range: 
neurons can send action potentials along very long 
axons (think of the axons in the neck of a giraffe, for 
example), and hormones are carried via the bloodstream 
throughout the organism. Because neurons secrete large 
amounts of neurotransmitters at a synapse, a small, well- 
defined space between two cells, the concentrations 
of these signal molecules are high; neurotransmitter 
receptors, therefore, need to bind to neurotransmitters 
with only low affinity. Hormones, in contrast, are vastly 
diluted in the bloodstream, where they circulate at often 
minuscule concentrations; hormone receptors therefore 
generally bind their hormone with extremely high 
affinity. 

B. Whereas neuronal signaling is a private affair, with one 
neuron talking to a select group of target cells through 
specific synaptic connections, endocrine signaling 
is a public announcement, with any target cell with 
appropriate receptors able to respond to the hormone in 
the blood. Neuronal signaling is very fast, limited only by 
the speed of propagation of the action potential and the 
workings of the synapse, whereas endocrine signaling is 
slower, limited by blood flow and diffusion over larger 
distances. 


ANSWER 16-11 

A. There are 100,000 molecules of X and 10,000 molecules 
of Y in the cell (= rate of synthesis x average lifetime). 

B. After one second, the concentration of X will 
have increased by 10,000 molecules per cell. The 
concentration of X, therefore, one second after its 
synthesis is increased, is about 110,000 molecules per 
cell—which is a 10% increase over the concentration of 
X before the boost of its synthesis. The concentration 
of Y will also increase by 10,000 molecules per cell, 
which, in contrast to X, represents a full twofold increase 
in its concentration (for simplicity, we can neglect the 
breakdown in this estimation because X and Y are 
relatively stable during the one-second stimulation). 

C. Because of its larger proportional increase, Y is the 
preferred signaling molecule. This calculation illustrates 
the surprising but important principle that the time it 
takes to switch a signal on is determined by the lifetime 
of the signaling molecule. 


ANSWER 16-12 

A. The mutant RTK lacking its extracellular ligand-binding 
domain is inactive. It cannot bind extracellular signals, 
and its presence has no consequences for the function 
of the normal RTK (Figure A16-1 2A). If the mutant 
receptors are present at extremely high levels, however, 
they might dimerize in the absence of the extracellular 
signal molecule, causing activation of signaling. 

B. The mutant RTK lacking its intracellular domain is also 
inactive, but its presence will block signaling by the 
normal receptors. When a signal molecule binds to 
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either receptor, it will induce their dimerization. Two 
normal receptors have to come together to activate 
each other by phosphorylation. In the presence of an 
excess of mutant receptors, however, normal receptors 
will usually form mixed dimers, in which their intracellular 
domain cannot be activated because their partner is a 
mutant and lacks a kinase domain (Figure A16-12B). 


ANSWER 16-13 The statement is largely correct. Upon 
ligand binding, transmembrane helices of multispanning 
receptors, like the GPCRs, shift and rearrange with respect 
to one another (Figure A16-13A). This conformational 
change is sensed on the cytosolic side of the membrane 
because of a change in the arrangement of the cytoplasmic 
loops. A single transmembrane segment is not sufficient 
to transmit a signal across the membrane directly; no 
rearrangements in the membrane are possible upon ligand 
binding. Thus, upon ligand binding, single-span receptors 
such as most RTKs tend to dimerize, thereby bringing their 
intracellular kinase domains into proximity so that 

they can cross-phosphorylate and activate each other 
(Figure A16-13B). 
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ANSWER 16-14 Activation in both cases depends 

on proteins that catalyze GDP-GTP exchange on the 

G protein or Ras protein. Whereas activated GPCRs 
perform this function directly for G proteins, enzyme- 

linked receptors assemble multiple signaling proteins into 

a signaling complex when the receptors are activated by 
phosphorylation; one of these proteins is an adaptor protein 
that recruits a guanine nucleotide exchange factor that 
fulfills this function for Ras. 


ANSWER 16-15 Because the cytosolic concentration of 
Ca?" is so low, an influx of relatively few Ca?* ions leads 
to large changes in its cytosolic concentration. Thus, a 
tenfold increase in cytosolic Ca?* can be achieved by 
raising its concentration into the micromolar range, which 
would require far fewer ions than would be required to 
change significantly the cytosolic concentration of a more 
abundant ion such as Nat. In muscle, a greater than tenfold 
change in cytosolic Ca2* concentration can be achieved 
in microseconds by releasing Ca?* from the sarcoplasmic 
reticulum, a task that would be difficult to accomplish if 
changes in the millimolar range were required. 


ANSWER 16-16 In a multicellular organism such as an 
animal, it is important that cells survive only when and 
where they are needed. Having cells depend on signals 
from other cells may be a simple way of ensuring this. A 
misplaced cell, for example, would probably fail to get 

the survival signals it needs (as its neighbors would be 
inappropriate) and would therefore kill itself. This strategy 
can also help regulate cell numbers: if cell type A depends 
on a survival signal from cell type B, the number of B cells 
could control the number of A cells by making a limited 
amount of the survival signal, so that only a certain number 
of A cells could survive. There is indeed evidence that such a 
mechanism does operate to help regulate cell numbers—in 
both developing and adult tissues (see Figure 18-41). 


ANSWER 16-17 Ca**-activated Ca** channels create 

a positive feedback loop: the more Ca2* that is released, 
the more Ca?* channels that open. The Ca?* signal in the 
cytosol is therefore propagated explosively throughout the 
cardiac muscle cell, thereby ensuring that all myosin—actin 
filaments contract almost synchronously. 


ANSWER 16-18 K2 activates K1. If K1 is permanently 
activated, a response is observed regardless of the status of 
K2. If the order were reversed, K1 would need to activate 
K2, which cannot occur because in our example K2 contains 
an inactivating mutation. 


ANSWER 16-19 

A. Three examples of extended signaling pathways to the 
nucleus are: (1) extracellular signal — RTK — adaptor 
protein — Ras-activating protein > MAP kinase 
kinase kinase — MAP kinase kinase — MAP kinase > 
transcription regulator; (2) extracellular signal > GPCR 
— G protein — phospholipase C — IP3 > Cat — 
calmodulin — CaM-kinase — transcription regulator; 
(3) extracellular signal — GPCR — G protein — adenylyl 
cyclase — cyclic AMP — PKA > transcription regulator. 

B. An example of a direct signaling pathway to the nucleus 
is Delta Notch —> cleaved Notch tail — transcription. 


ANSWER 16-20 When PI 3-kinase is activated by 
an activated RTK, it phosphorylates a specific inositol 
phospholipid in the plasma membrane. The resulting 
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phosphorylated inositol phospholipid then recruits to the 
plasma membrane both Akt and another protein kinase that 
helps phosphorylate and activate Akt. A third kinase that 

is permanently associated with the membrane also helps 
activate Akt (see Figure 16-32). 


ANSWER 16-21 Polar groups are hydrophilic, so 
cholesterol, with only one polar -OH group, would be too 
hydrophobic to be an effective hormone by itself. Because 
it is virtually insoluble in water, it could not move readily as 
a messenger from one cell to another via the extracellular 
fluid, unless carried by specific proteins. 


ANSWER 16-22 In the case of the steroid-hormone 
receptor, a one-to-one complex of steroid and receptor 
binds to DNA to activate or inactivate gene transcription; 
there is thus no amplification between ligand binding 

and transcriptional regulation. Amplification occurs later, 
because transcription of a gene gives rise to many mRNAs, 
each of which is translated to give many copies of the 
protein it encodes (discussed in Chapter 7). For the ion- 
channel-coupled receptor, a single ion channel will let 
through thousands of ions in the time it remains open; this 
serves as the amplification step in this type of signaling 
system. 


ANSWER 16-23 The more steps there are in an 
intracellular signaling pathway, the more places the cell 

has to regulate the pathway, amplify the signal, integrate 
signals from different pathways, and spread the signal along 
divergent paths (see Figure 16-9). 


ANSWER 16-24 Animals and plants are thought to 

have evolved multicellularity independently, and therefore 
will be expected to have evolved some distinct signaling 
mechanisms for their cells to communicate with one another. 
On the other hand, animal and plant cells are thought to 
have evolved from a common eukaryotic ancestor cell, 

and so plants and animals would be expected to share 
some intracellular signaling mechanisms that the common 
ancestor cell used to respond to its environment. 


Chapter 17 


ANSWER 17-1 Cells that migrate rapidly from one place 
to another, such as amoebae (A) and sperm cells (F), do not 
in general need intermediate filaments in their cytoplasm, 
since they do not develop or sustain large tensile forces. 
Plant cells (G) are pushed and pulled by the forces of wind 
and water, but they resist these forces by means of their 
rigid cell walls rather than by their cytoskeleton. Epithelial 
cells (B), smooth muscle cells (C), and the long axons of 
nerve cells (E) are all rich in cytoplasmic intermediate 
filaments, which prevent them from rupturing as they are 
stretched and compressed by the movements of their 
surrounding tissues. All of the above eukaryotic cells 
possess intermediate filaments in their nuclear lamina. 
Bacteria, such as Escherichia coli (D), have none whatsoever. 


ANSWER 17-2 Two tubulin dimers have a lower affinity for 
each other (because of a more limited number of interaction 
sites) than a tubulin dimer has for the end of a microtubule 
(where there are multiple possible interaction sites, both 
end-to-end for tubulin dimers adding to a protofilament, 
and side-to-side for the tubulin dimers interacting with 
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tubulin subunits in adjacent protofilaments forming the 
ringlike cross section). Thus, to initiate a microtubule from 
scratch, enough tubulin dimers have to come together, and 
remain bound to one another for long enough, for other 
tubulin molecules to add to them. Only when a number 

of tubulin dimers have already assembled will the binding 
of the next subunit be favored. The formation of these 
initial “nucleating sites” is therefore rare and does not 
occur spontaneously at cellular concentrations of tubulin. 
Centrosomes contain preassembled rings of y-tubulin (in 
which the y-tubulin subunits are held together in much 
tighter side-to-side interactions than aB-tubulin can form) 
to which aB-tubulin dimers can bind. The binding conditions 
of af-tubulin dimers resemble those of adding to the end 
of an assembled microtubule. The y-tubulin rings in the 
centrosome can therefore be thought of as permanently 
preassembled nucleation sites. 


ANSWER 17-3 

A. The microtubule is shrinking because it has lost its 
GTP cap; that is the tubulin subunits at its end are all 
in their GDP-bound form. GTP-loaded tubulin subunits 
from solution will still add to this end, but they will be 
short-lived—either because they hydrolyze their GTP 
or because they fall off as the microtubule rim around 
them disassembles. If, however, sufficient GTP-loaded 
subunits are added quickly enough to cover up the GDP- 
containing tubulin subunits at the microtubule end, a 
new GTP cap can form and regrowth is favored. 

B. The rate of addition of GTP-tubulin will be greater at 
higher tubulin concentrations. The frequency with which 
shrinking microtubules switch to the growing mode will 
therefore increase with increasing tubulin concentration. 
The consequence of this regulation is that the system is 
self-balancing: the more microtubules shrink (resulting 
in a higher concentration of free tubulin), the more 
frequently microtubules will start to grow again. 
Conversely, the more microtubules grow, the lower the 
concentration of free tubulin will become and the rate of 
GTP-tubulin addition will slow down; at some point, GTP 
hydrolysis will catch up with new GTP-tubulin addition, 
the GTP cap will be destroyed, and the microtubule will 
switch to the shrinking mode. 

C. If only GDP were present, microtubules would continue 
to shrink and eventually disappear, because tubulin 
dimers with GDP have very low affinity for each other 
and will not add stably to microtubules. 

D. If GTP is present but cannot be hydrolyzed, microtubules 
will continue to grow until all free tubulin subunits have 
been used up. 


ANSWER 17-4 If all the dynein arms were equally active, 
there could be no significant relative motion of one 
microtubule to the other as required for bending. (Think of a 
circle of nine weightlifters, each trying to lift his neighbor off 
the ground: if they all succeeded, the group would levitate!). 
Thus, a few ciliary dynein molecules must be activated 
selectively on one side of the cilium. As they move their 
neighboring microtubules toward the tip of the cilium, the 
cilium bends away from the side containing the activated 
dyneins. 


ANSWER 17-5 Any actin-binding protein that stabilizes 
complexes of two or more actin monomers without blocking 


the ends required for filament growth will facilitate the 
initiation of a new filament (nucleation). 


ANSWER 17-6 Only fluorescent actin molecules assembled 
into filaments are visible, because unpolymerized actin 
molecules diffuse so rapidly that they produce a dim, 
uniform background. Since, in your experiment, so few 
actin molecules are labeled (1:10,000), there should be 

at most one labeled actin monomer per filament (see 
Figure 17-30). The lamellipodium as a whole has many 
actin filaments, some of which overlap, and it therefore 
shows a random, speckled pattern of actin molecules, each 
marking a different filament. This technique (called “speckle 
fluorescence”) can be used to follow the movement of 
polymerized actin in a migrating cell. If you watch this 
pattern with time, you will see that individual fluorescent 
spots move steadily back from the leading edge toward 

the interior of the cell, a movement that occurs whether 

or not the cell is actually migrating. Rearward movement 
takes place because actin monomers are added to filaments 
at the plus end and are lost from the minus end (where 
they are depolymerized) (see Figure 17—35B). In effect, 
actin monomers “move through” the actin filaments, a 
phenomenon termed “treadmilling.” Treadmilling has been 
demonstrated to occur in isolated actin filaments in solution 
and also in dynamic microtubules, such as those within a 
mitotic spindle. 


ANSWER 17-7 Cells contain actin-binding proteins that 
bundle and cross-link actin filaments (see Figure 17—32). The 
filaments extending the lamellipodia and filopodia are firmly 
anchored in the filamentous meshwork of the cell cortex, 
thus providing the mechanical anchorage required for the 
growing rodlike filaments to deform the cell membrane. 


ANSWER 17-8 Although the subunits are indeed held 
together by noncovalent bonds that are individually weak, 
there are a very large number of them, distributed among 
a very large number of filaments. As a result, the stress a 
human being exerts by lifting a heavy object is dispersed 
over so many subunits that their interaction strength is not 
exceeded. By analogy, a single thread of silk is not nearly 
strong enough to hold a human, but a rope woven of such 
fibers is. 


ANSWER 17-9 Both filaments are composed of subunits 
in the form of protein dimers that are held together by 
coiled-coil interactions. Moreover, in both cases, the dimers 
polymerize through their coiled-coil domains into filaments. 
Whereas intermediate filament dimers assemble head-to- 
head, however, and thereby create a filament that has no 
polarity, all myosin molecules in the same half of the myosin 
filament are oriented with their heads pointing in the same 
direction. This polarity is necessary for them to be able to 
develop a contractile force in muscle. 


ANSWER 17-10 

A. Successive actin molecules in an actin filament are 
identical in position and conformation. After a first 
protein (such as troponin) has bound to the actin 
filament, there would be no way in which a second 
protein could recognize every seventh monomer in a 
naked actin filament. Tropomyosin, however, binds along 
the length of an actin filament, spanning precisely seven 
monomers, and thus provides a molecular “ruler” that 
measures the length of seven actin monomers. Troponin 


becomes localized by binding to the evenly spaced ends 
of tropomyosin molecules. 

B. Calcium ions influence force generation in the actin- 
myosin system only if both troponin (to bind the calcium 
ions) and tropomyosin (to transmit the information to 
the actin filament that troponin has bound calcium) 
are present. (i) Troponin cannot bind to actin without 
tropomyosin. The actin filament would be permanently 
exposed to the myosin, and the system would be 
continuously active, independently of whether calcium 
ions were present or not (a muscle cell would therefore 
be continuously contracted with no possibility of 
regulation). (ii) Tropomyosin would bind to actin and 
block binding of myosin completely; the system would 
be permanently inactive, no matter whether calcium 
ions were present, because tropomyosin is not affected 
by calcium. (iii) The system will contract in response to 
calcium ions. 


ANSWER 17-11 

A. True. A continual outward movement of ER is required; 
in the absence of microtubules, the ER collapses toward 
the center of the cell. 

B. True. Actin is needed to make the contractile ring that 
causes the physical cleavage between the two daughter 
cells, whereas the mitotic spindle that partitions the 
chromosomes is composed of microtubules. 

C. True. Both extensions are associated with 
transmembrane proteins that protrude from the plasma 
membrane and enable the cell to form new anchor 
points on the substratum. 

D. False. To cause bending, ATP is hydrolyzed by the 
dynein motor proteins that are attached to the outer 
microtubules in the flagellum. 

E. False. Cells could not divide without rearranging 
their intermediate filaments, but many terminally 
differentiated and long-lived cells, such as nerve cells, 
have stable intermediate filaments that are not known to 
depolymerize. 

F. False. The rate of growth is independent of the size of 
the GTP cap. The plus and minus ends have different 
growth rates because they have physically distinct 
binding sites for the incoming tubulin subunits; the rate 
of addition of tubulin subunits differs at the two ends. 

G. True. Both are nice examples of how the same 
membrane can have regions that are highly specialized 
for a particular function. 

H. False. Myosin movement is activated by the 
phosphorylation of myosin, or by calcium binding to 
troponin. 


ANSWER 17-12 The average time taken for a small 
molecule (such as ATP) to diffuse a distance of 10 um is 
given by the calculation 


(10-3)2 / (2 x 5 x 10-76) = 0.1 seconds 


Similarly, a protein takes 1 second and a vesicle 10 seconds 
on average to travel 10 um. A vesicle would require on 
average 10? seconds, or more than 30 years, to diffuse to 
the end of a 10 cm axon. Motorized transport at 1 um/sec 
would require 10° seconds, or 28 hours. These calculations 
make it clear why kinesin and other motor proteins evolved 
to carry molecules and organelles along microtubules. 


ANSWER 17-13 (1) Animal cells are much larger and 
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more diversely shaped than bacteria, and they do not have 
a cell wall. Cytoskeletal elements are required to provide 
mechanical strength and shape in the absence of a cell wall. 
(2) Animal cells, and all other eukaryotic cells, have a nucleus 
that is shaped and held in place in the cell by intermediate 
filaments; the nuclear lamins attached to the inner nuclear 
membrane support and shape the nuclear membrane, and a 
meshwork of intermediate filaments surrounds the nucleus 
and spans the cytosol. (3) Animal cells can move by a 
process that requires a change in cell shape. Actin filaments 
and myosin motor proteins are required for these activities. 
(4) Animal cells have a much larger genome than bacteria; 
this genome is fragmented into many chromosomes. 

For cell division, chromosomes need to be accurately 
distributed to the daughter cells, requiring the function of 
the microtubules that form the mitotic spindle. (5) Animal 
cells have internal organelles. Their localization in the cell 

is dependent on motor proteins that move them along 
microtubules. A remarkable example is the long-distance 
travel of membrane-enclosed vesicles (organelles) along 
microtubules in an axon that can be up to 1 m long in the 
case of the nerve cells that extend from your spinal cord to 
your feet. 


ANSWER 17-14. The ends of an intermediate filament are 
indistinguishable from each other, because the filaments are 
built by the assembly of symmetrical tetramers made from 

two coiled-coil dimers. In contrast to microtubules and actin 
filaments, intermediate filaments therefore have no polarity. 


ANSWER 17-15 Intermediate filaments have no polarity; 
their ends are chemically indistinguishable. It would 
therefore be difficult to envision how a hypothetical motor 
protein that bound to the middle of the filament could 
sense a defined direction. Such a motor protein would be 
equally likely to attach to the filament facing one end or the 
other. 


ANSWER 17-16 Katanin breaks microtubules along 
their length, and at positions remote from their GTP caps. 
The fragments that form therefore contain GDP-tubulin 
at their exposed ends and rapidly depolymerize. Katanin 
thus provides a very quick means of destroying existing 
microtubules. 


ANSWER 17-17 Cell division depends on the ability of 
microtubules both to polymerize and to depolymerize. This 
is most obvious when one considers that the formation of 
the mitotic spindle requires the prior depolymerization of 
other microtubules to free up the tubulin required to build 
the spindle. This rearrangement is not possible in Taxol- 
treated cells, whereas in colchicine-treated cells, division is 
blocked because a spindle cannot be assembled. On a less 
obvious but no less important level, both drugs block the 
dynamic instability of microtubules and would therefore 
interfere with the workings of the mitotic spindle, even if 
one could be properly assembled. 


ANSWER 17-18 Motor proteins are unidirectional in 

their action; kinesin always moves toward the plus end 

of a microtubule and dynein toward the minus end. Thus 

if kinesin molecules are attached to glass, only those 
individual motors that have the correct orientation in 
relation to the microtubule that settles on them can attach 
to the microtubule and exert force on it to propel it forward. 
Since kinesin moves toward the plus end of the microtubule, 
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the microtubule will always crawl minus-end first over the 
cover slip. 


ANSWER 17-19 

A. Phase A corresponds to a lag phase, during which 
tubulin dimers assemble to form nucleation centers 
(Figure A17-19A). Nucleation is followed by a rapid 
rise (phase B) to a plateau value as tubulin dimers add 
to the ends of the elongating microtubules (Figure 
A17-19B). At phase C, equilibrium is reached, with some 
microtubules in the population growing while others are 
rapidly shrinking (Figure A17-19C). The concentration 
of free tubulin is constant at this point because 
polymerization and depolymerization are balanced 
(see also Question 17-3, p. 586). 

B. The addition of centrosomes introduces nucleation 
sites that eliminate the lag phase A, as shown by the 
red curve in Figure A17-19D. The rate of microtubule 
growth (i.e., the slope of the curve in the elongation 
phase B) and the equilibrium level of free tubulin remain 
unchanged, because the presence of centrosomes 
does not affect the rates of polymerization and 
depolymerization. 


ANSWER 17-20 The ends of the shrinking microtubule 
are visibly frayed, and the individual protofilaments appear 
to come apart and curl as the end depolymerizes. This 
micrograph therefore suggests that the GTP cap (which is 
lost from shrinking microtubules) holds the protofilaments 
properly aligned with each other, perhaps by strengthening 
the side-to-side interactions between af-tubulin subunits 
when they are in their GTP-bound form. 


ANSWER 17-21 Cytochalasin interferes with actin filament 
formation, and its effect on the cell demonstrates the 
importance of actin to cell locomotion. The experiment with 
colchicine shows that microtubules are required to give a 
cell a polarity that then determines which end becomes 

the leading edge (see Figure 17—15). In the absence of 
microtubules, cells still go through the motions normally 
associated with cell movement, such as the extension of 
lamellipodia, but in the absence of cell polarity these are 
futile exercises because they happen indiscriminately in all 
directions. Antibodies bind tightly to the antigen (in this 
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case vimentin) to which they were raised (see Panel 4-2, 

pp. 140-141). When bound, an antibody can interfere with 
the function of the antigen by preventing it from interacting 
properly with other cell components. The antibody injection 
experiment therefore suggests that intermediate filaments 
are not required for the maintenance of cell polarity or for 
the motile machinery. 


ANSWER 17-22 Either (B) or (C) would complete the 
sentence correctly. The direct result of the action potential 
in the plasma membrane is the release of Ca?* into the 
cytosol from the sarcoplasmic reticulum; muscle cells are 
triggered to contract by this rapid rise in cytosolic Ca?*. 
Calcium ions at high concentrations bind to troponin, which 
in turn causes tropomyosin to move to expose myosin- 
binding sites on the actin filaments. (A) and (D) would be 
wrong because Ca?* has no effect on the detachment 

of the myosin head from actin, which is the result of ATP 
hydrolysis. Nor does it have any role in maintaining the 
structure of the myosin filament. 


ANSWER 17-23 Only (D) is correct. Upon contraction, the 
Z discs move closer together, and neither actin nor myosin 
filaments contract (see Figures 17-41 and 17—42). 


Chapter 18 


ANSWER 18-1 Because all cells arise by division of another 
cell, this statement is correct, assuming that “first cell 
division” refers to the division of the successful founder cell 
from which all life as we know it has derived. There were 
probably many other unsuccessful attempts to start the 
chain of life. 


ANSWER 18-2 Cells in peak B contain twice as much DNA 
as those in peak A, indicating that they contain replicated 
DNA, whereas the cells in peak A contain unreplicated DNA. 
Peak A therefore contains cells that are in G41, and peak B 
contains cells that are in G2 and mitosis. Cells in S phase 
have begun but not finished DNA synthesis; they therefore 
have various intermediate amounts of DNA and are found 

in the region between the two peaks. Most cells are in G4, 
indicating that it is the longest phase of the cell cycle (see 
Figure 18-2). 
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ANSWER 18-3 For multicellular organisms, the control of 
cell division is extremely important. Individual cells must not 
proliferate unless it is to the benefit of the whole organism. 
The Go state offers protection from aberrant activation of 
cell division because the cell-cycle control system is largely 
dismantled. If, on the other hand, a cell just paused in G4, 
it would still contain all of the cell-cycle control system and 
could readily be induced to divide. The cell would also have 
to remake the “decision” not to divide almost continuously. 
To re-enter the cell cycle from Go, a cell has to resynthesize 
all of the components that have disappeared. 


ANSWER 18-4 The cell would replicate its damaged 
DNA and therefore would introduce mutations to the two 
daughter cells when the cell divides. Such mutations could 
increase the chances that the progeny of the affected 
daughter cells would eventually become cancer cells. 


ANSWER 18-5 Before injection, the frog oocytes must 
contain inactive M-Cdk. Upon injection of the M-phase 
cytoplasm, the small amount of the active M-Cdk in 

the injected cytoplasm activates the inactive M-Cdk by 
switching on the activating phosphatase (Cdc25), which 
removes the inhibitory phosphate groups from the inactive 
M-Cdk (see Figure 18-17). An extract of the second oocyte, 
now in M phase itself, will therefore contain as much active 
M-Cdk as the original cytoplasmic extract, and so on. 


ANSWER 18-6 The experiment shows that kinetochores 
are not preassigned to one or other spindle pole; 
microtubules attach to the kinetochores that they are able 
to reach. For the chromosome to remain attached to a 
microtubule, however, tension has to be exerted. Tension 
is normally achieved by the opposing pulling forces from 
opposite spindle poles. The requirement for such tension 
ensures that if two sister kinetochores ever become 
attached to the same spindle pole, so that tension is not 
generated, one or both of the connections would be lost, 
and microtubules from the opposing spindle pole would 
have another chance to attach properly. 


ANSWER 18-7 Recall from Figure 18-30 that the new 
nuclear envelope reassembles on the surface of the 
chromosomes. The close apposition of the envelope to 
the chromosomes prevents cytosolic proteins from being 
trapped between the chromosomes and the envelope. 
Nuclear proteins are then selectively imported through 
the nuclear pores, causing the nucleus to expand while 
maintaining its characteristic protein composition. 


ANSWER 18-8 The membranes of the Golgi vesicles fuse 
to form part of the plasma membranes of the two daughter 
cells. The interiors of the vesicles, which are filled with cell 
wall material, become the new cell wall matrix separating 
the two daughter cells. Proteins in the membranes of the 
Golgi vesicles thus become plasma membrane proteins. 
Those parts of the proteins that were exposed to the lumen 
of the Golgi vesicle will end up exposed to the new cell wall 
(Figure A18-8). 


ANSWER 18-9 In a eukaryotic organism, the genetic 
information that the organism needs to survive and 
reproduce is distributed between multiple chromosomes. It 
is therefore crucial that each daughter cell receives a copy 
of each chromosome when a cell divides; if a daughter cell 
receives too few or too many chromosomes, the effects 
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are usually deleterious or even lethal. Only two copies of 
each chromosome are produced by chromosome replication 
in mitosis. If the cell were to randomly distribute the 
chromosomes when it divided, it would be very unlikely 
that each daughter cell would receive precisely one copy 

of each chromosome. In contrast, the Golgi apparatus 
fragments into tiny vesicles that are all alike, and by random 
distribution it is very likely that each daughter cell will 
receive an approximately equal number of them. 


ANSWER 18-10 As apoptosis occurs on a large scale in 
both developing and adult tissues, it must not trigger 

alarm reactions that are normally associated with cell injury. 
Tissue injury, for example, leads to the release of signal 
molecules that stimulate the proliferation of surrounding 
cells so that the wound heals. It also causes the release of 
signals that can cause a destructive inflammatory reaction. 
Moreover, the release of intracellular contents could elicit an 
immune response against molecules that are normally not 
encountered by the immune system. Such reactions would 
be self-defeating if they occurred in response to the massive 
cell death that occurs in normal development. 


ANSWER 18-11 Because the cell population is increasing 
exponentially, doubling its weight at every cell division, the 
weight of the cell cluster after N cell divisions is 

2N x 10-9 g. Therefore, 70 kg (70 x 103 g) = 2N x 10°? g, 
or 2N = 7 x 1013, Taking the logarithm of both sides 

allows you to solve the equation for N. Therefore, 

N = In (7 x 1013) / In 2 = 46; that is it would take only 

46 days if cells proliferated exponentially. Cell division in 
animals is tightly controlled, however, and most cells in 

the human body stop dividing when they become highly 
specialized. The example demonstrates that exponential cell 
proliferation occurs only for very brief periods, even during 
embryonic development. 


ANSWER 18-12 The egg cells of many animals are big 

and contain stores of enough cell components to last for 
many cell divisions. The daughter cells that form during 
the first cell divisions after fertilization are progressively 
smaller in size and thus can be formed without a need for 
new protein or RNA synthesis. Whereas normally dividing 
cells would grow continuously in G1, G2, and S phases, until 
their size doubled, there is no cell growth in these early 
cleavage divisions, and both G and G2? are virtually absent. 
As G4 is usually longer than G2 and S phase, G41 is the most 
drastically reduced phase in these divisions. 
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Answers 


ANSWER 18-13 

A. Radiation leads to DNA damage, which activates a 
regulatory mechanism (mediated by p53 and p21; see 
Figure 18—15) that arrests the cell cycle until the DNA 
has been repaired. 

B. The cell will replicate damaged DNA and thereby 
introduce mutations in the daughter cells when the cell 
divides. 

C. The cell will be able to divide normally, but it will be 
prone to mutations, because some DNA damage always 
occurs as the result of natural irradiation caused, for 
example, by cosmic rays. The mechanism mediated 
by p53 is mainly required as a safeguard against the 
devastating effects of accumulating DNA damage; this 
mechanism is not required for the natural progression of 
the cell cycle in undamaged cells. 

D. Cell division in humans is an ongoing process that does 
not cease upon reaching maturity, and it is required 
for survival. Blood cells and epithelial cells in the skin 
or lining the gut, for example, are being constantly 
produced by cell division to meet the body’s needs; each 
day, your body produces about 1011 new red blood cells 
alone. 


ANSWER 18-14 

A. Only the cells that were in the S phase of their cell cycle 
(i.e., those cells making DNA) during the 30-minute 
labeling period contain any radioactive DNA. 

B. Initially, mitotic cells contain no radioactive DNA 
because these cells were not engaged in DNA synthesis 
during the labeling period. Indeed, it takes about two 
hours before the first labeled mitotic cells appear. 

C. The initial rise of the curve corresponds to cells that 
were just finishing DNA replication when the radioactive 
thymidine was added. The curve rises as more labeled 
cells enter mitosis; the peak corresponds to those cells 
that had just started S phase when the radioactive 
thymidine was added. The labeled cells then exit from 
mitosis, and are replaced by unlabeled mitotic cells, 
which were not yet in S phase during the labeling period. 
After 20 hours, the curve starts rising again, because the 
labeled cells enter their second round of mitosis. 


D. The initial two-hour lag before any labeled mitotic 
cells appear corresponds to the G2 phase, which is the 
time between the end of S phase and the beginning of 
mitosis. The first labeled cells seen in mitosis were those 
that were just finishing S phase (DNA synthesis) when 
the radioactive thymidine was added. 


ANSWER 18-15 Loss of M cyclin leads to inactivation of 
M-Cdk. As a result, the M-Cdk target proteins become 
dephosphorylated by phosphatases, and the cells exit from 
mitosis: they disassemble the mitotic spindle, reassemble 
the nuclear envelope, decondense their chromosomes, and 
so on. The M cyclin is degraded by ubiquitin-dependent 
destruction in proteasomes, and the activation of M-Cdk 
leads to the activation of APC/C, which ubiquitylates 

the cyclin, but with a substantial delay. As discussed in 
Chapter 7, ubiquitylation tags proteins for degradation in 
proteasomes. 


ANSWER 18-16 M cyclin accumulates gradually as it is 
steadily synthesized. As it accumulates, it will tend to form 
complexes with the mitotic Cdk molecules that are present. 
The Cdk in these complexes is inhibited by phosphorylation 
(see Figure 18-10). After a certain threshold level has been 
reached, M-Cdk is activated by the phosphatase Cdc25. 
Once activated, M-Cdk acts to enhance the activity of the 
activating phosphatase; this positive feedback leads to the 
complete activation of M-Cdk (see Figure 18-17). Thus, 

M cyclin accumulation acts like a slow-burning fuse, which 
eventually helps trigger the explosive self-activation of 
M-Cdk. The precipitous destruction of M cyclin terminates 
M-Cdk activity, and a new round of M cyclin accumulation 
begins. 


ANSWER 18-17 The order is F, C, B, A, D. Together, these 
five steps are referred to as mitosis (E). Cytokinesis is the 
last step in M phase, which overlaps with anaphase and 
telophase. Mitosis and cytokinesis are both part of M phase. 


ANSWER 18-18 If the growth rate of microtubules is the 
same in mitotic and in interphase cells, their length is 
proportional to their lifetime. Thus, the average length of 
microtubules in mitosis is 1 um (= 20 um x 15 s/300 s). 


ANSWER 18-19 As shown in Figure A18-19, the 
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overlapping interpolar microtubules from opposite poles 

of the spindle have their plus ends pointing in opposite 
directions. Plus-end directed motor proteins cross-link 
adjacent, antiparallel microtubules together and tend to 
move the microtubules in the direction that will push the 
two poles of the spindle apart, as shown in the figure. 
Minus-end directed motor proteins also cross-link adjacent, 
antiparallel microtubules together but move in the opposite 
direction, tending to pull the spindle poles together (not 
shown). 


ANSWER 18-20 The sister chromatid becomes committed 
when a microtubule from one of the spindle poles 
attaches to the kinetochore of the chromatid. Microtubule 
attachment is still reversible until a second microtubule 
from the other spindle pole attaches to the kinetochore 
of its partner sister chromatid, so that the duplicated 
chromosome is now put under mechanical tension by 
pulling forces from both poles. The tension ensures that 
both microtubules remain attached to the chromosome. 
The position of a chromatid in the cell at the time that the 
nuclear envelope breaks down will influence which spindle 
pole it will be pulled to, as its kinetochore is most likely 
to become attached to the spindle pole toward which it is 
facing. 


ANSWER 18-21 Itis still not certain what drives the 
poleward movement of chromosomes during anaphase. 

In principle, two possible models could explain it 

(Figure A18-21). In the model shown in (A), microtubule 
motor proteins associated with the kinetochore dash 
toward the minus end of the depolymerizing microtubule, 
dragging the chromosome toward the pole. Although this 
model is appealingly simple, there is little evidence that 
motor proteins are required for chromosome movement 
during anaphase. Instead, current experimental evidence 
greatly supports the model outlined in (B). In this model, 
chromosome movement is driven by kinetochore proteins 
that cling to the sides of the depolymerizing microtubule 
(see Figure 18-23). These proteins frequently detach from— 
and reattach to—the kinetochore microtubule. As tubulin 
subunits continue to dissociate, the kinetochore must slide 
poleward to maintain its grip on the retreating end of the 
shrinking microtubule. 
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ANSWER 18-22 Both sister chromatids could end up in the 
same daughter cell for any of a number of reasons. (1) If the 
microtubules or their connections with a kinetochore were 
to break during anaphase, both sister chromatids could be 
drawn to the same pole, and hence into the same daughter 
cell. (2) If microtubules from the same spindle pole attached 
to both kinetochores, the chromosome would be pulled to 
the same pole. (3) If the cohesins that link sister chromatids 
were not degraded, the pair of chromatids might be pulled 
to the same pole. (4) If a duplicated chromosome never 
engaged microtubules and was left out of the spindle, it 
would also end up in one daughter cell. 

Some of these errors in the mitotic process would be 
expected to activate a checkpoint mechanism that delays 
the onset of anaphase until all chromosomes are attached 
properly to both poles of the spindle. This “spindle 
assembly checkpoint” mechanism should allow most 
chromosome-attachment errors to be corrected, which is 
one reason why such errors are rare. The consequences of 
both sister chromatids ending up in one daughter cell are 
usually dire. One daughter cell would contain only one copy 
of all the genes carried on that chromosome and the other 
daughter cell would contain three copies. The altered gene 
dosage, leading to correspondingly changed amounts of 
the mRNAs and proteins produced, is often detrimental to 
the cell. In addition, there is the possibility that the single 
copy of the chromosome may contain a defective gene with 
a critical function, which would normally be taken care of by 
the good copy of the gene on the other chromosome that is 
now missing. 


ANSWER 18-23 

A. True. Centrosomes replicate during interphase, before M 
phase begins. 

B. True. Sister chromatids separate completely only at the 
start of anaphase. 

C. False. The ends of interpolar microtubules overlap and 
attach to one another via proteins (including motor 
proteins) that bridge between the microtubules. 

D. False. Microtubules and their motor proteins play no role 
in DNA replication. 

E. False. To be a correct statement, the terms 
“centromere” and “centrosome” must be switched. 
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ANSWER 18-24 Antibodies bind tightly to the antigen 

(in this case myosin) to which they were raised. When 
bound, an antibody can interfere with the function of the 
antigen by preventing it from interacting properly with 
other cell components. (A) The movement of chromosomes 
at anaphase depends on microtubules and their motor 
proteins and does not depend on actin or myosin. Injection 
of an anti-myosin antibody into a cell will therefore have no 
effect on chromosome movement during anaphase. 

(B) Cytokinesis, on the other hand, depends on the 
assembly and contraction of a ring of actin and myosin 
filaments, which forms the cleavage furrow that splits 

the cell in two. Injection of an anti-myosin antibody will 
therefore block cytokinesis. 


ANSWER 18-25 The plasma membrane of the cell that died 
by necrosis in Figure 18—38A is ruptured; a clear break is 
visible, for example, at a position corresponding to the 

12 o'clock mark on a watch. The cell’s contents, mostly 
membranous and cytoskeletal debris, are seen spilling 

into the surroundings through these breaks. The cytosol 
stains lightly, because most soluble cell components were 
lost before the cell was fixed. In contrast, the cell that 
underwent apoptosis in Figure 18—38B is surrounded by 

an intact membrane, and its cytosol is densely stained, 
indicating a normal concentration of cell components. The 
cell's interior is remarkably different from a normal cell, 
however. Particularly characteristic are the large “blobs” 
that extrude from the nucleus, probably as the result of the 
breakdown of the nuclear lamina. The cytosol also contains 
many large, round, membrane-enclosed vesicles of unknown 
origin, which are not normally seen in healthy cells. The 
pictures visually confirm the notion that necrosis involves 
cell lysis, whereas cells undergoing apoptosis remain 
relatively intact until they are phagocytosed and digested 
by another cell. 


ANSWER 18-26 

A. False. There is no G4 to M phase transition. The 
statement is correct, however, for the G4 to S phase 
transition, in which cells commit themselves to a division 
cycle. 

B. True. Apoptosis is an active process carried out by 
special proteases (caspases). 

C. True. This mechanism is thought to adjust the number of 
neurons to the number of specific target cells to which 
the neurons connect. 

D. True. An amazing evolutionary conservation! 

E. True. Association of a Cdk protein with a cyclin is 
required for its activity (hence its name cyclin-dependent 
kinase). Furthermore, dephosphorylation at specific 
sites on the Cdk protein is required for the cyclin-Cdk 
complex to be active. 


ANSWER 18-27 Cells in an animal must behave for the 
good of the organism as a whole—to a much greater 
extent than people generally act for the good of society as 
a whole. In the context of an organism, unsocial behavior 
would lead to a loss of organization and possibly to 

cancer. Many of the rules that cells have to obey would be 
unacceptable in a human society. Most people, for example, 
would be reluctant to kill themselves for the good of 
society, yet our cells do it all the time. 


ANSWER 18-28 The most likely approach to success (if that 
is what the goal should be called) is plan C, which should 
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result in an increase in cell numbers. The problem is, of 

course, that cell numbers of each tissue must be increased 

similarly to maintain balanced proportions in the organism, 
yet different cells respond to different growth factors. As 
shown in Figure A18-28, however, the approach has indeed 
met with limited success. A mouse producing very large 
quantities of growth hormone (left)—which acts to stimulate 
the production of a secreted protein that acts as a survival 
factor, growth factor, or mitogen, depending on the cell 
type—grows to almost twice the size of a normal mouse 

(right). To achieve this twofold change in size, however, 

growth hormone was massively overproduced (about 

fiftyfold). And note that the mouse did not even attain the 
size of a rat, let alone a dog. 
The other two approaches have conceptual problems: 

A. Blocking all apoptosis would lead to defects in 
development, as rat development requires the selective 
death of many cells. It is unlikely that a viable animal 
would be obtained. 

B. Blocking p53 function would eliminate an important 
mechanism in the cell cycle that detects DNA damage 
and stops the cycle so that the cell can repair the 
damage; removing p53 would increase mutation rates 
and lead to cancer. Indeed, mice without p53 usually 
develop normally but die of cancer at a young age. 


ANSWER 18-29 The on-demand, limited release of PDGF 
at a wound site triggers cell division of neighboring cells 
for a limited amount of time, until the PDGF is degraded. 
This is different from the continuous release of PDGF from 
mutant cells, where PDGF is made in an uncontrolled way 
at high levels. Moreover, the mutant cells that make PDGF 
often express their own PDGF receptor inappropriately, 
so that they can stimulate their own proliferation, thereby 
promoting the development of cancer. 


ANSWER 18-30 All three types of mutant cells would be 

unable to divide. The cells: 

A. would enter mitosis but would not be able to exit 
mitosis. 

B. would arrest permanently in G4 because the cyclin-Cdk 
complexes that act in G; would be inactivated. 

C. would not be able to activate the transcription of 
genes required for cell division because the required 
transcription regulators would be constantly inhibited by 
unphosphorylated Rb. 


ANSWER 18-31 In alcoholism, liver cells proliferate because 
the organ is overburdened and becomes damaged by the 


large amounts of alcohol that have to be metabolized. This 
need for more liver cells activates the control mechanisms 
that normally regulate cell proliferation. Unless badly 
damaged and full of scar tissue, the liver will usually shrink 
back to a normal size after the patient stops drinking 
excessively. In liver cancer, in contrast, mutations abolish 
normal cell proliferation control and, as a result, cells divide 
and keep on dividing in an uncontrolled manner, which is 
usually fatal. 


Chapter 19 


ANSWER 19-1 After the first meiotic division, each nucleus 
has a diploid amount of DNA; however, that DNA effectively 
contains only a haploid set of chromosomes (albeit in two 
copies), representing only one or other homolog of each 
type of chromosome (although some mixing will have 
occurred during crossing-over). Because the maternal and 
paternal chromosomes of a pair will carry different versions 
of many of the genes, these daughter cells will not be 
genetically identical; each one will, however, have lost either 
the paternal or the maternal version of each chromosome. 
In contrast, somatic cells dividing by mitosis inherit a diploid 
set of chromosomes, and all daughter cells are genetically 
identical and inherit both maternal and paternal gene 
copies. The role of gametes produced by meiosis is to mix 
and reassort gene pools during sexual reproduction, and 
thus it is a definite advantage for each of them to have a 
slightly different genetic constitution. The role of somatic 
cells on the other hand is to build an organism that contains 
the same genes in all its cells and retains in each cell both 
maternal and paternal genetic information. 


ANSWER 19-2 A typical human female produces fewer 
than 1000 mature eggs in her lifetime (12 per year over 
about 40 years); this is less than one-tenth of a percent 
of the possible gametes, excluding the effects of meiotic 
crossing-over. A typical human male produces billions of 
sperm during a lifetime, so in principle, every possible 
chromosome combination is sampled many times. 


ANSWER 19-3 For simplicity, consider the situation 
where a father carries genes for two dominant traits, M 
and N, on one of his two copies of human Chromosome 1. 
If these two genes were located at opposite ends of this 
chromosome, and there was one and only one crossover 
event per chromosome as postulated in the question, half 
of his children would express trait M and the other half 
would express trait N—with no child resembling the father 
in carrying both traits. This is very different from the actual 
situation, where there are multiple crossover events per 
chromosome, causing the traits M and N to be inherited as 
if they were on separate chromosomes. By constructing a 
Punnett square like that in Figure 19—27, one can see that 
in this latter, more realistic case, we would actually expect 
one-fourth of the children of this father to inherit both traits, 
one-fourth to inherit trait M only, one-fourth to inherit trait 
N only, and one-fourth to inherit neither trait. 


ANSWER 19-4 Inbreeding tends to give rise to individuals 
who are homozygous for many genes. To see why, consider 
the extreme case where the consanguineous relationship 
takes the form of brother-sister inbreeding (as among 

the Pharaohs of ancient Egypt): because the parents are 
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closely related, there is a high probability that the maternal 
and paternal alleles inherited by the offspring will be the 
same. Inbreeding continued over many generations gives 
rise to individuals who are homozygous for almost every 
gene. Because of the randomness of the mechanism of 
inheritance, some deleterious alleles will become prevalent 
in the descendants. If the gene is important, individuals 
that inherit two defective copies will be unhealthy—often 
severely so. In another, separate inbred population, the 
same thing will happen, but chances are a different set of 
deleterious alleles will become prevalent. When individuals 
from the two separate inbred populations mate, their 
offspring will inherit deleterious alleles of genes A, B, and 
C, for example, from the mother, but functional alleles of 
those genes from the father; conversely, they will inherit 
deleterious alleles of genes D, E, and F from the father, but 
functional alleles of those genes from the mother. Because 
most deleterious mutations are recessive, the hybrid 
offspring—who are heterozygous for these genes—will thus 
escape the deleterious effects. 


ANSWER 19-5 Although any one of the three explanations 
could in principle account for the observed result, A and B 
can be ruled out as being implausible. 

A. There is no precedent for any instability in DNA so great 
as to be detectable in such a SNP analysis; in any case, 
the hypothesis would predict a steady decrease in the 
frequency of the SNP with age, not a drop in frequency 
that begins only at age 50. 

B. Human genes change only very slowly over time (unless 
a massive population migration brings an influx of 
individuals who are genetically different). People born 
50 years ago will be, on average, virtually the same 
genetically as the population being born today. 

C. This hypothesis is correct. A SNP with these properties 
has been used to discover a gene that appears to cause 
a substantial increase in the probability of death from 
cardiac abnormalities. 


ANSWER 19-6 Natural selection alone is not sufficient 

to eliminate recessive lethal genes from the population. 
Consider the following line of reasoning. Homozygous 
defective individuals can arise only as the offspring of a 
mating between two heterozygous individuals. By the rules 
of Mendelian genetics, offspring of such a mating will be 

in the ratio of 1 homozygous normal: 2 heterozygous: 1 
homozygous defective. Thus, statistically, heterozygous 
individuals should always be more numerous than the 
homozygous, defective individuals. And although natural 
selection effectively eliminates the defective genes in 
homozygous individuals through death, it cannot act to 
eliminate the defective genes in heterozygous individuals 
because they do not affect the phenotype. Natural selection 
will keep the frequency of the defective gene low in 

the population, but, in the absence of any other effect, 
there will always be a reservoir of defective genes in the 
heterozygous individuals. 

At low frequencies of the defective gene, another 
important factor—chance—comes into play. Chance 
variation can increase or decrease the frequency of 
heterozygous individuals (and thereby the frequency of 
the defective gene). By chance, the offspring of a mating 
between heterozygotes could all be normal, which would 
eliminate the defective gene from that lineage. Increases 
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in the frequency of a deleterious gene are opposed by 
natural selection; however, decreases are unopposed and 
can, by chance, lead to elimination of the defective gene 
from the population. On the other hand, new mutations 
are continually occurring, albeit at a low rate, creating 
fresh copies of the deleterious recessive allele. In a large 
population, a balance will be struck between the creation 
of new copies of the allele in this way, and their elimination 
through the death of homozygotes. 


ANSWER 19-7 

A. True. 

B. True. 

C. False. Mutations that occur during meiosis can be 
propagated, unless they give rise to nonviable gametes. 


ANSWER 19-8 In mitosis, two copies of the same 
chromosome can end up in the same daughter cell if 

one of the microtubule connections breaks before sister 
chromatids are separated. Alternatively, microtubules from 
the same spindle pole could attach to both kinetochores 
of the chromosome. As a consequence, one daughter 

cell would receive only one copy of all the genes carried 
on that chromosome, and the other daughter cell would 
receive three copies. The imbalance of the genes on this 
chromosome compared with the genes on all the other 
chromosomes would produce imbalanced levels of protein 
which, in most cases, is detrimental to the cell. If the 
mistake happens during meiosis, in the process of gamete 
formation, it will be propagated in all cells of the organism. 
A form of mental retardation called Down syndrome, 

for example, is due to the presence of three copies of 
Chromosome 21 in all of the nucleated cells in the body. 


ANSWER 19-9 Meiosis begins with DNA replication, 
producing a tetraploid cell containing four copies of each 
chromosome. These four copies have to be distributed 
equally during the two sequential meiotic divisions into four 
haploid cells. Sister chromatids remain paired so that (1) the 
cells resulting from the first division receive two complete 
sets of chromosomes and (2) the chromosomes can be 
evenly distributed again in the second meiotic division. If 
the sister chromatids did not remain paired, it would not 

be possible in the second division to distinguish which 
chromatids belong together, and it would therefore be 
difficult to ensure that precisely one copy of each chromatid 
is pulled into each daughter cell. Keeping two sister 
chromatids paired in the first meiotic division is therefore 
an easy way to keep track of which chromatids belong 
together. 

This biological principle suggests that you might 
consider clamping your socks together in matching pairs 
before putting them into the laundry. In this way, the 
cumbersome process of sorting them out afterward—and 
the seemingly inevitable mistakes that occur during that 
process—could be avoided. 


ANSWER 19-10 

A. A gene is a stretch of DNA that codes for a protein 
or functional RNA. An allele is an alternative form of 
a gene. Within the population, there are often several 
“normal” alleles, whose functions are indistinguishable. 
In addition, there may be many rare alleles that are 
defective to varying degrees. An individual, however, 
normally carries a maximum of two alleles of each gene. 


B. An individual is said to be homozygous if the two alleles 
of a gene are the same. An individual is said to be 
heterozygous if the two alleles of a gene are different. 
An individual can be heterozygous for gene A and 
homozygous for gene B. 

C. The genotype is the specific set of alleles present in 
the genome of an individual. In practice, for organisms 
studied in a laboratory, the genotype is usually specified 
as a list of the known differences between the individual 
and the wild type, which is the standard, naturally 
occurring type. The phenotype is a description of the 
visible characteristics of the individual. In practice, the 
phenotype is usually a list of the differences in visible 
characteristics between the individual and the wild type. 

D. An allele A is dominant (relative to a second allele a) if 
the presence of even a single copy of A is enough to 
affect the phenotype; that is, if heterozygotes (with 
genotype Aa) appear different from aa homozygotes. 
An allele a is recessive (relative to a second allele A) 
if the presence of a single copy makes no difference 
to the phenotype, so that Aa individuals look just like 
AA individuals. If the phenotype of the heterozygous 
individual differs from the phenotypes of individuals that 
are homozygous for either allele, the alleles are said to 
be co-dominant. 


ANSWER 19-11 

A. Since the pea plant is diploid, any true-breeding plant 
must carry two mutant copies of the same gene—both 
of which have lost their function. 

B. If each plant carries a mutation in a different gene, this 
will be revealed by complementation tests (see Panel 
19-1, p. 675). When plant A is crossed with plant B, all 
of the F4 plants will produce only round peas. And the 
same result will be obtained when plant B is crossed 
with plant C, or when plant A is crossed with plant C. In 
contrast, a cross between any two true-breeding plants 
that carry loss-of-function mutations in the same gene 
should produce only plants with wrinkled peas. This is 
true if the mutations themselves lie in different parts of 
the gene. 


ANSWER 19-12 

A. The mutation is likely to be dominant, because roughly 
half of the progeny born to an affected parent—in 
each of three marriages to hearing partners—are deaf, 
and it is unlikely that all these hearing partners were 
heterozygous carriers of the mutation. 

B. The mutation is not present on a sex chromosome. If it 
were, either only the female progeny should be affected 
(expected if the mutation arose in a gene on the 
grandfather's X chromosome), or only the male progeny 
should be affected (expected if the mutation arose in 
a gene on the grandfather's Y chromosome). In fact, 
the pedigree reveals that both males and females have 
inherited the mutant form of the gene. 

C. Suppose that the mutation was present on one of the 
two copies of the grandfather's Chromosome 12. Each 
of these copies of Chromosome 12 would be expected 
to carry a different pattern of SNPs, since one of them 
was inherited from his father and the other was inherited 
from his mother. Each of the copies of Chromosome 
12 that was passed to his grandchildren will have gone 
through two meioses—one meiosis per generation. 

Because two or three crossover events occur per 


chromosome during a meiosis, each chromosome 
inherited by a grandchild will have been subjected 

to about five crossovers since it left the grandfather, 
dividing it into six segments. An identical pattern 

of SNPs should surround whatever gene causes the 
deafness in each of the four affected grandchildren; 
moreover, this SNP pattern should be clearly different 
from that surrounding the same gene in each of the 
seven grandchildren who are normal. These SNPs 
would form an unusually long haplotype block—one 
that extends for about one-sixth of the length of 
Chromosome 12. (One-quarter of the DNA of each 
grandchild will have been inherited from the grandfather, 
in roughly 70 segments of this length scattered among 
the grandchild’s 46 chromosomes.) 


ANSWER 19-13 Individual 1 might be either heterozygous 
(+/-) or homozygous for the normal allele (+/+). Individual 2 
must be homozygous for the recessive deafness allele (-/-). 
(Both his parents must have been heterozygous because 
they produced a deaf son.) Individual 3 is almost certainly 
heterozygous (+/-) and responsible for transmitting the 
mutant allele to his children and grandchildren. Given 

that the mutant allele is rare, individual 4 is most probably 
homozygous for the normal allele (+/+). 


ANSWER 19-14 Your friend is wrong. 

A. Mendel’s laws, and the clear understanding that we now 
have concerning the mechanisms that produce them, 
rule out many false ideas concerning human heredity. 
One of them is that a first-born child has a different 
chance of inheriting particular traits from its parents than 
its siblings. 

B. The probability of this type of pedigree arising by 
chance is one-fourth for each generation, or one in 64 
for the three generations shown. 

C. Data from an enlarged sampling of family members, or 
from more generations, would quickly reveal that the 
regular pattern observed in this particular pedigree 
arose by chance. 

WHITE-EYED FLY 


White Ruby 
gene gene 
defective functional 
EE h—6rlrlti‘“< ‘<aiéCSES 
defective functional 


¥ | 


Ruby 
product 


Answers A:53 
D. If statistical tests showed that the pattern was not 
due to chance, it would suggest that some process of 
selection was involved: for example, parents who had 
had a first child that was affected might regularly opt 
for screening of subsequent pregnancies and selectively 
terminate those pregnancies in which the fetus was 
found to be affected. Fewer second children would then 
be born with the abnormality. 


ANSWER 19-15 Each carrier is a heterozygote, and 50% 

of his sperm or her eggs will carry the lethal allele. When 
two carriers marry, there is therefore a 25% chance that any 
baby will inherit the lethal allele from both parents and so 
will show the fatal phenotype. Because one person in 100 
is a carrier, one partnership in 10,000 (100 x 100) will be a 
partnership of carriers (assuming that people choose their 
partners at random). Other things being equal, one baby in 
40,000 will then be born with the defect, or 25 babies per 
year out of a total of a million babies born. 


ANSWER 19-16 A dominant-negative mutation gives rise 
to a mutant gene product that interferes with the function 
of the normal gene product, causing a loss-of-function 
phenotype even in the presence of a normal copy of the 
gene. For example, if a protein forms a hexamer, and 

the mutant protein can interact with the normal subunits 
and inhibit the function of the hexamer, the mutation will 
be dominant. This ability of a single defective allele to 
determine the phenotype is the reason why such an allele 
is dominant. A gain-of-function mutation increases the 
activity of the gene or makes it active in inappropriate 
circumstances. The change in activity often has a phenotypic 
consequence, which is why such mutations are usually 
dominant. 


ANSWER 19-17 

A. As outlined in Figure A19-17, if flies that are defective 
in different genes mate, their progeny will have one 
normal gene. In the case of a mating between a ruby- 
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eyed fly and a white-eyed fly, every progeny fly will 
inherit one functional copy of the White gene from one 
parent and one functional copy of the Ruby gene from 
the other parent. Note that the normal white allele 
produces brick-red eyes and the mutated form of the 
gene produces white eyes. Because each of the mutant 
alleles is recessive to the corresponding wild-type allele, 
the progeny will have the wild-type phenotype—brick- 
red eyes. 

B. Garnet, ruby, vermilion, and carnation complement one 
another and the various alleles of the White gene (that 
is, when these mutant flies are mated with each other, 
they produce flies with a normal eye color); thus each 
of these mutants defines a separate gene. In contrast, 
white, cherry, coral, apricot, and buff do not complement 
each other; thus, they must be alleles of the same gene, 
which has been named the White gene. Thus, these nine 
different eye-color mutants define five different genes. 

C. Different alleles of the same gene, like the five alleles 
of the White gene, often have different phenotypes. 
Different mutations compromise the function of the 
gene product to different extents, depending on the 
location of the mutation. Alleles that do not produce any 
functional product (null alleles), even if they result from 
different DNA sequence changes, do have the same 
phenotype. 


ANSWER 19-18 SNPs are single-nucleotide differences 
between individuals for which two or more variants are each 
found at high frequency in the population. In the human 
population, SNPs occur roughly once per 1000 nucleotides 
of sequence. Many have been identified and mapped in 
various organisms, including millions in the human genome. 
SNPs, which are detected by sequencing, serve as physical 
markers whose genomic locations are known. By tracking a 
mutant gene through different matings, and correlating the 
presence of the gene with the co-inheritance of particular 
SNP variants, one can narrow down the potential location of 
a gene to a chromosomal region that may contain only a few 
genes. These candidate genes can then be tested for the 
presence of a mutation that could account for the original 
mutant phenotype (see Figure 19-38). 


Chapter 20 


ANSWER 20-1 The horizontal orientation of the 
microtubules will be associated with a horizontal orientation 
of cellulose microfibrils deposited in the cell walls. The 
growth of the cells will therefore be in a vertical direction, 
expanding the distance between the cellulose microfibrils 
without stretching them (see Figure 20-6). In this way, the 
stem will rapidly elongate; in a typical natural environment, 
this will hasten emergence from darkness into light. 


ANSWER 20-2 As three collagen polypeptide chains have 
to come together to form the triple helix, a single defective 
polypeptide chain will impair assembly, even if normal 
chains are present at the same time. Collagen mutations are 
therefore dominant; that is, they have a deleterious effect 
even in the presence of a normal copy of the gene. 


ANSWER 20-3 The remarkable ability to swell and thus 
occupy a large volume of space depends on the negative 
charges. These attract a cloud of positive ions, chiefly Na‘, 


which by osmosis draw in large amounts of water, thus 
giving proteoglycans their unique properties. With fewer 
negative charges, proteoglycans will attract less water and 
occupy less space. By contrast, uncharged polysaccharides 
such as cellulose, starch, and glycogen (all composed 
entirely of glucose subunits) are easily compacted into fibers 
or granules. 


ANSWER 20-4 Focal contacts are common in connective 
tissue, where fibroblasts exert traction forces on the 
extracellular matrix, and in cell culture, where cell crawling 
is observed. The forces for pulling on the matrix or for 
crawling are generated by the actin cytoskeleton. In mature 
epithelia, focal contacts are presumably rare because the 
cells are largely fixed in place and have no need to crawl 
over the basal lamina or actively pull on it. 


ANSWER 20-5 Suppose a cell is damaged so that its 
plasma membrane becomes leaky. lons present in high 
concentration in the extracellular fluid, such as Na* and 
Ca2*, then rush into the cell, and valuable metabolites leak 
out. If the cell were to remain connected to its healthy 
neighbors by open gap junctions, these cells too would 
suffer from the damage. But the influx of Ca?* into the sick 
cell causes its gap junctions to close immediately, effectively 
isolating the cell and preventing damage from spreading in 
this way. 


ANSWER 20-6 lonizing (high-energy) radiation tears 
through matter, knocking electrons out of their orbits and 
breaking chemical bonds. In particular, it creates breaks and 
other damage in DNA, and thus causes cells to arrest in the 
cell cycle to allow time to repair the damaged DNA before 
proceeding to cell division (discussed in Chapter 18). If the 
damage is so severe that it cannot be repaired, cells usually 
kill themselves by undergoing apoptosis. 


ANSWER 20-7 Cells in the gut epithelium are exposed to 
a quite hostile environment, containing digestive enzymes 
and many other substances that vary drastically from day 
to day depending on the food intake of the organism. 
These epithelial cells form a first line of defense against 
potentially hazardous compounds and mutagens that 

we consume or are ubiquitous in our environment. Rapid 
turnover of epithelial cells protects the organism from 
harmful consequences, as wounded and sick epithelial cells 
are discarded (along with undamaged ones during the 
normal course of gut epithelium renewal). If an epithelial cell 
started to divide inappropriately as the result of a mutation, 
for example, it and its unwanted progeny would most often 
simply be discarded by natural disposal from the tip of the 
villus: even though such mutations must occur often, they 
rarely give rise to a cancer. 

A neuron, on the other hand, lives in a highly protected 
environment, largely insulated from the outside world. Its 
function depends on a complex system of connections 
with other neurons—a system that is created during 
development and is not easy to reconstruct if the neuron 
subsequently dies. 


ANSWER 20-8 Every cell division generates one additional 
cell; so if the cells were never lost or discarded from the 
body, the number of cells in the body should equal the 
number of divisions plus one. The number of divisions is 
1000-fold greater than the number of cells because, in the 
course of a lifetime, 1000 cells are discarded by mechanisms 
such as apoptosis for every cell that is retained in the body. 


ANSWER 20-9 

A. False. Gap junctions are not connected to the 
cytoskeleton; they form cell-cell channels that allow 
small molecules to pass from one cell to another. 

B. True. Upon wilting, the turgor pressure in the plant cell is 
reduced, and consequently the cell walls, having tensile 
but little compressive strength, like a deflated rubber 
tire, no longer provide rigidity. 

C. False. Proteoglycans can withstand a large amount of 

compressive force but do not have a rigid structure. 

Their space-filling properties and ability to resist 

compression result from their tendency to absorb large 

amounts of water. 

True. 

True. 

True. Stem cells stably express control genes that ensure 

that their daughter cells can only develop into certain 

differentiated cell types. 


mmo 


ANSWER 20-10 Small cytosolic molecules, such as glutamic 
acid, cyclic AMP, and Ca?" ions, pass readily through both 
gap junctions and plasmodesmata. Some proteins and 
mRNAs can pass through plasmodesmata, but all such 
macromolecules are excluded from gap junctions. Plasma 
membrane phospholipids diffuse in the plane of the 
membrane through plasmodesmata because the plasma 
membranes and smooth ER membranes of adjacent cells 
are continuous through these junctions. This traffic is not 
possible through gap junctions, because the membranes of 
the connected cells remain separate. 


ANSWER 20-11 Plants are exposed to extreme changes 

in the environment, which often are accompanied by huge 
fluctuations in the osmotic properties of their surroundings. 
An intermediate-filament network as we know it from animal 
cells would not be able to provide full osmotic support for 
cells: the sparse, rivetlike attachment points would not be 
able to prevent the membrane from bursting in response to 
a huge osmotic pressure applied from the inside of the cell. 


ANSWER 20-12 Action potentials can, in fact, be passed 
from cell to cell through gap junctions. Indeed, heart muscle 
cells contract synchronously by this mechanism. This way 

of passing the signal from cell to cell is rather limited, 
however. As we discuss in Chapter 12, synapses are far 
more sophisticated and allow signals to be modulated and 
integrated with other signals received by the cell. Thus, gap 
junctions are like simple soldered joints between electrical 
components, while synapses are like complex relay devices, 
enabling systems of neurons to perform computations. 


ANSWER 20-13 To make jello, gelatin is boiled in water, 
which denatures the collagen fibers. Upon cooling, the 
disordered fibers form a tangled mess that solidifies into a 
gel. This gel actually resembles the collagen as it is initially 
secreted by fibroblasts. It is not until the fibers have been 
aligned, bundled, and cross-linked that they acquire their 
ability to resist tensile forces. 


ANSWER 20-14 The evidence that DNA is the blueprint 
that specifies all the structural characteristics of an organism 
is based on observations that small changes in the DNA 

by mutation can result in large changes in the organism. 
Although DNA provides the plans that specify structure, 
these plans need to be executed during development. 

This requires a suitable environment (a human baby would 
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not fit into a stork’s egg shell), suitable nourishment, 
suitable molecular tools present in the egg (such as the 
appropriate transcription regulators required for early 
embryo development), suitable spatial organization (such 
as the asymmetries in the egg cell required to allow 

for appropriate cell differentiation during the early cell 
divisions), and so on. Thus inheritance is not restricted 

to the passing on of the organism's DNA, because 
development requires appropriate conditions to be set 
up by the parent. Nevertheless, when all these conditions 
are met, the plans that are archived in the genome will 
determine the structure of the organism to be built. 


ANSWER 20-15 White blood cells circulate in the 
bloodstream and migrate into and out of tissues in 
performance of their normal function of defending the body 
against infection: they are therefore naturally invasive. Once 
mutations have occurred to upset the normal controls on 
production of these cells, there is no need for additional 
mutations to enable the cells to spread through the body. 
Thus, the number of mutations that have to accumulate 

to give rise to leukemia is smaller than for other types of 
cancer. 


ANSWER 20-16 The shape of the curve reflects the need 
for multiple driver mutations to accumulate in a cell before a 
cancer results. If a single driver mutation were sufficient, the 
graph would be a straight horizontal line: the likelihood of 
occurrence of a particular mutation, and therefore of cancer, 
would be the same at any age. If two driver mutations were 
required, the graph would be a straight line sloping upward 
from the origin: the second mutation has an equal chance 
of occurring at any time, but will tip the cell into cancerous 
behavior only if the first mutation has already occurred 

in the same cell lineage; and the likelihood that the first 
mutation has already occurred will be proportional to the 
age of the individual. The steeply curved graph shown in the 
figure goes up approximately as the fifth power of the age, 
and this indicates that far more than two driver mutations 
have to accumulate before cancer sets in. It is not easy 

to say precisely how many, because of the complex ways 

in which cancers develop. Successive mutations can alter 
cell numbers and cell behavior, and thereby change both 
the probability of subsequent mutations and the selection 
pressures that drive the evolution of a cancer. 


ANSWER 20-17 During exposure to the carcinogen, 
mutations are induced, but the number of relevant (driver) 
mutations in any one cell is usually not enough to convert 
it directly into a cancer cell. Over the years, the cells that 
have become predisposed to cancer through the induced 
mutations accumulate progressively more mutations. 
Eventually, one of the mutant cells will turn into a cancer 
cell. The long delay between exposure and cancer has made 
it extremely difficult to hold cigarette manufacturers or 
producers of industrial carcinogens legally responsible for 
the damage that is caused by their products. 


ANSWER 20-18 By definition, a carcinogen is any substance 
that promotes the occurrence of one or more types of 
cancer. The sex hormones can therefore be classified as 
naturally occurring carcinogens. Although most carcinogens 
act by directly causing mutations, carcinogenic effects 

are also often exerted in other ways. The sex hormones 
increase both the rate of cell division and the survival of 
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cells, thereby increasing cell numbers in hormone-sensitive 
organs such as breast, uterus, and prostate. The increase 

in cell division boosts the mutation rate per cell, because 
mutations, regardless of environmental factors, are 
spontaneously generated in the course of DNA replication 
and chromosome segregation. The increase in cell numbers 
increases the total pool of cells at risk. In these and possibly 
other ways, the hormones can favor the development of 
cancer, even though they do not directly cause mutations. 


ANSWER 20-19 The short answer is no—cancer in general 
is not a hereditary disease. It arises from new mutations 
occurring in our own somatic cells, rather than from 
mutations we inherit from our parents. In some rare types 
of cancer, however, there is a strong heritable risk factor, 

so that parents and their children both show the same 
predisposition to a specific form of the disease. This occurs, 
for example, in families carrying a mutation that knocks 

out one of the two copies of the tumor suppressor gene 
APG; the children then inherit a propensity to colorectal 
cancer. Much weaker heritable tendencies are also seen in 
several other cancers, including breast cancer, but the genes 
responsible for these effects are still mostly unknown. 


Glossary 


acetyl CoA 
Activated carrier that donates the carbon atoms in its readily 
transferable acetyl group to many metabolic reactions, 
including the citric acid cycle and fatty acid biosynthesis; 
the acetyl group is linked to coenzyme A (CoA) by a 
thioester bond that releases a large amount of energy when 
hydrolyzed. 


acid 
A molecule that releases a proton when dissolved in water; 
this dissociation generates hydronium (H30%*) ions, thereby 
lowering the pH. 


actin filament 
Thin, flexible protein Sr yy yyy yi Soe 
filament made from 
a chain of globular actin molecules; a major constituent of 
all eukaryotic cells, this cytoskeletal element is essential for 
cell movement and for the contraction of muscle cells. 


actin-binding protein 
Protein that interacts with actin monomers or filaments 
to control the assembly, structure, and behavior of actin 
filaments and networks. 


action potential 
Traveling wave of electrical excitation caused by rapid, 
transient, self-propagating depolarization of the plasma 
membrane in a neuron or other excitable cell; also called 
a nerve impulse. 


activated carrier 
A small molecule that stores energy or chemical groups 
in a form that can be donated to many different metabolic 
reactions. Examples include ATP, acetyl CoA, and NADH. 


activation energy 
The energy that must be acquired by a molecule to undergo 
a chemical reaction. 


active site 
Region on the surface of an enzyme that binds to a substrate 
molecule and catalyzes its chemical transformation. 


active transport 
The movement of a solute across a membrane against its 
electrochemical gradient; requires an input of energy, such 
as that provided by ATP hydrolysis. 


adaptation 
Adjustment of sensitivity following repeated stimulation; 
allows a cell or organism to register small changes in a 
signal despite a high background level of stimulation. 


adenylyl cyclase 
Enzyme that catalyzes the formation of cyclic AMP from 
ATP; an important component in some intracellular 
signaling pathways. 


adherens junction 
Cell junction that helps hold together epithelial cells in a 
sheet of epithelium; actin filaments inside the cell attach to 
its cytoplasmic face. 


ADP 
Nucleoside diphosphate produced by hydrolysis of the 
terminal phosphate of ATP. (See Figure 3-31.) 


allele 
An alternative form of a gene; for a given gene, many alleles 
may exist in the gene pool of the species. 


allosteric 
Describes a protein that can exist in multiple conformations 
depending on the binding of a molecule (ligand) at a 
site other than the catalytic site; such changes from one 
conformation to another often alter the protein's activity or 
ligand affinity. 


alpha helix (a helix) 
Folding pattern, common in many proteins, in which a 
single polypeptide chain twists around itself to form a rigid 
cylinder stabilized by hydrogen bonds between every fourth 
amino acid. 


alternative splicing 
The production of different mRNAs (and proteins) from the 
same gene by splicing its RNA transcripts in different ways. 


Alu sequence 
Family of mobile genetic elements that comprises about 
10% of the human genome; this short, repetitive sequence is 
no longer mobile on its own, but requires enzymes encoded 
by other elements to transpose. 


amino acid 
Small organic molecule containing both an amino group 
and a carboxyl group; it serves as the building block of 
proteins. 


amino acid sequence 
The order of the amino acid subunits in a protein chain. 
Sometimes called the primary structure of a protein. 


aminoacyl-tRNA synthetase 
During protein synthesis, an enzyme that attaches the 
correct amino acid to a tRNA molecule to form a “charged” 
aminoacyl-tRNA. 


amphipathic 
Having both hydrophobic and hydrophilic regions, as in a 
phospholipid or a detergent molecule. 


anabolic pathway 
Series of enzyme-catalyzed reactions by which large 
biological molecules are synthesized from smaller subunits; 
usually requires an input of energy. 


anabolism 
Set of metabolic pathways by which large molecules are 
made from smaller ones. 


anaphase 
Stage of mitosis during which the two sets of chromosomes 
separate and are pulled toward opposite ends of the dividing 
cell. 


anaphase-promoting complex (APC/C) 
A protein complex that triggers the separation of sister 
chromatids and orchestrates the carefully timed destruction 
of proteins that control progress through the cell cycle; the 
complex catalyzes the ubiquitylation of its targets. 


antenna complex 
In chloroplasts and photosynthetic bacteria, the part of 
the membrane-bound photosystem that captures energy 
from sunlight; contains an array of proteins that bind 
hundreds of chlorophyll molecules and other photosensitive 
pigments. 
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antibody 
Protein produced by B lymphocytes in response to a foreign 
molecule or invading organism. Binds to the foreign 
molecule or cell extremely tightly, thereby inactivating it or 
marking it for destruction. 


anticodon 
Set of three consecutive nucleotides in a transfer RNA 
molecule that recognizes, through base-pairing, the three- 
nucleotide codon on a messenger RNA molecule; this 
interaction helps to deliver the correct amino acid to a 
growing polypeptide chain. 


antigen 
Molecule or fragment of a molecule that is recognized by 
an antibody. 


antiport 
Type of coupled transporter that transfers two different 
ions or small molecules across a membrane in opposite 
directions, either simultaneously or in sequence. 


apical 
Describes the top or the tip of a cell, structure, or organ; in 
an epithelial cell, for example, this surface is opposite the 
base, or basal surface. 


apoptosis 
A tightly controlled form of programmed cell death that 
allows excess cells to be eliminated from an adult or 
developing organism. 


archaeon 
Microscopic organism that is a member of one of the 
two divisions of prokaryotes; often found in hostile 
environments such as hot springs or concentrated brine. 
(See also bacterium.) 


asexual reproduction 
Mode of reproduction in which offspring arise from a 
single parent, producing an individual genetically identical 
to that parent; includes budding, binary fission, and 
parthenogenesis. 


aster 


Star-shaped array of microtubules emanating from a 
centrosome or from a pole of a mitotic spindle. 


atom 
The smallest particle of an element that still retains its 
distinctive chemical properties; consists of a positively 
charged nucleus surrounded by a cloud of negatively 
charged electrons. 


atomic weight 
The mass of an atom relative to the mass of a hydrogen 
atom; equal to the number of protons plus the number of 
neutrons that the atom contains. 


ATP 
Activated carrier that serves as the principal carrier of 
energy in cells; a nucleoside triphosphate composed of 
adenine, ribose, and three phosphate groups. (See Figure 
2-26.) 


ATP synthase 
Abundant membrane-associated enzyme complex that 
catalyzes the formation of ATP from ADP and inorganic 


phosphate during oxidative phosphorylation and 
photosynthesis. 
autophagy 


Mechanism by which a cell “eats itself,” digesting molecules 
and organelles that are damaged or obsolete. 


Avogadro’s number 
The number of molecules in a mole, the quantity of 
a substance equal to its molecular weight in grams; 
approximately 6 x 107°. 


axon 
Long, thin extension that conducts electrical signals away 
from a nerve cell body toward remote target cells. 


bacteriorhodopsin 
Pigmented protein found in abundance in the plasma 
membrane of the salt-loving archaeon Halobacterium 
halobium; pumps protons out of the cell, fueled by light 
energy. 


bacterium 
Microscopic organism that is a member of one of the two 
divisions of prokaryotes; some species cause disease. 
The term is sometimes used to refer to any prokaryotic 
microorganism, although the world of prokaryotes also 
includes archaea, which are only distantly related to each 
other. (See also archaeon.) 


basal 
Situated near the base; opposite of apical. 


basal lamina 
Thin mat of extracellular matrix, secreted by epithelial cells, 
upon which these cells sit. 


base 
Molecule that accepts a proton when dissolved in water; 
also used to refer to the nitrogen-containing purines or 
pyrimidines in DNA and RNA. 


base pair 
Two complementary nucleotides in an RNA or a DNA 
molecule that are held together by hydrogen bonds— 
normally G with C, and A with T or U. 


Bcl2 family 
Related group of intracellular proteins that regulates 
apoptosis; some family members promote cell death, others 
inhibit it. 

beta sheet (f sheet) 

Folding pattern found in many 
proteins in which neighboring 
regions of the polypeptide chain 
associate side-by-side with each 
other through hydrogen bonds to 
give a rigid, flattened structure. 


bi-orientation 
The symmetrical attachment of a sister-chromatid pair 
on the mitotic spindle, such that one chromatid in the 
duplicated chromosome is attached to one spindle pole and 
the other is attached to the opposite pole. 

binding site 
Region on the surface of a protein, typically a cavity or 
groove, that interacts with another molecule (a ligand) 
through the formation of multiple noncovalent bonds. 


biosynthesis 
An enzyme-catalyzed process by which complex molecules 
are formed from simpler substances by living cells; also 
called anabolism. 


bivalent 
Structure formed when a duplicated chromosome pairs 
with its homolog at the beginning of meiosis; contains four 
sister chromatids. 


buffer 
Mixture of weak acids and bases that maintains the pH of a 
solution by releasing and taking up protons. 


C-terminus 
The end of a polypeptide chain that carries a free carboxyl 
group (-COOR). 


Ca?* pump (or Ca?* ATPase) 
An active transporter that uses energy supplied by ATP 
hydrolysis to actively expel Ca* from the cell cytosol. 


Ca?*/calmodulin-dependent protein kinase 
(CaM-kinase) 
Enzyme that phosphorylates target proteins in response to 
an increase in Ca?* ion concentration through its interaction 
with the Ca?*-binding protein calmodulin. 


cadherin 
A member of a family of Ca?*+-dependent proteins that 
mediates the attachment of one cell to another in animal 
tissues. 


calmodulin 
Small Ca%-binding protein that modifies the activity of 
many target proteins in response to changes in Ca?* 
concentration. 


cancer 
Disease caused by abnormal and uncontrolled cell 
proliferation, followed by invasion and colonization of body 
sites normally reserved for other cells. 


carbon fixation 
Process by which green plants and other photosynthetic 
organisms incorporate carbon atoms from atmospheric 
carbon dioxide into sugars. The second stage of 
photosynthesis. 


caspase 
One of a family of proteases that, when activated, mediates 
the destruction of the cell by apoptosis. 


catabolism 
Set of enzyme-catalyzed reactions by which complex 
molecules are degraded to simpler ones with release of 
energy; intermediates in these reactions are sometimes 
called catabolites. 


catalyst 
Substance that accelerates a chemical reaction by lowering 
its activation energy; enzymes perform this role in cells. 


Cdk (cyclin-dependent protein kinase) 
Enzyme that, when complexed with a regulatory cyclin 
protein, can trigger various events in the cell-division cycle 
by phosphorylating specific target proteins. 


Cdk inhibitor protein 
Regulatory protein that blocks the assembly or activity 
of cyclin-Cdk complexes, delaying progression primarily 
through the G; and S phases of the cell cycle. 


cDNA library 
Collection of DNA fragments synthesized using all of the 
mRNAs present in a particular type of cell as a template. 


cell 
The basic unit from which a living organism is made; an 
aqueous solution of chemicals, enclosed by a membrane, 
that has an ability to self-replicate. 


cell cortex 
Specialized layer of cytoplasm on the inner face of the 
plasma membrane. In animal cells, it is rich in actin 
filaments that govern cell shape and drive cell movement. 


cell cycle 
The orderly sequence of events by which 
a cell duplicates its contents and divides 
into two. 


cell junction 
Specialized region of connection between two cells or 
between a cell and the extracellular matrix. 


cell memory 
The ability of differentiated cells and their descendants to 
maintain their identity. 


cell respiration 
Process by which cells harvest the energy stored in food 
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molecules; usually accompanied by the uptake of O2 and 
the release of CO2. 


cell signaling 
The molecular mechanisms by which cells detect and 
respond to external stimuli and send messages to other 
cells. 


cell wall 
Mechanically strong fibrous layer deposited outside the 
plasma membrane of some cells. Prominent in most plants, 
bacteria, algae, and fungi, but not present in most animal 
cells. 


cell-cycle control system 
Network of regulatory proteins that govern the orderly 
progression of a eukaryotic cell through the stages of cell 
division. 

cellulose microfibril 
Long, thin polysaccharide fiber that helps strengthen plant 
cell walls. 


centriole 
Cylindrical array of microtubules usually found in pairs at 
the center of a centrosome in animal cells. Also found at the 
base of cilia and flagella, where they are called basal bodies. 


centromere 
Specialized DNA sequence that 
allows duplicated chromosomes to 
be separated during M phase; can 
be seen as the constricted region of 
a mitotic chromosome. 


centrosome 
Microtubule-organizing center that sits near the nucleus in 
an animal cell; during the cell cycle, this structure duplicates 
to form the two poles of the mitotic spindle. 


centrosome cycle 
Process by which the centrosome duplicates (during 
interphase) and the two new centrosomes separate (at 
the beginning of mitosis) to form the poles of the mitotic 


spindle. 
e 
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chemical bond 
A sharing or transfer of electrons that holds two atoms 


together. (See also covalent bond and noncovalent 
bond.) 


chemical group 
A combination of atoms, such as a hydroxyl group (-OH) or 
an amino group (-NH3), with distinct chemical and physical 
properties that influence the behavior of the molecule in 
which it resides. 


channel 
A protein that forms a hydrophilic pore 
across a membrane, through which 
selected small molecules or ions can 
passively diffuse. 


chemiosmotic coupling 
Mechanism that uses the energy stored in a transmembrane 
proton gradient to drive an energy-requiring process, such 
as the synthesis of ATP by ATP synthase or the transport of 
a molecule across a membrane. 


chiasma (plural chiasmata) 
X-shaped connection between paired homologous 
chromosomes during meiosis; represents a site of crossing- 
over between two non-sister chromatids. 


chlorophyll 
Light-absorbing green pigment that plays a central part in 
photosynthesis. 


chloroplast 
Specialized organelle in algae and plants that contains 
chlorophyll and serves as the site for photosynthesis. 
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cholesterol 
Short, rigid lipid molecule present in large amounts in the 
plasma membranes of animal cells, where it makes the lipid 
bilayer less flexible. 


chromatin 


Complex of DNA and proteins that makes up the 
chromosomes in a eukaryotic cell. 


chromatin-remodeling complex 
Enzyme (typically multisubunit) that uses the energy of 
ATP hydrolysis to alter the arrangement of nucleosomes in 
eukaryotic chromosomes, changing the accessibility of the 
underlying DNA to other proteins. 


chromatography 
Technique used to separate the individual molecules in a 
complex mixture on the basis of their size, charge, or their 
ability to bind to a particular chemical group. In a common 
form of the technique, the mixture is run through a column 
filled with a material that binds the desired molecule, and 
it is then eluted from the column with a solvent gradient. 


chromosome 
Long, threadlike structure composed of DNA and proteins 
that carries the genetic information of an organism; 
becomes visible as a distinct entity when a plant or animal 
cell prepares to divide. 


chromosome condensation 
Process by which a duplicated chromosome becomes 
packed into a more compact structure prior to cell division. 


cilium 
Hairlike structure made of microtubules found on the 
surface of many eukaryotic cells; when present in large 


numbers, its rhythmic beating can drive the movement of 
fluid over the cell surface, as in the epithelium of the lungs. 


citric acid cycle 
Series of reactions that generate large amounts of NADH 
by oxidizing acetyl groups derived from food molecules 
to CO2. In eukaryotic cells, this central metabolic pathway 
takes place in the mitochondrial matrix. 


classical genetic approach 
Experimental techniques used to isolate the genes 
responsible for an interesting phenotype. 


clathrin 
Protein that makes up the coat of a type of transport vesicle 
that buds from either the Golgi apparatus (on the outward 
secretory pathway) or from the plasma membrane (on the 


inward endocytic pathway). 
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codon 


Group of three consecutive nucleotides that specifies 
a particular amino acid or that starts or stops protein 
synthesis; applies to the nucleotides in an MRNA or in a 
coding sequence of DNA. 


coated vesicle 
Small membrane-enclosed sac that wears a 
distinctive layer of proteins on its cytosolic 
surface. It is formed by pinching-off of a 
protein-coated region of cell membrane. 


coenzyme 
Small molecule that binds tightly to an enzyme and helps it 
to catalyze a reaction. 


cohesin 
Protein complex that holds sister chromatids together after 
DNA has been replicated in the cell cycle. 


coiled-coil 
Stable, rodlike protein structure formed when two or more 
a helices twist repeatedly around each other. 


collagen 
Triple-stranded, fibrous protein that is a major component 
of the extracellular matrix and connective tissues; it is the 
main protein in animal tissues, and different forms can be 
found in skin, tendon, bone, cartilage, and blood vessels. 


combinatorial control 
Describes the way in which groups of transcription 
regulators work together to regulate the expression of a 
single gene. 


complementary 
Describes two molecular surfaces that fit together closely 
and form noncovalent bonds with each other. Examples 
include complementary base pairs, such as A and T, and the 
two complementary strands of a DNA molecule. 


complementary DNA (cDNA) 
DNA molecule synthesized from an mRNA molecule and 
therefore lacking the introns that are present in genomic 
DNA. 


complementation test 
Genetic experiment that determines whether two mutations 
that are associated with the same phenotype lie in the same 
gene or in different genes. 


condensation reaction 
Chemical reaction in which a covalent bond is formed 
between two molecules as water is expelled; used to build 
polymers, such as proteins, polysaccharides, and nucleic 
acids. 


condensin 
Protein complex that helps 
configure duplicated chromosomes 
for segregation by making them 
more compact. 


conformation 
Precise, three-dimensional shape of a protein or other 
macromolecule, based on the spatial location of its atoms 
in relation to one another. 


connective tissue 
Tissues such as bone, tendons, and the dermis of the skin, 
in which extracellular matrix makes up the bulk of the 
tissue and carries the mechanical load. 


conserved synteny 
The preservation of gene order in the genomes of different 
species. 


contractile ring 
Structure made of actin and myosin filaments that forms a 
belt around a dividing cell, pinching it in two. 


coupled reaction 
Linked pair of chemical reactions in which free energy 
released by one reaction serves to drive the other reaction. 


covalent bond 
Stable chemical link between two atoms produced by 
sharing one or more pairs of electrons. 


CRISPR 
System for gene editing based on a bacterial enzyme that 
uses a guide RNA molecule to search for and modify specific 
nucleotide sequences in the genome. 


crossing-over 
Process whereby two homologous chromosomes break at 
corresponding sites and rejoin to produce two recombined 
chromosomes that have physically exchanged segments of 
DNA. 


cryoelectron microscopy (cryo-EM) 
Technique for observing the detailed structure of a 
macromolecule at very low temperatures after freezing 
native structures in ice. 


cyclic AMP 
Small intracellular signaling molecule generated from 
ATP in response to hormonal stimulation of cell-surface 
receptors. 


cyclic-AMP-dependent protein kinase (PKA) 
Enzyme that phosphorylates target proteins in response to 
a rise in intracellular cyclic AMP concentration. 


cyclin 
Regulatory protein whose concentration rises and falls at 
specific times during the eukaryotic cell cycle; cyclins help 
control progression from one stage of the cell cycle to the 
next by binding to cyclin-dependent protein kinases (Cdks). 


cytochrome 
A family of membrane-bound, colored, heme-containing 
proteins that transfer electrons during cellular respiration 
and photosynthesis. 


cytochrome c oxidase 
Protein complex that serves as the final electron carrier in 
the respiratory chain; removes electrons from cytochrome c 
and passes them to O2 to produce H20. 


cytokinesis 
Process by which the cytoplasm of a plant or animal cell 
divides in two to form individual daughter cells. 


cytoplasm 
Contents of a cell that are contained within its plasma 
membrane but, in the case of eukaryotic cells, outside the 
nucleus. 


cytoskeleton 
System of protein filaments in the cytoplasm of a eukaryotic 
cell that gives the cell shape and the capacity for directed 
movement. Its most abundant components are actin 
filaments, microtubules, and intermediate filaments. 


cytosol 
Contents of the main compartment of the cytoplasm, 
excluding membrane-enclosed organelles such as 
endoplasmic reticulum and mitochondria. The cell fraction 
remaining after membranes, cytoskeletal components, and 
other organelles have been removed. 


dendrite 
Short, branching structure that extends from the surface of 
a nerve cell and receives signals from other neurons. 


deoxyribonucleic acid (DNA) 
Double-stranded polynucleotide formed from two separate 
chains of covalently linked deoxyribonucleotide units. 
It serves as the cell’s store of genetic information that is 
transmitted from generation to generation. 


depolarization 


A shift in the membrane potential, making it less negative 
on the inside of the cell. 


desmosome 
Specialized cell-cell junction, usually 
formed between two epithelial cells, that 
serves to connect the ropelike keratin 
filaments of the adjoining cells, providing 
tensile strength. 


detergent 
Soapy substance used to solubilize lipids and membrane 
proteins. 


diacylglycerol (DAG) 
Small messenger molecule produced by the cleavage 
of membrane inositol phospholipids in response to 
extracellular signals. Helps activate protein kinase C. 


dideoxy (Sanger) sequencing 
The standard method of determining the nucleotide 
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sequence of DNA; utilizes DNA polymerase and a set of 
chain-terminating nucleotides. 


differentiated cell 
Cell that has undergone a coordinated change in gene 
expression, enabling it to perform a specialized function. 


differentiation 
Process by which a pluripotent cell undergoes a progressive, 
coordinated change to a more specialized cell type, brought 
about by large-scale changes in gene expression. 


diffusion 
Process by which molecules and small particles move 
from one location to another by random, thermally driven 
motion. 


diploid 
Describes a cell or organism containing two sets of 
homologous chromosomes, one inherited from each 
parent. (See also haploid.) 


disulfide bond 
Covalent cross-link formed between the sulfhydryl groups 
on two cysteine side chains; often used to reinforce a 
secreted protein's structure or to join two different proteins 
together. 


DNA 
Double-stranded polynucleotide formed from two separate 
chains of covalently linked deoxyribonucleotide units. 
It serves as the cell’s store of genetic information that is 
transmitted from generation to generation. 


DNA cloning 
Production of many identical copies of a DNA sequence. 


DNA library 
Collection of cloned DNA molecules, representing either 
an entire genome (genomic library) or copies of the mRNA 
produced by a cell (CDNA library). 


DNA ligase 
Enzyme that seals nicks that arise in the backbone of a DNA 
molecule; in the laboratory, can be used to join together two 
DNA fragments. 


DNA methylation 
The enzymatic addition of methyl groups to cytosine bases 
in DNA; this covalent modification generally turns off genes 
by attracting proteins that block gene expression. 


DNA polymerase 
Enzyme that catalyzes the synthesis of a DNA molecule from 
a DNA template using deoxyribonucleoside triphosphate 
precursors. 


DNA repair 
Collective term for the enzymatic processes that correct 
damage to DNA. 


DNA replication 
The process by which a copy of a DNA molecule is made. 


double helix 
The typical structure of a DNA 
molecule in which the two 
complementary polynucleotide 
strands are wound around each 
other with base-pairing between the strands. 


dynamic instability 
The rapid switching between growth and shrinkage shown 
by microtubules. 


dynein 
Motor protein that uses the energy of ATP hydrolysis to 
move toward the minus end of a microtubule. One form of 
the protein is responsible for the bending of cilia. 
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electrochemical gradient 
Driving force that determines which way an ion will move 
across a membrane; consists of the combined influence 
of the ion’s concentration gradient and the membrane 
potential. 


electron 
Negatively charged subatomic particle that occupies space 
around an atomic nucleus (e). 


electron microscope 
Instrument that illuminates a specimen using beams of 
electrons to reveal and magnify the structures of very small 
objects, such as organelles and large molecules. 


electron-transport chain 
A series of membrane-embedded electron carrier molecules 
that facilitate the movement of electrons from a higher to 
a lower energy level, as in oxidative phosphorylation and 
photosynthesis. 


electronegativity 
The tendency of an atom to attract electrons. 


electrophoresis 
Technique for separating a mixture of proteins or DNA 
fragments by placing them on a polymer gel and subjecting 
them to an electric field. The molecules migrate through 
the gel at different speeds depending on their size and net 
charge. 


electrostatic attraction 
Force that draws together oppositely charged atoms. 
Examples include ionic bonds and the attractions between 
molecules containing polar covalent bonds. 


embryonic stem (ES) cell 
An undifferentiated cell type derived from the inner cell 
mass of an early mammalian embryo and capable of 
differentiating to give rise to any of the specialized cell 
types in the adult body. 


endocytosis 
Process by which cells take in materials through an 
invagination of the plasma membrane, which surrounds 
the ingested material in a membrane-enclosed vesicle. (See 
also pinocytosis and phagocytosis.) 


endomembrane system 
Interconnected network of membrane-enclosed organelles 
in a eukaryotic cell; includes the endoplasmic reticulum, 
Golgi apparatus, lysosomes, peroxisomes, and endosomes. 


endoplasmic reticulum (ER) 
Labyrinthine membrane-enclosed 
compartment in the cytoplasm of 
eukaryotic cells where lipids and 
proteins are made. 


endosome 
Membrane-enclosed compartment of a eukaryotic cell 
through which material ingested by endocytosis passes on 
its way to lysosomes. 


entropy 
Thermodynamic quantity that measures the degree of 
disorder in a system. 


enzyme 
A protein that catalyzes a specific chemical reaction. 


enzyme-coupled receptor 
Transmembrane protein that, when stimulated by the 
binding of a ligand, activates an intracellular enzyme (either 
a separate enzyme or part of the receptor itself). 


epigenetic inheritance 
The transmission of a heritable pattern of gene expression 
from one cell to its progeny that does not involve altering 
the nucleotide sequence of the DNA. 


epithelium (plural epithelia) 
Sheet of cells covering an external surface or lining an 
internal body cavity. 

equilibrium 
State in which the forward and reverse rates of a chemical 
reaction are equal so that no net chemical change occurs. 


equilibrium constant, K 
For a reversible chemical reaction, the ratio of substrate to 
product when the rates of the forward and reverse reactions 
are equal. 


euchromatin 
One of the two main states in which chromatin exists 
within an interphase cell. Prevalent in gene-rich areas, its 
less compact structure allows access for proteins involved 
in transcription. (See also heterochromatin.) 


eukaryote 
An organism whose cells have a distinct nucleus and 
cytoplasm. 


evolution 
Process of gradual modification and adaptation that occurs 
in living organisms over generations. 


exocytosis 
Process by which most molecules are secreted from 
a eukaryotic cell. These molecules are packaged in 
membrane-enclosed vesicles that fuse with the plasma 
membrane, releasing their contents to the outside. 


exon 
Segment of a eukaryotic gene that is transcribed into RNA 
and dictates the amino acid sequence of part of a protein. 


exon shuffling 
Mechanism for the evolution of new genes; in the process, 
coding sequences from different genes are brought together 
to generate a protein with a new combination of domains. 


extracellular matrix 
Complex network of polysaccharides (such as 
glycosaminoglycans or cellulose) and proteins (such as 
collagen) secreted by cells. A structural component of tissues 
that also influences their development and physiology. 


extracellular signal molecule 
Any molecule present outside the cell that can elicit a 
response inside the cell when the molecule binds to a 
receptor. 


FAD 
A molecule that accepts electrons and hydrogen atoms 
from an electron donor; see FADH). 


FADH2 
A high-energy electron carrier produced by reduction of 
FAD during the breakdown of molecules derived from food, 
including fatty acids and acetyl CoA. 


fat 
Type of lipid used by living cells to store metabolic energy. 
Mainly composed of triacylglycerols. (See Panel 2-5, pp. 
74-75.) 


fat droplet 
Large cluster of hydrophobic fats or oils that forms inside 
the cells. 


fatty acid 
Molecule that consists of a 
carboxylic acid attached to 
a long hydrocarbon chain. 
Used as a major source of energy during metabolism and as 
a starting point for the synthesis of phospholipids. 


feedback inhibition 
A form of metabolic control in which the end product of 


a chain of enzymatic reactions reduces the activity of an 
enzyme early in the pathway. 


feedback regulation 
Process whereby enzymes are either positively or negatively 
regulated in response to the levels of metabolites that are 
not their substrates. 


fermentation 
The breakdown of organic molecules without the 
involvement of molecular oxygen. This form of oxidation 
yields less energy than aerobic cell respiration. 


fertilization 
The fusion of two gametes—sperm and egg—to produce a 
new individual organism. 


fibroblast 
Cell type that produces the collagen-rich extracellular matrix 
in connective tissues such as skin and tendon. Proliferates 
readily in wounded tissue and in tissue culture. 


fibronectin 
Extracellular matrix protein that helps cells attach to the 
matrix by acting as a “linker” that binds to a cell-surface 
integrin molecule on one end and to a matrix component, 
such as collagen, on the other. 


fibrous protein 
A protein with an elongated, rodlike shape, such as collagen 
or a keratin filament. 


filopodium 
Long, thin, actin-containing extension on the surface of an 
animal cell. Sometimes has an exploratory function, as ina 
growth cone. 


flagellum 
Long, whiplike structure capable of propelling a cell through 
a fluid medium with its rhythmic beating. Eukaryotic flagella 
are longer versions of cilia; bacterial flagella are completely 
different, being smaller and simpler in construction. 


fluorescence microscope 
Instrument used to visualize a specimen that has been 
labeled with a fluorescent dye; samples are illuminated 
with a wavelength of light that excites the dye, causing it 
to fluoresce. 


free energy, G 
Energy that can be harnessed to do work, such as driving a 
chemical reaction. 


free-energy change, AG 
“Delta G”: in a chemical reaction, the difference in free 
energy between reactant and product molecules. A large 
negative value of AG indicates that the reaction has a 
strong tendency to occur. (See also standard free-energy 
change.) 


G protein 
A membrane-bound GTP-binding protein involved in 
intracellular signaling; composed of three subunits, this 
intermediary is usually activated by the binding of a 
hormone or other ligand to a transmembrane receptor. 


G-protein-coupled receptor (GPCR) 
Cell-surface receptor that associates with 
an intracellular trimeric GTP-binding 
protein (G protein) after activation by an 
extracellular ligand. These receptors are 
embedded in the membrane by seven 
transmembrane « helices. 


G, cyclin 
Regulatory protein that helps drive a cell through the first 
gap phase of the cell cycle and toward S phase. 


G, phase 
Gap 1 phase of the eukaryotic cell cycle; falls between the 
end of cytokinesis and the start of DNA synthesis. 
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G,-Cdk 
Protein complex whose activity drives the cell through the 
first gap phase of the cell cycle; consists of a G; cyclin plus 
a cyclin-dependent protein kinase (Cdk). 


G1/S cyclin 
Regulatory protein that helps to launch the S phase of the 
cell cycle. 


G,/S-Cdk 
Protein complex whose activity triggers entry into S phase 
of the cell cycle; consists of a Gj/S cyclin plus a cyclin- 
dependent protein kinase (Cdk). 


G2 phase 
Gap 2 phase of the eukaryotic cell cycle; falls between the 
end of DNA synthesis and the beginning of mitosis. 


gain-of-function mutation 
Genetic change that increases the activity of a gene or 
makes it active in inappropriate circumstances; such 
mutations are usually dominant. 


gamete 
Cell type in a diploid organism that carries only one set of 
chromosomes and is specialized for sexual reproduction. A 
sperm or an egg; also called a germ cell. 


gamete 
Cell type in a diploid organism that carries only one set of 
chromosomes and is specialized for sexual reproduction. A 
sperm or an egg; also called a germ cell. 


gap junction 
In animal tissues, specialized 
connection between juxtaposed 
cells through which ions and small 
molecules can pass from one cell to 
the other. 


GDP 
Nucleoside diphosphate that is produced by the hydrolysis 
of the terminal phosphate of GTP, a reaction that also 
produces inorganic phosphate. 


gene 
Unit of heredity containing the instructions that dictate the 
characteristics or phenotype of an organism; in molecular 
terms, a segment of DNA that directs the production of a 
particular protein or functional RNA molecule. 


gene duplication and divergence 
A process by which new genes can form; involves the 
accidental generation of an additional copy of a stretch 
of DNA containing one or more genes, followed by an 
accumulation of mutations that over time can alter the 
function or expression of either the original or its copy. 


gene expression 
The process by which a gene makes a product that is useful 
to the cell or organism by directing the synthesis of a protein 
or an RNA molecule with a characteristic activity. 


gene family 
A set of related genes that has arisen through a process of 
gene duplication and divergence. 


gene knockout 
A genetically engineered animal in which a specific gene 
has been inactivated. 


general transcription factors 
Proteins that assemble on the promoters of eukaryotic 
genes near the start site of transcription and load the RNA 
polymerase in the correct position. 


genetic code 
Set of rules by which the information contained in the 
nucleotide sequence of a gene and its corresponding RNA 
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molecule is translated into the amino acid sequence of a 
protein. 


genetic instability 
An increased rate of mutation often caused by defects 
in the systems that govern the accurate replication and 
maintenance of the genome; the resulting mutations 
sometimes drive the evolution of cancer. 


genetic map 
A graphic representation of the order of genes in 
chromosomes spaced according to the amount of 
recombination that occurs between them. 


genetic screen 
Experimental technique used to search through a collection 
of mutants for a particular phenotype. 


genetics 
The study of genes, heredity, and the variation that gives rise 
to differences between one living organism and another. 


genome 
The total genetic information carried by all the 
chromosomes of a cell or organism; in humans, the total 
number of nucleotide pairs in the 22 autosomes plus the X 
and Y chromosomes. 


genomic library 
Collection of cloned DNA molecules that represents the 
entire genome of a cell. 


genotype 
The genetic makeup of a cell or organism, including which 
alleles (gene variants) it carries. 


germ line 
The lineage of reproductive cells that contributes to the 
formation of a new generation of organisms, as distinct 
from somatic cells, which form the body and leave no 
descendants in the next generation. 


globular protein 
Any protein in which the polypeptide chain folds into a 
compact, rounded shape. Includes most enzymes. 


gluconeogenesis 
Set of enzyme-catalyzed reactions by which glucose is 
synthesized from small organic molecules such as pyruvate, 
lactate, or amino acids; in effect, the reverse of glycolysis. 


glucose CHOH 
Six-carbon sugar that plays a major PE O oi 
role in the metabolism of living cells. l l 
Stored in polymeric form as glycogen i a 
in animal cells and as starch in plant "O % ofan 
cells. (See Panel 2-4, pp. 72-73.) h OH 


glycocalyx 
Protective layer of carbohydrates on the outside surface 
of the plasma membrane formed by the sugar residues of 
membrane glycoproteins, proteoglycans, and glycolipids. 


glycogen 
Branched polymer composed exclusively of glucose units 
used to store energy in animal cells. Granules of this 
material are especially abundant in liver and muscle cells. 


glycolysis 
Series of enzyme-catalyzed oxidation reactions in which 
sugars are partially degraded and their energy is captured 
by the activated carriers ATP and NADH. (Literally, “sugar 
splitting.”) 

glycosaminoglycan (GAG) 
Polysaccharide chain that can form a gel that acts as a 


“space filler’ in the extracellular matrix of connective 
tissues; helps animal tissues resist compression. 
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Golgi apparatus 
Membrane-enclosed organelle in 
eukaryotic cells that modifies the 
proteins and lipids made in the 
endoplasmic reticulum and sorts 
them for transport to other sites. 
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gradient-driven pump 


A protein that uses energy stored in the electrochemical 
gradient of ions to actively transport a solute across a 
membrane. 


green fluorescent protein (GFP) 
Fluorescent protein, isolated from a jellyfish, that is used 
experimentally as a marker for monitoring the location and 
movement of proteins in living cells. 


growth factor 
Extracellular signal molecule that stimulates a cell to 
increase in size and mass. Examples include epidermal 
growth factor (EGF) and platelet-derived growth factor 
(PDGF). 


GTP 
Nucleoside triphosphate used in the synthesis of RNA and 
DNA. Like the closely related ATP, serves as an activated 
carrier in some energy-transfer reactions. Also has a special 
role in microtubule assembly, protein synthesis, and cell 
signaling. 


GTP-binding protein 
Intracellular signaling protein whose activity is determined 
by its association with either GTP or GDP. Includes both 
trimeric G proteins and monomeric GTPases, such as Ras. 


H+ pump 
A protein or protein complex that uses energy supplied by 
ATP hydrolysis, an ion gradient, or light to actively move 
protons across a membrane. 


haploid 
Describes a cell or organism with only one set of 
chromosomes, such as a sperm cell or a bacterium. (See 
also diploid.) 


haplotype block 
A combination of alleles or other DNA markers that has been 
inherited as a unit, undisturbed by genetic recombination, 
across many generations. 


helix 
An elongated structure whose subunits twist in a regular 
fashion around a central axis, like a spiral staircase. 


hemidesmosome 
Structure that anchors epithelial cells to the basal lamina 
beneath them. 


heterochromatin 
Highly condensed region of an interphase chromosome; 
generally gene-poor and transcriptionally inactive. (See also 
euchromatin.) 


heterozygous 
Possessing dissimilar alleles for a given gene. 


histone 
One of a small group of abundant, highly conserved proteins 
around which DNA wraps to form nucleosomes, structures 
that represent the most fundamental level of chromatin 
packing. 


histone-modifying enzyme 
Enzyme that catalyzes the covalent addition of a small 
molecule, such as a methyl or acetate group, to a specific 
amino acid side chain on a histone. 


homolog 
A gene, chromosome, or any structure that has a close 
similarity to another as a result of common ancestry. 


homologous 
Describes genes, chromosomes, or any structures that are 
similar because of their common evolutionary origin. Can 
also refer to similarities between protein sequences or 
nucleic acid sequences. 


homologous gene—sce homologous. 


homologous recombination 
Mechanism by which double-strand breaks in a DNA 
molecule can be repaired flawlessly; uses an undamaged, 
duplicated, or homologous chromosome to guide the repair. 
During meiosis, the mechanism results in an exchange of 
genetic information between the maternal and paternal 
homologs. 


homozygous 
Possessing identical alleles for a given gene. 


horizontal gene transfer 
Process by which DNA is passed from the genome of one 
organism to that of another, even to an individual from 
another species. This contrasts with “vertical” gene transfer, 
which refers to the transfer of genetic information from 
parent to progeny. 


hormone 
Extracellular signal molecule that is secreted and 
transported via the bloodstream (in animals) or the sap (in 
plants) to target tissues on which it exerts a specific effect. 


hybridization 
Experimental technique in which two complementary 
nucleic acid strands come together and form hydrogen 
bonds to produce a double helix; used to detect specific 
nucleotide sequences in either DNA or RNA. 


hydrogen bond 
A weak noncovalent interaction H 
between a positively charged 
hydrogen atom in one molecule 
and a negatively charged atom, 
such as nitrogen or oxygen, in another; hydrogen bonds are 
key to the structure and properties of water. 


O IMi H — O — 


hydrolysis 
Chemical reaction that involves cleavage of a covalent 
bond with the accompanying consumption of water (its -H 
being added to one product of the cleavage and its -OH to 
the other); the reverse of a condensation reaction. 


hydronium ion 
The form taken by a proton (H+) in aqueous solution. 


hydrophilic 
Molecule or part of a molecule that readily forms hydrogen 
bonds with water, allowing it to readily dissolve; literally, 
“water loving.” 


hydrophobic 
Nonpolar, uncharged molecule or part of a molecule 
that forms no hydrogen bonds with water molecules and 
therefore does not dissolve; literally, “water fearing.” 


hydrophobic force 
A noncovalent interaction that forces together the 
hydrophobic portions of dissolved molecules to minimize 
their disruption of the hydrogen-bonded network of water; 
causes membrane phospholipids to self-assemble into a 
bilayer and helps to fold proteins into a compact, globular 
shape. 


in situ hybridization 
Technique in which a single-stranded RNA or DNA probe 
is used to locate a complementary nucleotide sequence 
in a chromosome, cell, or tissue; used to diagnose genetic 
disorders or to track gene expression. 
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induced pluripotent stem (iPS) cell 
Somatic cell that has been reprogrammed to resemble 
and behave like a pluripotent embryonic stem (ES) cell 
through the artificial introduction of a set of genes encoding 
particular transcription regulators. 


initiator tRNA 


Special tRNA that initiates the translation of an mRNA in a 
ribosome. It always carries the amino acid methionine. 


inorganic 
Not composed of carbon atoms. 


inositol 1,4,5-trisphosphate (IP3) 
Small intracellular signaling molecule that triggers the 
release of Ca? from the endoplasmic reticulum into the 
cytosol; produced when a signal molecule activates a 
membrane-bound protein called phospholipase C. 


inositol phospholipid 
Minor lipid component of plasma membranes that plays a 
part in signal transduction in eukaryotic cells; cleavage yields 
two small messenger molecules, IP3 and diacylglycerol. 


integrin 
One of a family of transmembrane proteins present on cell 
surfaces that enable cells to make and break attachments 


to the extracellular matrix, allowing them to crawl through 
a tissue. 


intermediate filament 
Fibrous cytoskeletal element, about 10 nm in diameter, that 
forms ropelike networks in animal cells; helps cells resist 
tension applied from outside. 


interphase 
Long period of the cell cycle between one mitosis and the 
next. Includes G; phase, S phase, and G2 phase. 


intracellular condensate 
A large aggregate of phase-separated macromolecules that 
creates a region with a special biochemistry without the use 
of an encapsulating membrane. 


intracellular signaling pathway 
A set of proteins and small-molecule second messengers 
that interact with each other to relay a signal from the 
cell membrane to its final destination in the cytoplasm or 
nucleus. 


intrinsically disordered sequence 
Region in a polypeptide chain that lacks a definite structure. 


intron 
Noncoding sequence within a eukaryotic gene that is 
transcribed into an RNA molecule but is then excised by 
RNA splicing to produce an MRNA. 

ion 
An atom carrying an electrical charge, either positive or 
negative. 


ion channel 
Transmembrane protein that forms a pore across the lipid 
bilayer through which specific inorganic ions can diffuse 


down their electrochemical gradients. 
ionic bond 


Interaction formed when one atom donates electrons to 
another; this transfer of electrons causes both atoms to 
become electrically charged. 


ion-channel-coupled receptor 
Transmembrane receptor protein or 
protein complex that opens in response to 
the binding of a ligand to its external face, 
allowing the passage of a specific inorganic 
ion. 
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iron-sulfur center 
Tightly bound metal complex that carries electrons in 
proteins that operate early in the electron-transport chain; 
has a relatively weak affinity for electrons. 


K* leak channel 
Ion channel permeable to K+ that randomly flickers between 
an open and closed state; largely responsible for the resting 
membrane potential in animal cells. 


karyotype 
An ordered display of the full set of chromosomes of a cell, 
arranged with respect to size, shape, and number. 


keratin filament 
Class of intermediate filament abundant in epithelial 
cells, where it provides tensile strength; main structural 


component of hair, feathers, and claws. 
kinetochore 


Protein complex that assembles on the centromere of a 
condensed mitotic chromosome; the site to which spindle 
microtubules attach. 


kinesin 
A large family of motor proteins that uses 
the energy of ATP hydrolysis to move 
toward the plus end of a microtubule. 


L1 element 
Type of retrotransposon that constitutes 15% of the human 
genome; also called LINE-1. 


lagging strand 
At a replication fork, the DNA strand that is made 
discontinuously in short fragments that are later joined 
together to form one continuous new strand. 


lamellipodium 
Dynamic sheetlike extension on the surface of an animal 
cell, especially one migrating over a surface. 


law of independent assortment 
Principle that, during gamete formation, the alleles for 
different traits segregate independently of one another; 
Mendel's second law of inheritance. 


law of segregation 
Principle that the maternal and paternal alleles for a trait 
separate from one another during gamete formation and 
then reunite during fertilization; Mendel’s first law of 
inheritance. 


leading strand 
At a replication fork, the DNA strand that is made by 
continuous synthesis in the 5'-to-3’ direction. 


ligand 
General term for a small molecule that binds to a specific 
site on a macromolecule. 


ligand-gated channel 
An ion channel that is stimulated to open by the binding of 
a small molecule such as a neurotransmitter. 


light reactions 
In photosynthesis, the set of reactions that converts the 
energy of sunlight into chemical energy in the form of ATP 
and NADPH (stage 1 of photosynthesis). 

lipid 
An organic molecule that is insoluble in water but dissolves 
readily in nonpolar organic solvents; typically contains 
long hydrocarbon chains or multiple rings. One class, the 
phospholipids, forms the structural basis for biological 
membranes. 


lipid bilayer 
Thin pair of closely juxtaposed sheets, composed mainly of 


phospholipid molecules, that forms the structural basis for 
all cell membranes. 


local mediator 
Secreted signal molecule that acts at a short range on 
adjacent cells. 


long noncoding RNA 
Class of RNA molecules more than 200 nucleotides in length 
that does not encode proteins. Often used to regulate gene 
expression. 


loss-of-function mutation 
A genetic alteration that reduces or eliminates the activity of 
a gene. Such mutations are usually recessive: the organism 
can function normally as long as it retains at least one 
normal copy of the affected gene. 


lysosome 
Membrane-enclosed organelle that breaks down worn-out 
proteins and organelles and other waste materials, as well 
as molecules taken up by endocytosis; contains digestive 
enzymes that are typically most active at the acid pH found 
inside these organelles. 


lysozyme 
Enzyme that severs the polysaccharide chains that form the 
cell walls of bacteria; found in many secretions including 
saliva and tears, where it serves as an antibiotic. 


M cyclin 
Regulatory protein that binds to mitotic Cdk to form M-Cdk, 
the protein complex that triggers the M phase of the cell 
cycle. 


M phase 
Period of the eukaryotic cell cycle during which the nucleus 
and cytoplasm divide. 


M-Cdk 
Protein complex that triggers the M phase of the cell cycle; 
consists of an M cyclin plus a mitotic cyclin-dependent 
protein kinase (Cdk). 


macromolecule 
Polymer built from covalently linked subunits; includes 
proteins, nucleic acids, and polysaccharides with a 
molecular mass greater than a few thousand daltons. 


MAP kinase 
Mitogen-activated protein kinase. Signaling molecule that 
is the final kinase in a three-kinase sequence called the 
MAP-kinase signaling module. 


MAP-kinase signaling module 
Set of three functionally interlinked protein kinases that 
allows cells to respond to extracellular signal molecules 
that stimulate proliferation; includes a mitogen-activated 
protein kinase (MAP kinase), a MAP kinase kinase, and a 
MAP kinase kinase kinase. 


mass spectrometry 
Sensitive technique that enables the determination of the 
exact mass of all of the molecules in a complex mixture. 


matrix 
Large internal compartment within a mitochondrion. 


mechanically-gated channel 
An ion channel that allows the passage of select ions across 
a membrane in response to a physical perturbation. 


meiosis 
Specialized type of cell division by which eggs and sperm 
cells are made. Two successive nuclear divisions with only 
one round of DNA replication generate four haploid cells 
from an initial diploid cell. 


membrane domain 
Functionally and structurally specialized region in the 


membrane of a cell or organelle; typically characterized by 
the presence of specific proteins. 


membrane potential 
Voltage difference across a membrane due to a slight 
excess of positive ions on one side and of negative ions on 
the other. 


membrane protein 
A protein associated with the lipid bilayer of a cell 
membrane. 


membrane transport protein 
Any transmembrane protein that provides a passageway for 
the movement of select substances across a cell membrane. 


membrane-enclosed organelle 
Any organelle in a eukaryotic cell that is surrounded by a 
lipid bilayer—for example, the endoplasmic reticulum, Golgi 
apparatus, and lysosome. 


messenger RNA (mRNA) 
RNA molecule that specifies the amino acid sequence of a 
protein. 


metabolism 
The sum total of the chemical reactions that take place in 
the cells of a living organism. 


metaphase 
Stage of mitosis in which chromosomes are properly 
attached to the mitotic spindle at its equator but have not 
yet segregated toward opposite poles. 


metastasis 
The spread of cancer cells from the initial site of the tumor 
to form secondary tumors at other sites in the body. 


Michaelis constant (Km) 
The concentration of substrate at which an enzyme works 
at half its maximum velocity; serves as a measure of how 
tightly the substrate is bound. 


micrometer 
Unit of length equal to one millionth (10%) of a meter or 
10 centimeter. 


microRNA (miRNA) 
Small noncoding RNA that controls gene expression by 
base-pairing with a specific mRNA to regulate its stability 
and its translation. 


microscope 
Instrument for viewing extremely small objects. Some use a 
focused beam of visible light and are used to examine cells 
and organelles. Others use a beam of electrons and can be 
used to examine objects as small as individual molecules. 


microtubule 


Long, stiff, cylindrical EREEREER ET 
structure composed of BERR 


the protein tubulin. Used 
by eukaryotic cells to organize their cytoplasm and guide the 
intracellular transport of macromolecules and organelles. 


microtubule-associated protein 
Accessory protein that binds to microtubules; can stabilize 
microtubule filaments, link them to other cell structures, or 
transport various components along their length. 


mismatch repair 
Mechanism for recognizing and correcting incorrectly 
paired nucleotides—those that are noncomplementary. 


mitochondrion 
Membrane-enclosed organelle, about the size of a 
bacterium, that carries out oxidative phosphorylation and 
produces most of the ATP in eukaryotic cells. 
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mitogen 
An extracellular signal molecule that stimulates cell 
proliferation. 


mitosis 
Division of the nucleus of a eukaryotic cell. 


mitotic spindle 
Array of microtubules and associated molecules that forms 
between the opposite poles of a eukaryotic cell during 
mitosis and pulls duplicated chromosome sets apart. 


mobile genetic element 
Short segment of DNA that can move, sometimes through 
an RNA intermediate, from one location in a genome to 
another; an important source of genetic variation in most 
genomes. Also called a transposon. 


model organism 
A living thing selected for intensive study as a representative 
of a large group of species. Examples include the mouse 
(representing mammals), the yeast Saccharomyces cerevisiae 
(representing a unicellular eukaryote), and Escherichia coli 
(representing bacteria). 


molecular switch 
Intracellular signaling protein that toggles between an 
active and inactive state in response to receiving a signal. 


molecular weight 
Sum of the atomic weights of the atoms in a molecule; as 
a ratio of molecular masses, it is a number without units. 


molecule 
Group of atoms joined together by covalent bonds. 


monomer 
Small molecule that can be linked to others of a similar type 
to form a larger molecule (polymer). 


monomeric GTPase 
Small, single-subunit GTP-binding protein. Proteins of this 
family, such as Ras and Rho, are part of many different 
signaling pathways. 


motor protein 
Protein such as myosin or kinesin that uses energy derived 
from the hydrolysis of a tightly bound ATP molecule to 
propel itself along a protein filament or polymeric molecule. 


mutation 
A randomly produced, permanent change in the nucleotide 
sequence of DNA. 


myofibril 
Long, cylindrical structure that constitutes the contractile 
element of a muscle cell; constructed of arrays of highly 
organized bundles of actin, myosin, and other accessory 
proteins. 


myosin 
Type of motor protein that uses ATP to drive movements 
along actin filaments. One subtype interacts with actin to 
form the thick contractile bundles of skeletal muscle. 


myosin filament 
Polymer composed of interacting molecules of myosin-II; 
interaction with actin promotes contraction in muscle and 
nonmuscle cells. 


myosin-I 
Simplest type of myosin, present in all cells; consists of a 
single actin-binding head and a tail that can attach to other 
molecules or organelles. 


myosin-II 
Type of myosin that exists as a 
dimer with two actin-binding 
heads and a coiled-coil tail; 
can associate to form long myosin filaments. 
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N-terminus 
The end of a polypeptide chain that carries a free a-amino 
group. 


Na* pump (or Na*-K* ATPase) 
Transporter found in the plasma membrane of most animal 
cells that actively pumps Na* out of the cell and K* in using 
the energy derived from ATP hydrolysis. 


NAD* 
A molecule that accepts a hydride ion (H-7) from a donor 
molecule, thereby producing the activated carrier NADH. 
Widely used in the energy-producing breakdown of sugar 
molecules. (See Figure 3-34.) 


NADH 
Activated carrier of electrons that is widely used in the 
energy-producing breakdown of sugar molecules. (See 
Figure 3-34.) 


NADP* 
Molecule that accepts a hydride ion (H-) from a donor 
molecule, thereby producing the activated carrier NADPH; 
widely used as an electron donor in biosynthetic pathways. 


NADPH 
Activated carrier closely related to NADH and used as an 
electron donor in biosynthetic pathways. In the process it is 
oxidized to NADP*. 


Nernst equation 
An equation that relates the concentrations of an inorganic 
ion on the two sides of a permeable membrane to the 
membrane potential at which there would be no net 
movement of the ion across the membrane. 


nerve terminal 
Structure at the end of an axon that signals to another 
neuron or target cell. 


neuron 
An electrically excitable cell that integrates and transmits 
information as part of the nervous system; a nerve cell. 


neurotransmitter 
Small signaling molecule secreted by a nerve cell at a 
synapse to transmit information to a postsynaptic cell. 
Examples include acetylcholine, glutamate, GABA, and 
glycine. 


nitric oxide (NO) 
Locally acting gaseous signal molecule that diffuses 
across cell membranes to affect the activity of intracellular 
proteins. 


nitrogen fixation 
Conversion of nitrogen gas from the atmosphere into 
nitrogen-containing molecules by soil bacteria and 
cyanobacteria; requires a great deal of energy. 


noncovalent bond 
Chemical association that does not involve the sharing 
of electrons; singly they are relatively weak, but they can 
sum together to produce strong, highly specific interactions 
between molecules. Examples are hydrogen bonds and van 
der Waals attractions. 


nonhomologous end joining 
An error-prone mechanism for repairing double-strand 
breaks in DNA by rejoining the two broken ends; often 
results in a loss of information at the site of repair. 


nuclear envelope 
Double membrane surrounding the nucleus. Consists of 
outer and inner membranes, perforated by nuclear pores. 


nuclear lamina 
Fibrous layer on the inner surface of the inner nuclear 
membrane formed as a network of intermediate filaments 
made from nuclear lamins. 


nuclear magnetic resonance (NMR) spectroscopy 
Technique used for determining the three-dimensional 
structure of a protein in solution. 


nuclear pore 
Channel through which selected large 
molecules move between the nucleus 
and the cytoplasm. 


nuclear receptor 
Protein inside a eukaryotic cell that, on binding to a signal 
molecule, enters the nucleus and regulates transcription. 


nucleolus 
Large structure within the nucleus where ribosomal RNA is 
transcribed and ribosomal subunits are assembled. 


nucleosome 
Beadlike structural unit of a eukaryotic chromosome 
composed of a short length of DNA wrapped around an 
octameric core of histone proteins; includes a nucleosomal 
core particle (DNA plus histone protein) along with a 
segment of linker DNA that ties the core particles together. 


nucleotide 
Basic building block of the nucleic acids, DNA and RNA; a 
nucleoside linked to a phosphate. 


nucleus 
In biology, refers to the prominent, rounded structure that 
contains the DNA of a eukaryotic cell. In chemistry, refers to 
the dense, positively charged center of an atom. 


Okazaki fragment 
Short length of DNA, including an RNA primer, produced on 
the lagging strand during DNA replication. Following primer 
removal, adjacent fragments are rapidly joined together by 
DNA ligase to form a continuous DNA strand. 


oncogene 
A gene that, when activated, can potentially make a cell 
cancerous. Typically a mutant form of a normal gene (proto- 
oncogene) involved in the control of cell growth or division. 


open reading frame (ORF) 
Long sequence of nucleotides that contains no stop codon; 
used to identify potential protein-coding sequences in DNA. 


optogenetics 
Technique that uses light to control the activity of neurons 
into which light-gated ion channels have been artificially 
introduced. 


organelle 
A discrete structure or subcompartment of a eukaryotic 
cell that is specialized to carry out a particular function. 
Examples include mitochondria and the Golgi apparatus. 


organic molecule 
Chemical compound that contains carbon and hydrogen. 


organoid 
A miniature, three-dimensional collection of tissues formed 
from the proliferation, differentiation, and self-assembly of 
pluripotent cells in culture. 


osmosis 
Passive movement of water across a cell membrane from 
a region where the concentration of water is high (because 
the concentration of solutes is low) to a region where the 
concentration of water is low (and the concentration of 
solutes is high). 


oxidation 
Removal of electrons from an atom, as occurs during the 
addition of oxygen to a carbon atom or when a hydrogen 
is removed from a carbon atom; can also refer to a partial 
shift of electrons between atoms linked by a covalent bond. 


oxidative phosphorylation 
Membrane-based process in bacteria and mitochondria in 
which ATP formation is driven by the transfer of electrons 
from food molecules to molecular oxygen. 


p53 
Transcription regulator that controls the cell’s response to 
DNA damage, preventing the cell from entering S phase 
until the damage has been repaired or inducing the cell to 
commit suicide if the damage is too extensive; mutations in 
the gene encoding this protein are found in many human 
cancers. 

pairing 
In meiosis, the process by which a pair of duplicated 
homologous chromosomes attach to one another to form a 
structure containing four sister chromatids. 


passive transport 
The spontaneous movement of a solute down its 
concentration gradient across a cell membrane via a 
membrane transport protein, such as a channel or a 
transporter. 


patch-clamp recording 
Technique used to monitor the activity of ion channels in a 
membrane; involves the formation of a tight seal between 
the tip of a glass electrode and a small region of cell 
membrane, and manipulation of the membrane potential 
by varying the concentrations of ions in the electrode. 


pedigree 
Chart showing the line of descent, or ancestry, of an 
individual organism. 


peptide bond 
Covalent chemical bond between the T 
carbonyl group of one amino acid and the =—C—N— 
amino group of a second amino acid. (See l 
Panel 2-6, pp. 76-77.) i 


peroxisome 
Small membrane-enclosed organelle that contains enzymes 
that degrade lipids and destroy toxins. 


pH scale 
Concentration of hydrogen ions in a solution, expressed 
as a logarithm. An acidic solution with pH 3 will contain 
10 M hydrogen ions. 


phagocytic cell 
A cell such as a macrophage or neutrophil that is specialized 
to take up particles and microorganisms by phagocytosis. 


phagocytosis 
The process by which particulate material is engulfed 
(“eaten”) by a cell. Prominent in predatory cells, such as 
Amoeba proteus, and in cells of the vertebrate immune 
system, such as macrophages. 


phenotype 
The observable characteristics of a cell or organism. 


phosphatidylcholine 
Common phospholipid present in abundance in most cell 
membranes; uses choline attached to a phosphate as its 
head group. 


phosphoinositide 3-kinase (PI 3-kinase) 
Enzyme that phosphorylates inositol phospholipids in 
the plasma membrane, which generates docking sites for 
intracellular signaling proteins that promote cell growth 
and survival. 


phospholipase C 
Enzyme associated with the plasma membrane that 
generates two small messenger molecules in response to 
activation. 
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phospholipid @ 
A major type of lipid molecule in many cell 
membranes. Generally composed of two fatty N 
acid tails linked to one of a variety of phosphate- 
containing polar groups. 


photosynthesis 
The process by which plants, algae, and some bacteria 
use the energy of sunlight to drive the synthesis of organic 
molecules from carbon dioxide and water. 


photosystem 
Large multiprotein complex containing chlorophyll that 
captures light energy and converts it into chemical-bond 
energy; consists of a set of antenna complexes and a 
reaction center. 


phragmoplast 
In a dividing plant cell, structure made of microtubules and 
membrane vesicles that guides the formation of a new cell 
wall. 


phylogenetic tree 
Diagram or “family tree” showing the evolutionary 
relationships among groups of organisms or proteins. 


pinocytosis 
Type of endocytosis in which soluble materials are taken 
up from the environment and incorporated into vesicles for 
digestion. (Literally, “cell drinking.”) 


plasma membrane 
The protein-containing lipid bilayer that surrounds a living 
cell. 


plasmid 
Small, circular DNA molecule that replicates independently 
of the genome. Used extensively as a vector for DNA 
cloning. 


plasmodesma (plural plasmodesmata) 
Cell-cell junction that connects one plant cell to the next; 
consists of a channel of cytoplasm lined by membrane. 


pluripotent 
Capable of giving rise to any type of cell or tissue. 


point mutation 
Change in a single nucleotide pair in a DNA sequence. 


polar 
In chemistry, describes a molecule or bond in which 
electrons are distributed unevenly. 


polarity 
An inherent asymmetry that allows one end of an object 
to be distinguished from another; can refer to a molecule, 
a polymer (such as an actin filament), or even a cell (for 
example, an epithelial cell that lines the mammalian small 
intestine). 


polyadenylation 
The addition of multiple adenine nucleotides to the 3' end of 
a newly synthesized mRNA molecule. 


polymer 
Long molecule made by covalently linking multiple identical 
or similar subunits (monomers). 


polymerase chain reaction (PCR) 
Technique for amplifying selected regions of DNA by 
multiple cycles of DNA synthesis; can produce billions of 
copies of a given sequence in a matter of hours. 


polymorphism 
DNA sequence for which two or more variants are present 
at high frequency in the general population. 


polypeptide backbone 
Repeating sequence of the atoms (-N-C-C-) that form the 
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core of a protein molecule and to which the amino acid side 
chains are attached. 


polypeptide, polypeptide chain 
Linear polymer composed of multiple amino acids. Proteins 
are composed of one or more long polypeptide chains. 


=x +++) 
or pathway stimulates continued 


production or activity; controls a © 


variety of biological processes, including enzyme activity, 
cell signaling, and gene expression. 


positive feedback loop 
An important form of regulation in 
which the end product of a reaction 


post-transcriptional control 
Regulation of gene expression that occurs after transcription 
of the gene has begun; examples include RNA splicing and 
translational control. 


primary structure 
The amino acid sequence of a protein. 


primase 
An RNA polymerase that uses DNA as a template to produce 
an RNA fragment that serves as a primer for DNA synthesis. 


programmed cell death 
A tightly controlled form of cell suicide that allows cells that 
are unneeded or unwanted to be eliminated from an adult 
or developing organism; the major form is called apoptosis. 


prokaryote 
Major category of living cells distinguished by the absence 
of a nucleus; includes the archaea and the eubacteria 
(commonly called bacteria). 


prometaphase 
Stage of mitosis in which the nuclear envelope breaks down 
and duplicated chromosomes are captured by the spindle 
microtubules; precedes metaphase. 


promoter 
DNA sequence that initiates gene transcription; includes 
sequences recognized by RNA polymerase and its accessory 
proteins. 


promoter 
DNA sequence that initiates gene transcription; includes 
sequences recognized by RNA polymerase and its accessory 
proteins. 


proofreading 
The process by which DNA polymerase corrects its own 
errors as it moves along DNA. 


prophase 
First stage of mitosis, during which the duplicated 
chromosomes condense and the mitotic spindle forms. 


protease 
Enzyme that degrades proteins by hydrolyzing their peptide 
bonds. 


proteasome 
Large protein machine that degrades 
proteins that are damaged, misfolded, or no 
longer needed by the cell; its target proteins 
are marked for destruction primarily by the 
attachment of a short chain of ubiquitin. 


protein 
Macromolecule built from amino acids that provides cells 
with their shape and structure and performs most of their 
activities. 


protein domain 
Segment of a polypeptide chain that can fold into a compact, 
stable structure and that often carries out a specific function. 


protein family 
A group of polypeptides that share a similar amino acid 
sequence or three-dimensional structure, reflecting a 
common evolutionary origin. Individual members often 
have related but distinct functions, such as kinases that 
phosphorylate different target proteins. 


protein kinase 
Enzyme that catalyzes the transfer of a phosphate group 
from ATP to a specific amino acid side chain on a target 
protein. 


protein kinase C (PKC) 
Enzyme that phosphorylates target proteins in response to 
a rise in diacylglycerol and Ca?* ions. 


protein machine 
Assembly of protein molecules that operates as a 
cooperative unit to perform a complex series of biological 
activities, such as replicating DNA. 


protein phosphatase 
Enzyme that catalyzes the removal of a phosphate 
group from a protein, often with high specificity for the 
phosphorylated site. 


protein phosphorylation 
The covalent addition of a phosphate group to a side chain 
of a protein, catalyzed by a protein kinase; serves as a form 
of regulation that usually alters the activity or properties of 
the target protein. 


proteoglycan 
Molecule consisting of one or more glycosaminoglycan 
chains attached to a core protein; these aggregates can 
form gels that regulate the passage of molecules through 
the extracellular medium and guide cell migration. 


proto-oncogene 
Gene that when mutated or overexpressed can transform a 
normal cell into a cancerous one. 


proton 
Positively charged particle found in the nucleus of every 
atom; also, another name for a hydrogen ion (H+). 


protozoan 
A free-living, 
eukaryote. 


pump 
Transporter that uses a source of energy, such as ATP 
hydrolysis or sunlight, to actively move a solute across a 
membrane against its electrochemical gradient. 


nonphotosynthetic, single-celled, motile 


purifying selection 
Preservation of a specific nucleotide sequence by the 
elimination of individuals carrying mutations that interfere 
with its functions. 


pyruvate EN ZF 
Three-carbon metabolite that is the end C 
product of the glycolytic breakdown of l 
glucose; provides a crucial link to the C=O 
citric acid cycle and many biosynthetic | 
pathways. CH; 


quaternary structure 
Complete structure formed by multiple, interacting 
polypeptide chains that form a larger protein molecule. 


quinone 
Small, lipid-soluble, mobile electron carrier molecule that 
functions in the respiratory and photosynthetic electron- 
transport chains. (See Figure 14-21.) 


Rab protein 
One of a family of small GTP-binding proteins present on 
the surfaces of transport vesicles and organelles that serves 


as a molecular marker to help ensure that transport vesicles 
fuse only with the correct membrane. 


Ras 
One of a large family of small GTP-binding proteins (the 
monomeric GTPases) that helps relay signals from cell- 
surface receptors to the nucleus. Many human cancers 
contain an overactive mutant form of the protein. 


reaction center 
In photosynthetic membranes, a protein complex that 
contains a special pair of chlorophyll molecules; it performs 
the photochemical reactions that convert the energy of 
photons (light) into high-energy electrons for transport 
down the photosynthetic electron-transport chain. 


reading frame 
One of the three possible ways in which a set of successive 
nucleotide triplets can be translated into protein, depending 
on which nucleotide serves as the starting point. 


receptor 
Protein that recognizes and responds to a specific signal 
molecule. 


receptor serine/threonine kinase 
Enzyme-coupled receptor that phosphorylates 
proteins on serine or threonine. 


target 


receptor tyrosine kinase (RTK) 
Enzyme-coupled receptor in which the 
intracellular domain has a tyrosine kinase 
activity, which is activated by ligand 
binding to the receptor’s extracellular 
domain. 


receptor-mediated endocytosis 
Mechanism of selective uptake of material by animal cells 
in which a macromolecule binds to a receptor in the plasma 
membrane and enters the cell in a clathrin-coated vesicle. 


recombinant DNA 
A DNA molecule that is composed of DNA sequences from 
different sources. 


redox pair 
Two molecules that can be interconverted by the gain or 
loss of an electron; for example, NADH and NAD‘. 


redox potential 
A measure of the tendency of a given redox pair to donate 
or accept electrons. 


redox reaction 
A reaction in which electrons are transferred from one 
chemical species to another. An oxidation-reduction 
reaction. 


reduction 
Addition of electrons to an atom, as occurs during the 
addition of hydrogen to a carbon atom or the removal of 
oxygen from it; can also refer to a partial shift of electrons 
between atoms linked by a covalent bond. 


regulatory DNA sequence 
DNA sequence to which a transcription regulator binds to 
determine when, where, and in what quantities a gene is to 
be transcribed into RNA. 


regulatory RNA 
RNA molecule that plays a role in controlling gene 
expression. 


replication fork 

y-shaped junction at the site where DNA is being replicated. 
replication origin 

Nucleotide sequence at which DNA replication is initiated. 


reporter gene 
Gene encoding a protein whose activity is easy to monitor 
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experimentally; used to study the expression pattern of a 
target gene or the localization of its protein product. 


respiratory enzyme complex 
Set of proteins in the inner mitochondrial membrane that 
facilitates the transfer of high-energy electrons from NADH 
to water while pumping protons into the intermembrane 
space. 


resting membrane potential 
Voltage difference across the plasma membrane when a 
cell is not stimulated. 


restriction enzyme 
Enzyme that can cleave a 
DNA molecule at a specific, > 
short sequence of nucleotides. 3 
Extensively used in recombinant 
DNA technology. 


retrotransposon 
Type of mobile genetic element that moves by being first 
transcribed into an RNA copy that is reconverted to DNA 
by reverse transcriptase and inserted elsewhere in the 
chromosomes. 


retrovirus 
RNA-containing virus that replicates in a cell by first 
making a double-stranded DNA intermediate that becomes 
integrated into the cell's chromosome. 


reverse transcriptase 
Enzyme that makes a double-stranded DNA copy from 
a single-stranded RNA template molecule. Present in 
retroviruses and as part of the transposition machinery of 
retrotransposons. 


Rho protein family 
Family of small, monomeric GTPases that controls the 
organization of the actin cytoskeleton. 


ribosomal RNA (rRNA) 
RNA molecule that forms the structural and catalytic core 
of the ribosome. 


ribosome 
Large macromolecular complex, composed of RNAs and 
proteins, that translates a messenger RNA into a polypeptide 
chain. 


ribozyme 
An RNA molecule with catalytic activity. 


RNA 
Molecule produced by the transcription of DNA; usually 
single-stranded, it is a polynucleotide composed of 
covalently linked ribonucleotide subunits. Serves a variety 
of informational, structural, catalytic, and regulatory 
functions in cells. 


RNA (ribonucleic acid) 
Molecule produced by the transcription of DNA; usually 
single-stranded, it is a polynucleotide composed of 
covalently linked ribonucleotide subunits. Serves a variety 
of informational, structural, catalytic, and regulatory 
functions in cells. 


RNA capping 
The modification of the 5’ end of a maturing RNA transcript 
by the addition of an atypical nucleotide. 


RNA interference (RNAi) 
Cellular mechanism activated by double-stranded RNA 
molecules that results in the destruction of RNAs containing 
a similar nucleotide sequence. It is widely exploited as an 
experimental tool for preventing the expression of selected 
genes (gene silencing). 


RNA polymerase 
Enzyme that catalyzes the synthesis of an RNA molecule 
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from a DNA template using ribonucleoside triphosphate 
precursors. 


RNA processing 
Broad term for the modifications that a precursor MRNA 
undergoes as it matures into an MRNA. It typically includes 
5' capping, RNA splicing, and 3’ polyadenylation. 


RNA splicing 
Process in which intron sequences are excised from RNA 
molecules in the nucleus during the formation of a mature 
messenger RNA. 


RNA transcript 
RNA molecule produced by transcription that is 
complementary to one strand of DNA. 


RNA world 
Hypothetical period in Earth’s early history in which life- 
forms were thought to use RNA both to store genetic 
information and to catalyze chemical reactions. 


RNA-Seq 
Sequencing technique used to determine directly the 
nucleotide sequence of a collection of RNAs. 
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rough endoplasmic reticulum 
Region of the endoplasmic reticulum 
associated with ribosomes and involved in 
the synthesis of secreted and membrane- 
bound proteins. 


S cyclin 
Regulatory protein that helps to launch the S phase of the 
cell cycle. 


S phase 
Period during a eukaryotic cell cycle in which DNA is 
synthesized. 


S-Cdk 
Protein complex whose activity initiates DNA replication; 
consists of an S cyclin plus a cyclin-dependent protein 
kinase (Cdk). 


sarcomere 
Highly organized assembly of actin and myosin filaments 
that serves as the contractile unit of a myofibril in a muscle 
cell. 


saturated 
Describes an organic molecule that contains a full 
complement of hydrogen; in other words, no double or 
triple carbon-carbon bonds. 


scaffold protein 
Protein with multiple binding sites for other macromolecules, 
holding them in a way that speeds up their functional 
interactions. 


secondary structure 
Regular local folding pattern of a polymeric molecule. In 
proteins, it refers to a helices and f sheets. 


secretion 
Production and release of a substance from a cell. 


secretory vesicle 
Membrane-enclosed organelle in which molecules destined 
for secretion are stored prior to release. 


sequence 
The linear order of monomers in a large molecule—for 
example, amino acids in a protein or nucleotides in DNA; 
encodes information that specifies a macromolecule’s 
precise biological function. 


serine/threonine kinase 
Enzyme that phosphorylates target proteins on serines or 
threonines. 


sexual reproduction 
Mode of reproduction in which the genomes of two 
individuals are mixed to produce an individual that is 
genetically distinct from its parents. 


side chain 
Portion of an amino acid not involved in forming peptide 
bonds; its chemical identity gives each amino acid unique 
properties. 


signal sequence 
Amino acid sequence that directs a protein to a specific 
location in the cell, such as the nucleus or mitochondria. 


signal transduction 
Conversion of an impulse or stimulus from one physical or 
chemical form to another. In cell biology, the process by 
which a cell responds to an extracellular signal. 


single-nucleotide polymorphism (SNP) 
Form of genetic variation in which one portion of the 
population differs from another in terms of which nucleotide 
is found at a particular position in the genome. 


sister chromatid 
Copy of a chromosome, produced by DNA replication, that 
remains bound to the other copy. 


small interfering RNA (siRNA) 
Short length of RNA produced from double-stranded RNA 
during the process of RNA interference. It base-pairs 
with identical sequences in other RNAs, leading to the 
inactivation or destruction of the target RNA. 


small nuclear RNA (snRNA) 
RNA molecule of around 200 nucleotides that participates 
in RNA splicing. 


SNARE 
One of a family of membrane proteins responsible for the 
selective fusion of vesicles with a target membrane inside 
the cell. 


SNP (single-nucleotide polymorphism) 
Form of genetic variation in which one portion of the 
population differs from another in terms of which nucleotide 
is found at a particular position in the genome. 


somatic cell 
Any cell that forms part of the body of a plant or animal that 
is not a germ cell or germ-line precursor. 


spindle pole 
Centrosome from which microtubules radiate to form the 
mitotic spindle. 


spliceosome 
Large assembly of RNA and protein molecules that splices 
introns out of pre-mRNA in the nucleus of eukaryotic cells. 


standard free-energy change, AG° 
The free-energy change measured at a defined concentration, 
temperature, and pressure. (See also free-energy change.) 


starch 
Polysaccharide composed exclusively of glucose units, used 
as an energy store in plant cells. 


stem cell 
Relatively undifferentiated, self-renewing cell that produces 
daughter cells that can either differentiate into more 
specialized cell types or can retain the developmental 
potential of the parent cell. 


steroid hormone 
Hydrophobic signal molecule related to cholesterol; can 
pass through the plasma membrane to interact with 
intracellular receptors that affect gene expression in the 
target cell. Examples include estrogen and testosterone. 


stroma 
In a chloroplast, the large interior space that contains the 
enzymes needed to incorporate CO? into sugars during the 
carbon-fixation stage of photosynthesis; equivalent to the 
matrix of a mitochondrion. 


substrate 
A molecule on which an enzyme acts to catalyze a chemical 
reaction. 


substrate 
A molecule on which an enzyme acts to catalyze a chemical 
reaction. 


subunit 
A monomer that forms part of a larger molecule, such as an 
amino acid residue in a protein or a nucleotide residue in 
a nucleic acid. Can also refer to a complete molecule that 
forms part of a larger molecule. Many proteins, for example, 
are composed of multiple polypeptide chains, each of which 
is called a protein subunit. 


sugar 
a substance made of carbon, hydrogen, and oxygen with 
the general formula (CH2O)n. A carbohydrate or saccharide. 
The “sugar” of everyday use is sucrose, a sweet-tasting 
disaccharide made of glucose and fructose. 


survival factor 
Extracellular signal molecule that must be present to 
suppress apoptosis. 


symport 
A transporter that transfers two different solutes across a 


cell membrane in the same direction. 


synapse 
Specialized junction where a nerve cell 
communicates with another cell (such as 
a nerve cell, muscle cell, or gland cell), 
usually via a neurotransmitter secreted by 
the nerve cell. 
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synaptic vesicle 
Small membrane-enclosed sac filled with neurotransmitter 
that releases its contents by exocytosis at a synapse. 


telomerase 
Enzyme that elongates telomeres, synthesizing the 
repetitive nucleotide sequences found at the ends of 
eukaryotic chromosomes. 


telomere 
Repetitive nucleotide sequence that caps the ends of 
linear chromosomes. Counteracts the tendency of the 
chromosome otherwise to shorten with each round of 
replication. 


telophase 
Final stage of mitosis in which the two sets of separated 
chromosomes decondense and become enclosed by a 
nuclear envelope. 


template 
A molecular structure that serves as a pattern for the 
production of other molecules. For example, one strand 
of DNA directs the synthesis of the complementary DNA 
strand. 


tertiary structure 
Complete three-dimensional structure of a fully folded 
protein. 


tethering protein 
Filamentous transmembrane protein involved in the 
docking of transport vesicles to target membranes. 


thylakoid 
Inachloroplast, the flattened, disclike sac whose membranes 
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contain the proteins and pigments that convert light energy 
into chemical-bond energy during photosynthesis. 


tight junction 
Cell-cell junction that seals adjacent epithelial cells together, 
preventing the passage of most dissolved molecules from 
one side of the epithelial sheet to the other. 


tissue 
Cooperative assembly of cells and matrix woven together 
to form a distinctive multicellular fabric with a specific 
function. 


transcription 
Process in which RNA polymerase uses one strand of 
DNA as a template to synthesize a complementary RNA 
sequence. 


transcription regulator 
Protein that binds specifically to a regulatory DNA sequence 
to switch a gene either on or off. 


transcriptional activator 
A protein that binds to a specific regulatory region of DNA to 
stimulate transcription of an adjacent gene. 


transcriptional repressor 
A protein that binds to a specific regulatory region of DNA 
to prevent transcription of an adjacent gene. 


transfer RNA (tRNA) 
Small RNA molecule that serves as an adaptor that “reads” 
a codon in mRNA and adds the correct amino acid to the 
growing polypeptide chain. 


transformation 
Process by which cells take up DNA molecules from their 
surroundings and then express genes present on that DNA. 


transgenic organism 
A plant or animal that has stably incorporated into its 
genome one or more genes derived from another cell or 
organism. 


transition state 
Transient structure that forms during the course of a 
chemical reaction; in this configuration, a molecule has the 
highest free energy; it is no longer the substrate, but is not 
yet the product. 


translation 
Process by which the sequence of nucleotides in a messenger 
RNA molecule directs the incorporation of amino acids into 
protein. 


translation initiation factor 
Protein that promotes the proper association of ribosomes 
with mRNA and is required for the initiation of protein 
synthesis. 


transmitter-gated ion channel 
Transmembrane receptor protein or protein complex that 
opens in response to the binding of a neurotransmitter, 
allowing the passage of a specific inorganic ion; its 
activation can trigger an action potential in a postsynaptic 
cell. 


transport vesicle 
Membrane vesicle that carries proteins from one 
intracellular compartment to another—for example, from 
the endoplasmic reticulum to the Golgi apparatus. 


transporter 
Membrane transport protein that moves a solute across a 
cell membrane by undergoing a series of conformational 
changes. 


transposon 
General name for short segments of DNA that can move 
from one location to another in the genome. Also known as 
mobile genetic elements. 
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tubulin 
Protein from which microtubules are made. 


tumor suppressor gene 
A gene that in a normal tissue cell inhibits cancerous 
behavior. Loss or inactivation of both copies of such a gene 
from a diploid cell can cause it to behave as a cancer cell. 


turnover number 
The maximum number of substrate molecules that an 
enzyme can convert into product per second. 


tyrosine kinase 
Enzyme that phosphorylates target proteins on tyrosines. 


unfolded protein response (UPR) 
Molecular program triggered by the accumulation of 
misfolded proteins in the endoplasmic reticulum. Allows 
cells to expand the endoplasmic reticulum and produce 
more of the molecular machinery needed to restore proper 
protein folding and processing. 


unsaturated 
Describes an organic molecule that contains one or more 
double or triple bonds between its carbon atoms. 


van der Waals attraction 
Weak noncovalent interaction, due to fluctuating electrical 
charges, that comes into play between two atoms within a 
short distance of each other. 


vesicular transport 
Movement of material between organelles in the eukaryotic 
cell via membrane-enclosed vesicles. 

virus 
Particle consisting of nucleic acid (RNA 
or DNA) enclosed in a protein coat and 
capable of replicating within a host cell 
and spreading from cell to cell. 


Vmax 
The maximum rate of an enzymatic reaction, reached when 
the active sites of all of the enzyme molecules in a sample 
are fully occupied by substrate. 


voltage-gated channel 
Channel protein that permits the passage of selected ions, 
such as Na*, across a membrane in response to changes 
in the membrane potential. Found primarily in electrically 
excitable cells such as nerve and muscle cells. 


voltage-gated Na* channel 
Protein in the plasma membrane of electrically excitable 
cells that opens in response to membrane depolarization, 
allowing Na* to enter the cell. It is responsible for action 
potentials in these cells. 


Wnt protein 
Member of a family of extracellular signal molecules that 
regulates cell proliferation and migration during embryonic 
development and that maintains stem cells in a proliferative 
state. 


x-ray crystallography 
Technique used to determine the _ three-dimensional 
structure of a protein molecule by analyzing the diffraction 
pattern produced when a beam of x-rays is passed through 
an ordered array of the protein. 


zygote 
Diploid cell produced by fusion of a male and a female 
gamete. A fertilized egg. 
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A site, ribosomes 251, 252F, 253-254, 
256T 
abbreviations and codes 
amino acids 76F, 120F 
nucleotides and bases 79F, 175-176 
absorptive cells/brush-border cells 523, 
703-704, 712-714 
acetic acid 49, 470 
acetyl CoA 
as an activated carrier 108-109 
in the citric acid cycle 430, 438-441, 
462 
mitochondrial conversions to 
438, 462F 
oxidation, ATP yield 469T 
acetylation, lysine residues 153, 
189-191 
acetylcholine 
cardiac ion channel effects 548 
as excitatory neurotransmitter 
418, 419T 
as extracellular signal molecule 536T 
ion channel binding 418F 
phospholipase C and 552T 
acetylcholine receptor 118F, 418, 
419-420, 537, 543 
N-acetylglucosamine 53, 73F 
acid anhydrides 67F 
acidic side-chain amino acids 77F 
acidity 
hydrolytic enzymes and 527 
maintenance in organelles 395, 
401-402, 465 
acids 
acidic side-chain amino acids 77F 
hydronium ion formation by 49-50 
proton donation by 470 
strong and weak 69F 
see also amino acids; carboxylic 
acids 
aconitase/cis-aconitate 442F 
ACTH (adrenocorticotropic hormone) 
550T 
actin 
B-actin gene 325F 
animal cell cortex 381 
contractile structures with myosin 
597, 599-605, 627 
polymerization 593-594 
actin-binding proteins 592, 594-596, 
598-599, 603F 
actin filaments 22, 134, 574, 592-599 
binding to extracellular matrix 
698-699, 705 
cell cortex and 595-597, 600F 


extracellular signals 598-599 
structural polarity 593 
actin-related proteins (ARPs) 595, 
598-599 
action at a distance 276 
action potentials 
direction 415F 
nerve axons 411, 414-416 
transmitter-gated ion channels 
409-410 
triggering muscle contraction 604 
voltage-gated ion channels 411-416 
activated carriers 
acetyl CoA as 108-109 
ATP as 104-106, 479 
and biosynthesis 101-112 
carboxylated biotin as 109T, 110F 
in cell respiration 427 
chemiosmotic coupling and 462 
coupled reactions and 101, 104 
FADH, as 108, 438, 440 
in fatty acid breakdown 438 
GTP as 440 
NADH and NADPH as 106-107, 479 
in photosynthesis 86, 478-479, 482 
S-adenosylmethionine as 109T 
activation energy 89-90, 91F, 100, 146, 
147F, 428F 
active sites 149, 151, 241F, 257, 259, 
261F, 475, 509, 516 
competitive inhibition and 145F 
lysozyme 146-147 
serine proteases 132F 
active transport 
contrasted with passive 393 
gradient-driven pumps 399-400 
three types of pump 397 
see also pump proteins 
adaptation, photoreceptor cells 556 
adaptins 513, 514F 
adenine 
methylation 218 
polyadenylation 237-239, 242, 244F 
as a purine base 57 
adenosine phosphates see ADP; AMP; 
ATP 
adenovirus 319F 
adenylyl cyclase 375T, 547-549, 551F, 
552, 562F 
adherens junctions 704-705, 706F, 708F 
adipocytes 450 
ADP (adenosine 5’-diphosphate) 57F 
ATP/ADP ratio 467-468 
as regulatory ligand 151F 
see also ATP; phosphorylation 
adrenaline see epinephrine 
adrenergic receptors 118F, 550 


aerobic metabolism 438, 476, 478, 489 
affinity chromatography 159, 166F 
age and cancer incidence 223F, 721 
age-related macular degeneration 
(AMD) 685 
aging, humans 357F, 475, 579 
aging, mice 357F 
agriculture 28, 359-360 
Agrobacterium 360F 
AIDS 318T, 319F, 320, 344F, 526, 680 
Akt kinase (PKB) 561, 562F 
albinism 667-668, 680 
aldolase 436F 
alkalis see bases 
alleles 
dominant and recessive 665-666 
as gene variants 653 
law of independent assortment 
668-669 
mixing in meiosis 664 
SNP linkage to 684 
allosteric proteins 
allosteric enzymes 151, 152F, 160T 
cytochrome c oxidase 474 
E. coli tryptophan repressor 273 
hemoglobin 304 
phosphofructokinase 448 
a helices 
as amphipathic 371, 372F 
as a common folding pattern 
126-128 
in enzyme-coupled receptors 558 
in GPCRs 545F 
in intermediate filaments 576 
proposed 160T 
protein-membrane association 128, 
376-378, 510 
a subunit, G protein 545-549, 553F, 
556 
ALS (amyotrophic lateral sclerosis) 578 
alternative splicing 241, 288 
Alu sequences 310, 311F, 317, 322F 
Alzheimer’s disease 129, 130F 
AMD (age-related macular 
degeneration) 685 
amino acid sequences/sequencing 
in antibodies 139 
determined by nucleotide sequences 
178, 181F 
and the genetic code 178 
importance of 59 
intrinsically disordered sequences 
131 
protein characterization 159-161 
protein shape and 119-121 
signal sequences (sorting signals) 
501-502, 504-505, 507, 509 
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amino acid sequences/sequencing 
(continued) 
similarity across species 31 
see also polypeptides; proteins 
amino acid side chains 
basic and acidic side chains 76F-77F 
charged and polar side chains 120F 
amino acids 
bacterial biosynthesis 150F 
D- and -forms 56 
ionization 76F 
mitochondrial matrix breakdown 
438 
precursors of 441 
as protein constituents 4, 56, 76F-77F 
protein sequencing 159-161 
in proteins from different species 31 
radiolabeled 246-247, 520 
as subunits 51 
three- and one-letter abbreviations 
120F, 245F 
tRNA coupling 249 
amino group in weak bases 50, 69F 
aminoacyl-tRNA synthetases 249 
amoebae 
as eukaryotes 16 
genome size 34, 181 
and osmotic swelling 395 
phagocytosis 524, 596 
as protozoa 27F 
AMP (adenosine monophosphate) 78F, 
111, 549 
see also cyclic AMP 
AMP-PNP 588 
amphipathic molecules 54-55, 367, 
368F, 370-371, 376-378 
amyloid plaques/fibers/structures 129, 
130F, 157-158 
Anabaena cylindrica 15F 
anabolic pathways 82 
activated carriers and 101F 
glycolysis and citric acid cycle 
products 441 
NADPH and 108 
regulation 447 
anaerobic respiration 14, 433-434 
anaphase, meiosis 656F, 659-660, 661F 
anaphase, mitosis 627F, 629F, 633-639, 
656F 
see also APC/C 
anaphase A/anaphase B 634, 635F 
ancestral cell 5-6, 457, 459 
see also common ancestors 
anemia 191, 715 
sickle-cell anemia 160T, 222, 680 
aneuploidy 663 
angina 555 
animals 
basic tissue types 695 
cell signaling in plants and 567 
cell structure 8F, 9 
contractile ring 637-638 
explanation of differences between 
323, 326 
glucose transport 400-401 
model organisms 29, 32 
optogenetics in living animals 421, 422F 
resting membrane potentials 406 
separate evolution of multicellularity 
from plants 567, 692 
anions 
concentration gradients 391 
solution behavior 47 


annealing see DNA hybridization 
antenna complexes 481, 483F 
antibiotic resistance 
horizontal gene transfer and 308, 
338 
mobile genetic elements and 315, 
316F 
plasmids in 338 
antibiotics and prokaryotic protein 
synthesis 255-256 
antibodies 138-139, 140F 
in affinity chromatography 159, 166F 
binding sites 138-139, 140F 
cancer treatment and checkpoint 
inhibitors 728 
ER assembly of 517 
immunoglobulin domains 139F 
immunoprecipitation 141F, 563, 
730 
labeling with 141F, 384 
making and using 140F-141F 
monoclonal antibody preparation 
141F 
raising in laboratory animals 140F 
specificity 138, 140F 
staining and 12F 
against tumors 728 
antibody labeling 
membrane flow investigation 
384-385 
anticancer drugs 148, 562, 584 
anticodons 248-249 
antifreeze proteins 117-118F 
antigen-binding sites 138-139 
antimitotic drugs 584 
antiparallel B-sheets 127F, 129, 131F, 
139F 
antiparallel strands, DNA 175F, 176, 
177F, 206 
antiparallel tetramers, intermediate 
filaments 576F 
antiports 400-401, 467, 468F 
APC (adenomatous polyposis coli) gene 
726-727, 728T, 730-731 
APC/C (anaphase-promoting complex 
or cyclosome) 617, 618F, 619, 625, 
633-635 
apoptosis 
avoidance by cancer cells 723 
Bad protein and 561 
Caenorhabditis elegans 32 
extracellular signals and 538F, 642 
as programmed cell death 640 
response to DNA damage 621 
suppression by survival factors 
643-644 
UPR and 518 
aquaporins 394, 404 
Arabidopsis thaliana 
cellular structure 695F 
gene numbers 323, 567 
genome size 35T, 323 
as a model plant 28, 567 
regeneration from a callus 360 
archaea 14-16, 26F, 314, 463, 466, 
490-491, 499 
arginine and nitric oxide 555 
Armadillo protein 730 
ARPs (actin-related proteins) 595, 
598-599 
arthritis 680, 683, 684, 697 
asexual reproduction, examples 652 
asparagine glycosylation 516 


aspartate 
bacterial biosynthesis and 150F 
precursors 441 
aspartate transcarbamoylase 125F, 151F 
asters 627F, 630, 632F 
atherosclerosis 526 
atomic number 40 
atomic weights 41, 43F 
atoms 
defined and described 40-41 
visualization 11 
ATP (adenosine 5’-triphosphate) 
as an activated carrier 57, 104-106, 
109T 
ATP/ADP ratio 467-468 
formation in mitochondria 17 
generation by oxidative 
phosphorylation 446, 456 
as a nucleotide 57, 79F 
in photosynthesis 479-480, 482 
production in the earliest cells 455, 
488-489 
protein phosphorylation and 
152-153 
rate of ADP conversion 466, 468 
rate of turnover 422, 468 
resulting from glucose oxidation 
processes 428, 469T 
resulting from glycolysis 423, 
434-435, 469T 
ATP analogs 588 
ATP cycle 57F, 104 
ATP-driven pumps 397, 399, 400F, 488F, 
526-527 
ATP synthase 
evolution of oxidative 
phosphorylation 456 
in Methanococcus jannaschii 491 
in photosynthesis 478-479, 482, 
484 
stage 2 of oxidative phosphorylation 
456, 461 
use of electrochemical proton 
gradient 465-466, 477 
ATP synthesis 
chemiosmotic coupling and 
476-477 
electron transport and 446 
ATPases 
ATP synthase reversibility 466 
kinesins and dyenins as 578 
protein pumps as 397, 399 
attachment proteins 381 
auditory hair cells 13F, 408, 409F, 
419T, 702 
autism 347, 359, 420, 717 
autocatalysis 5F, 259-260 
autocrine signaling 535-536 
automated genetic screening 676 
automated genome sequencing 
346-348 
autophagy 528 
Avery, Oswald et al. 194-195 
Avogadro’s number 41 
axons 
growth of 596 
intermediate filaments in 577 
signaling function 410-411, 415 
squid giant axons 411-413, 588 
terminals 411, 415, 420F, 535, 585 
transport along 585 
voltage-gated ion channels 419T 
axoplasm 412-413F, 588 
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B lymphocytes/B cells 140F, 141F, 
268 
Bacillus subtilis 303 
backbone models, protein structure 
124, 126F, 132F 
“backstitching” maneuver 207-208 
BACs (bacterial artificial chromosomes) 
348-349 
bacteria 
Cas9 defense mechanism 358 
cell walls and lysozyme 136 
chemical composition 51, 52T 
circular DNA in 179 
conjugation in 308F 
DNA cloning in 334-341 
in genetic engineering 106 
habitats 14-15 
membrane fluidity 372 
origins of chloroplasts as 15, 18-19, 
457-458, 490, 499 
origins of mitochondria as 14, 17-18, 
457, 490, 499, 500F 
as prokaryotes 11, 15 
replication rate 14 
restriction nucleases and 335 
shapes and sizes 14 
start codons 254 
sugar digestion regulators 275-276 
surface-to-volume ratios 499 
thermophilic 342F 
toxins and G proteins 547-548 
transcription initiation in 233, 
235 
transcription regulators 271-275 
translation accompanying 
transcription 255 
see also E. coli 
bacterial flagella 467 
bacteriorhodopsin 118F, 160T, 379-380, 
397, 402, 403T, 477, 545 
Bad protein 561 
“bait” proteins 563, 730 
Bak protein 642, 643F 
ball-and-stick models 44F, 52F, 56-57F, 
107F, 109F 
Barr bodies 191F 
barrier DNA sequences 190-191 
basal bodies 580F, 582, 590 
basal lamina 383F, 578F, 702-703, 
704-707, 713, 719F, 722 
base-pairing 58, 176, 179F, 216, 217F, 
230, 241, 289 
enabling DNA replication 
200-201 
hydrogen bonds, A-T and C-G 
177F, 201 
tRNA with rRNA 251 
wobble base-pairing 248 
see also complementary base-pairing 
bases (in solution) 
basic side-chain amino acids 76F 
hydroxyl ion formation 50, 69F 
proton acceptance by 50 
bases (nucleotide) 
abbreviations 79F, 176 
in DNA and RNA 57, 175, 209 
external features 234, 272 
as purines and pyrimidines 57, 78F 
unusual, in tRNA 248F 
See also adenine; cytosine; guanine; 
thymine; uracil 


basic side-chain amino acids 76F 
Bax protein 642, 643F 
Bcl2 family proteins 642, 644 
Beggiatoa 15F 
behavioral effects of mutations 676 
By complex, G protein 546, 547F, 548, 
553F 
B barrels 376F, 378 
B cells, pancreatic 267-268, 523F, 
536T 
B-galactosidase 281F, 353F 
B sheets 126, 127F, 129-130, 131F, 139F, 
157, 158F, 160T 
membrane proteins 378 
bi-orientation 632 
Bicoid gene 281F 
binding 
extracellular signal molecules 537F 
as a protein function 137-138 
binding energies 97-98 
binding sites 137-138 
cooperative binding 305F, 400 
multiple binding sites 151 
multiple polypeptide chains 
132-133 
oxygen 474 
transporters 392, 396, 400 
see also active sites; substrate 
binding 
binding strength, noncovalent 
interactions 97 
biosynthesis 
activated carriers and 101-112 
pathways beginning with glycolysis 
or the citric acid cycle 441 
see also anabolic pathways; catabolic 
pathways 
biotechnology industry 161 
bioterrorism 343 
biotin as an activated carrier 109, 110F, 
149 
birth defects 681 
bisphosphoglycerate 434-435, 437F 
1,3-bisphosphoglycerate 434, 435F, 
437F, 476, 486F 
bivalents 657-659, 661, 670 
blastocysts 716 
blebs 640 
blindness 361, 681, 685 
blood cells 
lubrication 383 
types 714, 715F 
blood groups 54, 73F 
blood platelets 699 
blood samples 341, 344F, 714 
body plan formation 280 
body size, determination of 639 
bond angles 44 
bond energies 
in activated carriers 86, 101, 427, 
466F 
conversion 84-85 
covalent bonds 44, 46 
“high-energy” bonds 95F, 428 
bond lengths 44, 48T, 68F 
bond strengths 45-46, 48T, 67F 
bone 693, 695-697, 698F, 700, 711 
bone marrow 710, 712, 714-715 
Boveri, Theodor 24T 
brain 
nerve cell receptors 421 
transmitter-gated ion channels 
419 
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CaM-kinase effects 554 
energy consumption 416 
gene expression 551, 552F 
regulatory DNA and evolution 326 
Brcal and Brca2 proteins 728 
brewer's yeast see Saccharomyces 
cerevisiae 
brown fat cells 476-477 
brush-border cells/absorptive cells 523, 
703-704, 712-714 
budding yeasts 2, 16, 17F, 28 
see also Saccharomyces cerevisiae 
buffers 50 
bypass reactions 448 
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C. elegans see Caenorhabditis 
C-termini, polypeptides 56, 120 
Ca% ions 
Ca?* channel abnormalities 717 
concentrations inside and outside 
cells 391T, 399 
fertilization and egg development 
553, 663 
inositol trisphosphate effects 
552-553 
intracellular messenger role 545, 
553 
ion-channel-coupled receptors 
544 
required by cadherins 705 
sequestration in smooth ER 497 
triggering muscle contraction 553, 
604-605 
voltage-gated Ca channels 416, 
417F, 604F 
Ca** pumps 118F, 399, 400F, 403T, 
554, 605 
cadherins 705, 706-707F, 711, 723, 725 
Caenorhabditis elegans 
centrosome 582F 
genome 35T, 309, 676-677 
introducing dsRNA 355 
as model organism 29, 32 
social behavior 676 
caffeine 549 
calcium ions see Ca?* 
calcium phosphate in bone 696F, 
700 
calico cats 191, 192F 
callus formation 359-360 
calluses 359-360 
calmodulin 125F, 554 
calories, conversion with joules 45, 
94F 
Calvin cycle 486z 
CaM-kinases (Ca?*/calmodulin- 
dependent protein kinases) 554 
cancer cells 
characteristics 718-719, 722-723 
competitive advantage 721-723 
favoring glycolysis 723 
invasiveness 718-719 
matrix proteases in 697 
mismatch repair and 218-219 
and somatic mutations 223 
and telomere shortening 215, 723 
unrestrained proliferation 718-719 
cancer-critical/driver mutations/genes 
720-721, 723-727, 728T, 730-731 
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cancers 718-729 catalysis cell communities see tissues 
anticancer drugs 148, 562, 728-729 and activation energy 89-90 cell cortex 
arising from uncorrected mutations defined 91 actin filaments and 574F, 595-597, 


599-600F 

and cell locomotion 596-598 

contractile ring and 637 

and plasma membranes 380-381, 
385, 596 

cell cycle 

chromosome behavior and 181-182 

duration and turnover rate 610, 
611T, 711-712 

eukaryotic, four phases 611-612 

overview 609, 610-613 


218-219, 222-223 
breast cancer 722F, 728 see also enzymes 
chromosomal loops and 278 B-catenins 730-731 
as clones of misbehaving cells cations 

718-719, 722F concentration gradients 391-392 
colorectal cancer 218, 223F, 715, solution behavior 47 

719F, 721F, 726-727, 730-731 CCRS5 receptor 682 
death rates 718 Cdc genes 30-31 
and epidemiology 719-720 Cdcé phosphatase 623, 624F 
as failure of controls 620, 726 Cdc25 phosphatase 618F, 623, 625 
familial predisposition 726, 730 Cdk inhibitors 618-621, 622F 
genetic instability 721 Cdks (cyclin-dependent protein kinases) senescence 723 
leukemia 148, 714F, 715, 729 613-618, 620-623, 624F, 630, 635 see also G, phase; G, phase; M phase; 
malignant and benign tumors 719, 731 cell-cycle control system 613 S phase 
melanomas 728, 729F, 731 G,-Cdks 617, 621 cell-cycle control system 612-613, 


energetics of 88-100 


metastases 719, 722, 725, 727-728, 
729F 

oncogenes and tumor suppressor 
genes 560, 723-725, 726-730 

p53 gene and 621, 723 

Ras mutations in 560, 673, 722 

regulatory pathways targeted 
725-726 

retinoblastoma 620 


G,/S-Cdks 614, 617, 620-621, 630 
M-Cdks 614, 616, 617T, 618-620, 
623-625, 630 
S-Cdks 614, 617, 620-621, 622F, 623, 
624F, 630 
vertebrate cyclins and Cdks 617T 
cDNA (complementary DNA) libraries 
339, 340F 
cells 


RTK abnormalities and 558 
treatment options 728 
Candida albicans 324F 
cap-binding complexes 242 
CAP (catabolite activator protein) 130, 
133, 275 
capping proteins 583, 598F 
carbohydrates 
cell surface 382-382, 386 
mono-, di-, poly-, and 
oligosaccharides 53-54 
see also sugars 
carbon-carbon bonds 44-45 
carbon compounds, importance 39, 
50-51 
carbon cycle 87 
carbon dioxide in respiration and 
photosynthesis 86, 88, 89, 427, 478, 
480F, 485F 
carbon fixation 86, 478, 480, 484-487 


animal cells as polarized 585 

apical, basal, and lateral surfaces 
382, 401, 702-705, 706F 

chemical similarity 3-4 

composition by mass 58 

defense mechanisms 290-291 

diploid and haploid distinguished 
652 

discovery 7 

energy use 82-88 

eukaryotic 16-27 

as fundamental to life 1 

genomics experiments in living cells 
352-354 

germ cells 65, 299-300, 652 

intracellular condensates 157, 158F, 
242 

microscopic examination 6-11 

numbers, control of 639-646 

numbers in Caenorhabditis elegans 


613-619 
checkpoints 612, 635, 663, 721, 728 
defects and cancer 721, 725, 728 
eukaryotes, as conserved 30-31, 613 
pausing the cycle 618-619 
temperature-sensitive mutants 678 


cell division 


asymmetric cell division 637 

cdc genes 30-31 

coordinated gene expression in 279, 
282 

cytoskeleton role in 23 

meiosis as reductive division 652, 
654 

microscopic view 7F 

mitogens and 620-621 

nondividing states 621-622 

organelle distribution 500 

protein phosphorylation 152 

rates of 610, 622 

and whole-genome duplication 
306 

yeast studies 28, 30 

see also cell cycle; meiosis; mitosis 


cell-free systems 155, 160T, 246-247, 


520 


cell fusion 


mouse-human hybrids 381 
in sexual reproduction 651, 653, 
662-663 


carbon-fixation cycle (Calvin cycle) 486 32 
carbon isotopes 41 numbers in human body 709 
carbon-nitrogen and carbon-oxygen organization into tissues and organs 


cell homogenates 158, 164F-165F 
cell junctions 704-706, 708F 


bonds 67F 
carbon skeletons 66F 
carboxyl group transfer 110F 
carboxylic acids 
as amphipathic 54 
carboxylated biotin 109T, 110F 
derivatives of 74F 
in water 54, 67F, 69F 
weakness of 49, 69F 
see also amino acids; fatty acids 
carboxypeptidase 149 
cardiac muscle see heart muscle 
cargo receptors 513, 514F 
carnivorous plants 405 
B-carotene 360 
cartilage 695, 697, 700-701 
Cas9 enzyme 358 
caspases 641-643 
catabolic pathways 82 
activated carriers and 101F, 108 
regulation 447 


three stages of catabolism 428-430 


see also citric acid cycle 


691 

plant and animal contrasted 8F, 567, 
692 

prokaryotic 11-16 

recognition by surface carbohydrate 
383 

repertoire of activities 710 

reprogramming differentiated cells 
285 

role of small molecules 50-58 

second law of thermodynamics and 
83-84, 90 

self-organization in 134 

shapes and sizes of 2, 11F, 639-645 

shared chemistry 3-4 

tissue organization and 710-711 

turnover 711-712 

unity and diversity 2-5, 701-702, 710 

viral lysis 317 


cell-cell interactions 


in organoids 718 
recognition 383, 516 
see also cell junctions; cell signaling 


cytoskeleton-linked junctions 691, 
693, 698-699, 700F, 704-707 

desmosomes 575, 577, 579, 
704-707 

gap junctions 403, 707-709 

importance of adhesion 711, 723 

intermediate filaments and 575 

in plants 692-693, 708, 709F 

tight junctions 382, 383F, 401, 527F, 
703-705, 708F 


cell locomotion/crawling 


characteristic of animal cells 2, 23 

enzyme-coupled receptors and 557 

integrins and 698-699 

proteoglycans and 701 

role of the cytoskeleton 590-592, 
594, 596-599 

see also motor proteins 


cell membranes 


as amphipathic 367-369, 370, 376 

as asymmetrical 373-374 

concentration gradients across 380, 
393-395, 405F 


internal membranes 19, 365-366, 
496-499, 500F 
preserved orientation 374, 377, 510 
see also lipid bilayers; membrane 
proteins; plasma membrane 
cell memory 191-192, 278, 286-287, 711 
cell proliferation 
cancer cells 712, 718-722, 725-727, 
730-731 
cell-cycle control system 610-611, 
621-622, 640, 644 
extracellular signals and 640, 643 
Ras mutations 564, 673 
RTK role 559, 560 
supplying differentiated cells 
712-714 
Wnt pathway and 714-715, 726, 
730-731 
cell respiration 
ATP generation from 455 
citric acid cycle elucidation 444-445 
complementary to photosynthesis 
86 -87 
dependence on diffusion 390 
efficiency 468-469 
mitochondria in 17 
role of sugars 427 
cell signaling 
contact-dependent signaling 
535-536 
general principles 534-545 
in plants and animals 567 
selectivity of response 537 
and stability 711 
types of signaling 534-536 
see also extracellular signal 
molecules; intracellular signaling 
pathways; signal transduction 
cell structure 
investigations 8-11, 24T 
plant, animal and bacterial 25F 
cell-surface receptors see receptors 
cell surfaces 
carbohydrate layer 382-382, 386 
patch-clamp recording 160T, 
407-408, 411F 
tumor-specific molecules 728 
cell theory 7-8, 24T 
cell type 
in culture 285 
and gene expression 268, 340 
protein composition 269-270 
selection for RNA-Seq 325 
specialization 278-287 
see also differentiation 
cell walls 
cellulose fibril orientation 695-695 
discovery of cells and 7 
and osmotic swelling 395, 693 
plant cytokinesis 638 
and plasma membranes 380 
plasmodesmata 708, 709F 
primary and secondary 693-694 
prokaryotes 14 
cellular respiration 17, 86, 87F, 427, 
444-445, 463-464, 468-469 
cellulase 168F 
cellulose 53, 693F, 694-695 
cellulose synthase 695F 
central dogma 4, 228, 246 
centrifugation 
boundary and band sedimentation 
61F 


density gradient 165F, 203 
fixed-angle and swinging-arm 164F 
separation of organelles 164F-165F 
ultracentrifuge 60-61, 164F-165F, 
203-204, 252F 
centrioles 25F, 582, 630F 
centromeres 182, 183F, 190, 311, 626, 
631 
in meiosis 659F, 660, 661F 
centrosome cycle 630 
centrosomes 
in idealized animal cell 21F, 25F 
microtubule growth from 580, 
581-582 
in mitotic spindle assembly 580, 627, 
630F 
see also cytoskeleton 
cesium chloride 165F, 203 
Cfh gene (complement factor H) 685 
Chalfie, Martin 520 
channelrhodopsin 421, 422F 
channels 
distinguished from transporters 389, 
404 
K+ leak channels 375T, 405, 406F, 
413, 415, 419T 
nuclear pores as 497, 501, 503F 
translocator 509 
see also ion channels 
chaperone proteins (molecular 
chaperones) 123, 124F, 258, 505, 506F, 
517-518 
charge separation 481, 485F, 489F 
Chase, Martha 195 
checkpoint inhibitors 728 
checkpoints, cell-cycle 611, 635, 663 
chemical bonds 40-50, 66F 
bond lengths 44, 48T, 68F 
bond strengths 45-46, 48T, 67F 
ionic and covalent 42-43 
noncovalent bonds 47-48 
single and double 45-46, 48T, 67F 
as source of food energy 427 
see also bond energies; covalent 
chemical bonds; peptide bonds 
chemical groups 51, 66F-67F 
chemical reactions 
activation energy 89-90, 91F 
coupled reactions 92, 94F-95F 
driven by photosynthesis 85-86 
equilibrium reactions 92 
free energy and direction of 89 
free energy and progress of 92 
localization within cells 495-496 
reverse reactions 49, 53F, 92, 94F, 
96, 100, 103, 110 
spontaneous reactions 89, 94F-95F 
see also condensation reactions; 
enzymes; reaction rates 
chemical signal interconversion with 
electrical 416-417 
chemical similarity of cells 3-4 
chemiosmotic coupling 
as ancient and widespread 457-458, 
464, 488F, 490-491 
delayed acceptance of 457, 469 
oxidative phosphorylation 476 
chemiosmotic hypothesis 457, 476, 477 
chemotaxis 596 
chemotherapy 728 
chiasmata 659-660 
chimpanzees 223, 310, 311F, 323, 326 
chitin 53 
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Chlamydomonas 591F 
Chlorobium tepidum 489F 
chlorophyll 
adsorption spectrum 480 
location in chloroplasts 18 
source of “high-energy” electrons 
463, 479, 482 
special pair dimer 481, 482F, 
483-484, 485F 
structure 481F 
chloroplasts 
collaboration with mitochondria 
487F 
energy storage 450, 487 
origins 15, 18-19, 457-458, 490, 499 
and photosynthesis 18-19, 478-488 
protein and lipid imports into 
505-506 
structure and function 19, 478-479 
thylakoid membrane 458F, 479-483, 
485F, 487F, 505 
cholera 547-549 
cholesterol 
as amphipathic 367, 368F 
and membrane fluidity 372 
receptor-mediated endocytosis 
525-526 
structure 75F 
synthesis 108F 
choline 55, 74F 
chromatids 183F 
chromatids, sister 183, 625, 626F, 
627-629F, 631, 633-634, 635F, 
657-658, 660, 661F 
chromatin 
compacting and extending 187, 
189-192 
defined 179, 184 
epigenetic inheritance and 287 
euchromatin 190 
heterochromatin 184F, 189-191, 277, 
291, 321-322F 
regulating DNA accessibility 188 
chromatin-remodeling complexes 
188-189, 276-277, 279, 291 
chromatography 
affinity chromatography 159, 166F 
column chromatography 141F, 166F 
gel-filtration chromatography 
166F 
immunoaffinity chromatography 
141F 
ion-exchange chromatography 166F 
protein isolation using 158-159, 
166F, 563 
chromosomal abnormalities 180, 352, 
721, 722F, 728 
chromosomal reassortment 653, 
660-662, 675F 
chromosomal translocations 180F, 722F 
chromosomes 
behavior and Mendel’s laws 669-67 1 
condensation 183-184 
discovery 174 
DNA packaging in 186-187 
homologous and sex chromosomes 
179, 184F, 304F, 345F, 652, 655-659, 
661F, 671F, 679 
human genome 321-323 
interphase chromosomes 181-184, 
185F, 187, 189-192, 277 
meiotic chromosome numbers 
654-656, 661 


1:5 


1:6 


Index 


chromosomes (continued) 
mitotic chromosomes 182, 183F, 184, 
185F, 187-188, 625, 626F, 631F, 
632-634, 635F 
segregation errors 662-663 
chromosome condensation 183-184, 625 
chromosome painting 179, 180F, 184F, 
722F 
chromosome pairing in meiosis 655, 
657, 659 
chromosome segregation 181, 610, 615, 
619, 629F, 655, 662F, 722F 
chromosome structure 
compressed state of DNA 183-184 
in eukaryotes 178-187 
gene expression and 278 
nucleosomes in 184-186 
regulation of 188-192 
chymotrypsin 125F, 132, 307F 
cilia 26-27, 118F, 580, 582, 590-592 
primary cilia 592 
stereocilia 13F, 409 
ciliary dynein 586, 591-592 
circular DNA 34, 179, 201, 211F, 213, 
273, 337 
cisternae (Golgi apparatus) 512F, 513T, 
518-519 
citrate synthase 142T, 442F 
citric acid cycle 
acetyl group oxidation in 438 
biosynthetic pathways beginning 
with 441, 461 
diagrams 429F, 442F-443F 
elucidation of 444-445 
mitochondrial matrix 438 
oxaloacetate in 110F, 438 
sequential pathways 444 
as third stage of catabolism 430 
Cl concentrations inside and outside 
cells 391T 
clam eggs 616 
clamp loaders 211-212, 213T 
classical genetics approaches 29, 354, 
564, 674-676 
clathrin-coated pits/vesicles 512-513, 
514F, 524-525, 526F 
claudins 704 
cleavage divisions 611, 615 
cleavage furrows 636, 637F 
clone-by-clone sequencing 348-349 
cloning 
cancers as clones 718-719, 722F 
multicellular organisms as clones 
709 
see also DNA cloning 
CML (chronic myeloid leukemia) 148, 
729 
CNVs (copy number variations) 679 
co-immunoptrecipitation 563, 730 
coated vesicles 512-513, 514F, 524-525, 
526F 
codes and abbreviations 
for amino acids 120F, 245F 
for bases and nucleotides 79F, 
176 
genetic code 178, 244-248, 249F 
“coding problem” 243 
codons 
defined 244 
neutral mutations 302 
start codons 254, 255F 
stop codons 245F, 251, 254, 255F, 
324 


coenzymes 79F, 148-149 
coenzyme A see acetyl CoA 
cohesins 625, 626F, 627, 633, 634F, 
658F, 660, 661F 
coiled coils 128, 576, 600 
colchicine 584, 590, 633, 637 
collagen 48, 135, 136F, 696-698, 699F, 
700-702, 703F 
color-blindness 672 
colorectal cancer 218, 223F, 715, 719F, 
721F, 726-727, 730-731 
combinatorial transcription control 279, 
282-285 
common ancestors 15, 34-35, 313 
see also ancestral cell 
comparative genomics 312, 349 
compartments, eukaryotic cells see 
organelles 
competitive inhibition 145, 444-445 
complement system 685 
complementary base-pairing 
codon-anticodon recognition 248F, 
249 
defined 176, 177F 
DNA probes and cloning 184, 335F, 
340-341, 341, 352F 
replication fidelity and 202, 207, 209, 
244 
RNA 240, 241F, 260, 291 
in transcription 229-231, 233 
see also DNA hybridization 
complementary DNA (cDNA) 
cDNA libraries 339, 340F 
MRNA analysis 351 
complementation tests 675F, 678 
concentration gradients 
contributing to electrochemical 
gradient 393-394 
inorganic ions 391 
Na* pump 398-399 
osmosis 394-395 
passive transport and 393 
in velocity sedimentation 165F 
condensation reactions 
biopolymers generally 59 
disaccharide formation 53 
driven by ATP hydrolysis 106, 110 
peptide bond formation as 119F 
phosphorylation as 105 
condensins 187F, 625, 626F, 628F 
conditional knockout mice 357 
conditional mutants 676-678 
confocal fluorescence microscopy 9F, 
12F, 17F, 24T, 32F 
conformations 
macromolecules 62 
NADH and NADPH 108 
conformations, DNA 
chromosome visibility and 174 
space-filling model 177F 
conformations, protein 
changes driven by ATP hydrolysis 
602, 603F 
changes in ATP synthase 466 
changes in gated ion channels 404, 
408, 414-415F 
changes in transporters and pumps 
396-397, 398F, 475F 
changes on binding ligands 151-152, 
554F, 559, 566, 586, 698 
changes on dimer cleavage 641F 
changes on inhibition 151-152, 621 
changes on phosphorylation 152-153 


disulfide bond stabilization 136 
HPr bacterial protein 124, 125-126F, 
129 
hydrophobicity and 121, 122F 
conjugate acid-base pairs 470 
conjugation, bacterial 308F 
connective tissues 692F, 693, 695-698, 
700, 702, 703F, 723 
connexons 707 
consanguineous marriages 668, 681 
conservative model, DNA replication 
202-203, 204F 
conserved DNA/mechanisms 
cell-cycle control system 30-31, 613 
evolutionary relationships and 
309-310, 313-315, 350 
functionally important regions as 
310-313, 314 
in human genome 323 
conserved orientation in membranes 
374, 377, 510 
conserved proteins 31, 131, 186 
conserved synteny 311 
constitutive exocytosis pathway 519, 
522 
contact-dependent signaling 535-536 
contractile bundles 27, 592, 593F, 595, 
601, 605, 705F 
contractile ring 592, 598, 600, 636-638 
control mechanisms 447 
COP-coated vesicles 513 
corn (maize) 307, 308F, 360F, 450F, 667 
cortisol 270, 282, 536T, 550T, 565, 566F 
coupled reactions 
activated carriers and 101, 106-107 
energetics of 92, 94F-95F 
in glycolysis 434 
“paddle-wheel” analogy 104 
photosystems I and II 484 
coupled transporters 467, 468F 
covalent chemical bonds 43-46 
disulfide bonds 77F, 136, 139F, 516 
“high-energy” bonds 67, 95F, 102-103, 
109F, 111, 112F 
peptide bonds 60, 67F, 70F, 76F, 92, 
126 
polar covalent bonds 43F, 45, 47-49 
single and double bonds 44-45 
covalent modification 
biotin and 149 
DNA methylation as 287 
of histone tails 188-189 
of proteins 153-154, 258F, 447, 516 
see also protein kinases 
crawling see cell locomotion 
Cre recombinase 357F 
creatine phosphate 102-103 
Crick, Francis 174-175, 202, 204 
CRISPR gene editing 358-359, 564 
cross-linking see disulfide bonds 
cross-pollination 664-665 
cross-talk 278, 568 
crossovers (meiosis) 657-660, 661F, 
662, 671-672, 675F, 679-680 
gene duplication and 303-304 
independent segregation and 
671-672 
cryoelectron microscopy 11, 160T, 161, 
169F, 379, 483F, 513F 
crypts, intestinal 713-714, 715F, 727F, 
728T, 731 
crystallization see X-ray diffraction 
CTP (cytidine triphosphate) 151F, 231 
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cultured cell types 285 ion concentrations 391, 399 dihydrolipoyl dehydrogenase/ 
cultured cells 12F, 32, 33F mRNA degradation in 242-243 transacetylase 438F 
curare 419, 544T pH control 401 dihydroxyacetone phosphate 436F 
cyanide 468, 475 dimensions see sizes 
cyanobacteria 314F, 315, 458, 478, 482, diploid cells 652 
489 D disaccharides 53, 73F 
cyclic AMP 79F disease states 
adenylyl cyclase and 549 DAG (diacylglycerol) 549, 551-553 atherosclerosis 526 
binding example 138F dalton (unit) 41 caused by mitochondrial dysfunction 
CAP activator and 275 Darwin, Charles 8, 28, 616 459 
hormones mediated by 550T, 551F databases caused by viruses 318T 
signaling pathway effects 549-551 comparative genomics 35, 341 diabetes 427, 518, 680, 682-684 
cyclic AMP-dependent protein kinase in DNA cloning 361 matrix proteases in 697 
(PKA) 550-551, 552F genome sequences 341-342, 350, see also cancers; genetic disorders 
cyclic AMP phosphodiesterase 549-550 684 disease susceptibility/predisposition 
cyclic GMP 555, 557F genomes and protein structure 526, 592, 678-680, 683-686 
cyclin-Cdk complexes 614, 617-618, 159-161, 162, 361F disorder (entropy) 83-84, 94F 
619F, 620, 623 ddNTPs (dideoxynucleoside disulfide bonds 77F, 136, 139F, 516 
cyclins 31, 614, 615-616 triphosphates) 346, 347F DNA 
G, cyclin 617, 620 de-differentiation 269F, 285 3’ and 5’ ends 176, 206 
G,/S cyclin 614, 620 deafness, inherited 668, 673-674, 680 chemical differences from RNA 58, 
M cyclin 614-616, 617F, 618-619, deamination in DNA 215, 216-217F, 229 
625, 633 261-262 in chloroplasts 19, 458 


regulation 617-618, 620-621 
S cyclin 614, 617 
vertebrate cyclins and Cdks 617T 
see also Cdks 
cysteine residues 
disulfide bonds 77F, 136, 139F, 516 
palmitate addition 153 
cystic fibrosis 517, 672, 682, 685 
cytochalasin 594 
cytochromes 
heme group 474 
cytochrome b,,, 131F 
cytochrome b,-f complex 482F, 484, 


death receptors 642 

definitions, genetics 675F 

dehydrogenations as oxidations 88 

Delbrtick, Max 202 

Delta protein 536T, 565 

AG (delta G) see free energy change 

denaturation, DNA 341, 342F 

denaturation, protein 123 

dendrites 3F, 353F, 410-411, 416F, 
420F 

density gradient centrifugation 165F, 
203 

deoxyribose formation 261 


double-helix formation 58, 175-176 

gene expression and 6, 228 

genetic information storage 3, 58, 
174, 193-195 

isolating and cloning 334-341 

localization in prokaryotes and 
eukaryotes 16F 

mitochondrial 17, 458 

packaging into chromosomes 
179-180, 183-184 

reading, between species 31 

see also conserved DNA; DNA 
structure; genomes; nucleotides; 


485F depolarization regulatory DNA 
cytochrome c in apoptosis 642 neurons 411, 414-415, 418 DNA/RNA hybrid helices 319 
cytochrome c oxidase complex 464, plasma membrane 408, 421 422F DNA-binding proteins 118F, 130F, 132, 
474, 475, 484 depurination in DNA 215, 216-217F 188F, 272 
cytochrome c reductase complex 473 desmosomes 575, 577, 579 single-strand 211 
cytokinesis 636-639 and cell junctions 704-707 DNA cloning 
contractile ring in 636-638 hemidesmosomes 578F, 704, 707, genomic and complementary DNA 
and mitosis as M phase 627, 629F 708F 340, 344F 
in plants 638 detergents 378-379, 385F by PCR 341-345 
cytosine development process see embryonic using bacteria 334-341 
deamination 262 development using vectors 337-339, 360-361F 
methylation 287 diabetes 427, 518, 680, 682-684 DNA damage 
as a pyrimidine base 57, 67F diacylglycerol (DAG) 549, 551-553 apoptosis and 621 
cytidine triphosphate (CTP) 151F Dicer protein 290-291 cell-cycle control and 621, 623 
cytoskeleton dideoxy sequencing (Sanger) 346, 347F, depurination and deamination 215 
cell junctions linked to 691, 693, 351F double-strand breaks 214, 219-222, 
698-699, 700F, 704-707 Didinium 26 358, 458F, 658F, 728 
and cell wall in plants 695 differential centrifugation 165F, 498 effects of mutagens 674 
enzyme-coupled receptors and 557 differentiation failure to repair 300-302 


extracellular matrix coupling 
698-699 

functions 22-24, 573 

mitotic spindle and contractile ring 
626-627 

motor proteins and 154-155 

muscle contraction and 600-606 

organelle location and movement 
498 

types of protein filament 574 

see also actin filaments; 
centrosomes; intermediate 
filaments; microtubules 

cytosol 

defined 21-22, 496 

diffusion rates 99 

as dynamic 23-24 

intercytosolic communication 707 


de-differentiation 269F, 285 
in embryonic development 6, 
282-284 
induced, in ES cells 716 
RTK role 559, 560 
as selective gene expression 
267-268 
terminal differentiation 286, 622, 
644, 712-714 
diffusion 
contrasted with facilitated transport 
390 
nitric oxide 554 
rates in the cytosol 99 
diffusion coefficients 384F 
digestion, in catabolism 430 
dihybrid crosses 669, 670F 
dihydrofolate reductase 147 


point mutations 300-301 
DNA duplication see gene duplication 
DNA fingerprinting 343-344, 345F 
DNA helicases 211, 212F, 213T, 357F, 
623, 628F 
DNA hybridization (renaturation) 
chromosome painting 179F 
detecting nucleotide sequences 
340-341 
in dideoxy sequencing 347F 
and DNA cloning 340-341 
by siRNAs 355 
in situ hybridization 352 
DNA libraries/genomic libraries 339, 
348 
see also genomic libraries 
DNA ligase 142T, 210, 213T, 218, 220T, 
337-338, 339F 
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DNA methylation 287 
DNA-only transposons 315, 316F 
DNA polymerases 
and cDNA libraries 339, 340F 
compared to RNA polymerases 232 
DNA polymerases I and III 210 
PCR use 342 
proofreading by 207-208, 209F 
repair polymerases 210F, 215, 
217-218, 221 
reverse transcriptase as 316 
template-based synthesis by 205-207 
DNA probes 184, 341, 352F 
DNA repair 215-223 
mismatch repair system 218-219 
DNA replication 200-215 
avoiding re-replication 623 
as bidirectional 205 
cell cycle phases 623 
conservative, semiconservative, and 
dispersive models 201, 202-204 
elucidation 202-204 
“end replication problem” 213 
error rates 218, 720 
leading and lagging strands 
distinguished 207 
in meiosis 655-656 
preservation of genome sequences 
223 
rates in prokaryotes and eukaryotes 
200 
replication machine 200-201, 205, 
210-211, 218 
S-Cdks and 623 
temperature-sensitive mutants 677 
transcription distinguished from 
230-231 
see also replication forks; replication 
origins 
DNA sequencing see genome; 
nucleotide sequences 
DNA structure elucidation 174-178 
DNA topoisomerases 212 
DNA viruses 319 
DNP (2,4-dinitrophenol) 476-477 
docking sites 
histone 189 
phosphorylated tyrosines 558, 
560-561, 563 
protein 153 
dogs, genetic traits 667F 
dolichol 75F, 516, 517F 
domains 
defined 130 
and exon shuffling 306-307 
illustrated 131F 
interaction domains 558-559 
prokaryotic 15-16 
and protein families 162 
dominant alleles 665-666 
dopamine 707-708 
double bonds 45 
in phospholipids 371 
resonance 66F 
double helix, in tRNAs 245 
double-strand breaks 214, 219-222, 
358, 458F, 658F, 728 
double-stranded RNA (dsRNA) 318 
Down syndrome 662 
driver mutations in cancer see 
cancer-critical 
Drosophila melanogaster 
Armadillo protein 730 


effects of mobile genetic elements 
307F 

embryonic development 12F, 710 

Eve gene 280-281 

Ey gene/transcription regulator 
284-286 

genome size 35, 309, 323 

mitotic spindles 633F 

as model organism 29 

Notch receptor 565 

drugs 

anticancer drugs 148, 562, 584 

development using human stem 
cells 717 


effects on microtubule dynamics 584 


pharmaceutical proteins 334 
see also antibiotic resistance; toxins 
dsRNA (double-stranded RNA) 290, 354 
dynamin 512, 514F 
dyneins 118F, 586-587, 590-592, 634 
ciliary dyneins 586, 591-592 
cytoplasmic dyneins 586-587 


E 


E. coli 
chemical complexity 51 
cytosol 22F 
evolutionary ancestors 490 
gene regulation experiments 
280-281 
genome 34-35 
Lac operon 275-276, 279 
as model organism 14F, 27-28 
E site, ribosomes 251, 252F, 253 
EcoRI enzyme 335-337F 
effector proteins 537, 540, 552, 560 
eggs 
enucleated 269F 
experiments on enucleated eggs 
268-269 
as gametes 652 
multicellular organisms from 
709-710 
oocytes 615, 616F 
size difference from sperm 653F 
zygotes as fertilized eggs 652, 663 
elastase 132 
elastin 135, 136F, 696 
electrical signal interconversion with 
chemical 416-417 
electrochemical gradients 
active transport 396-397 
component forces 393-394 
Na* and K* gradients 394 
Na* pump and 399 
oxidative phosphorylation and 456 
passive transport 396 
electrochemical H* gradients 402, 
464-465, 466-467, 482, 491 
electrochemical Na* gradients 399-400, 
401F, 402, 403F 
electrons 
activated carriers of 106-107 
and chemistry 40-47 
in oxidation and reduction 87-88 
see also “high-energy” 
electron affinities 470-471 
electron carriers 464, 470-474 
chlorophyll special pairs and 481 
cytochrome c oxidase complex 
474-475 


in the electron-transport chain 446, 
456, 471-473 
FADH, as 438 
mobile electron carriers 464, 
482-484 
NADH and NADPH as 107 
plastocyanin as 484 
plastoquinone and ferredoxin as 
482 
electron microscopy 
and cell structure 9-11 
light microscopes and 6 
transmission and scanning electron 
microscopes 10-11 
electron shells 41-44, 46 
electron-transport systems 
anaerobic respiration 434 
in chloroplasts/photosynthesis 456F 
energetics 471 
first appearance 455 
in mitochondria 430, 432, 439 
molecular mechanisms 469-475 
in oxidative phosphorylation 445, 456 
in photosynthesis 456F, 479, 481 
respiratory enzyme complexes 
464-465 
electronegativity 45-46, 88F 
electrophoresis see gel electrophoresis 
electrostatic attraction 48, 62, 71F 
histones and DNA 186 
and protein conformation 121 
elements (chemical) 
defined 40 
in living organisms 41-42 
periodic table 43 
reactivity 15, 42, 46, 54 
Embden-Meyerhof pathway 432F 
embryonic development 
apoptosis in 640 
asymmetric cell division 637 
differentiated cell types in 6 
differentiation in 6, 282-284 
epithelial sheets in 705, 706F 
model organisms 32, 710 
transcription regulators in 280-281 
zebrafish 32, 710 
embryonic stem cells (ES cells) 283, 
285-286, 355-356, 715-718 
“end replication problem” 213 
endocrine cells/signaling 534 
endocytosis 
balanced by exocytosis 21 
endocytic pathways 523-528 
and lysosomes 511 
phagocytosis and pinocytosis 
523-526, 528F, 592 
receptor-mediated 525-526 
endomembrane system 499, 500F, 501, 
507, 511, 512F, 519, 522 
endoplasmic reticulum (ER) 
cell division and 639 
ER retention signals 517, 519 
extent 498, 507F 
internal ER signal sequences 510, 
511F 
phospholipid synthesis at 373 
positioning 587 
possible origins 499 
ribosome attachment 507 
rough and smooth 20F, 497, 507 
sarcoplasmic reticulum 400F, 403T, 
604-605 
as source of proteins and lipids 506 


endosomes 497-500, 507, 511-513, 
522-523, 525-528 
early and late 526 
endosymbiosis 26 
endothelial cells 386F, 536T, 555, 
710-711 
energetically favorable reactions 
carbon fixation 486 
DNA double helix formation 176 
formation of activated carriers 101, 
104 
formation of lipid bilayers 370 
free-energy change and 91 
protein conformations and 122 
energetically unfavorable reactions 
ADP phosphorylation 106 
free-energy changes 92 
gluconeogenesis 448-449 
membrane fusion as 515 
energy 
from chemical bonds in food 427 
from fermentation 455 
glucose as predominant source 427 
from glycolysis 430-431 
membrane-based mechanisms 
456-457 
storage in electrochemical gradients 
399 
transcription 231 
use by living cells 82-88 
see also free energy 
energy carriers 57 
energy conversion in cells 84-85 
energy sources 
evolution of energy-generating 
systems 488-491 
small molecules as 51, 54, 427 
enhancers (gene activation) 276, 278 
entropy (disorder) 83-84 
environmental factors 
and cancer 719-720, 721 
and human disease 680, 682-686 
microenvironment modification 723 
sexual reproduction benefits 654 
enzymes 
classification 142T 
coenzymes 79F, 148-149 
effectiveness of catalysis by 81-82, 
89-90 
energetics of catalysis by 88-100 
feedback inhibition 149-150 
mechanisms of catalysis by 139, 
142-143, 146-147 
performance 144 
as proteins 59 
regulation of catalytic effects 
150-151 
ribozymes 109, 252-253, 259-260, 
261T 
ribulose bisphosphate carboxylase 
59, 118F, 168F, 485-486 
selectivity/specificity 142 
see also substrate binding 
enzyme-coupled receptors 543, 545, 
557-569 
enzyme-substrate complexes 
formation and stabilization 142F, 
143, 146, 147F 
lysozyme 146-147F 
enzyme inhibition 
competitive and feedback inhibition 
145, 150 
by drugs 147-148 


epidemiology of cancers 719-720 
epidermis see skin 
epidermolysis bullosa simplex 577, 579 
epigenetic changes 724 
epigenetic inheritance 287, 724 
epinephrine 450, 536T, 550-551, 555 
GPCR binding 545F 
epithelial cells 
apical, basal, and lateral surfaces 
382, 401 
cilia 590 
formation of cups, tubes, and 
vesicles 705, 706F 
glucose transport 400-401 
keratin filaments 575F 
sheets of, as polarized 702-703 
stratified epithelia 702, 713, 714F 
use of symports 400-401 
equilibrium constant, K 95F-97 
equilibrium density centrifugation 203 
equilibrium reactions 92, 93F, 94F, 100 
equilibrium sedimentation 165F 
ER see endoplasmic reticulum 
error rates 
DNA replication 218, 720 
meiosis 662-663 
transcription 232 
Escherichia coli see E. coli 
estradiol 536T, 565, 566F 
ethylene 
bond geometry 45 
as a plant hormone 567, 568F 
ethylene glycol poisoning 145 
euchromatin 190 
eukaryotes 
chromosome structure in 178-187 
eukaryotic cell 16-27 
genome size 34-35 
membrane-enclosed organelles 
496-500 
origins 24, 26-27 
prokaryotes distinguished from 11 
transcription initiation in 233 
Eve (even-skipped) gene 280-281 
evolution 
ancestral cell 5-6 
ancestral genes 33-35 
atmospheric oxygen and 439, 455, 
490F 
of brains 326 
of cancer cells 721-723 
as complementary to cell theory 8 
conserved mechanisms 309-310, 
313-315, 350 
of energy-generating systems 488-491 
of oxidative phosphorylation 456 
of prokaryotes 14 
reconstructing the process 309-315 
of RNA before DNA 109, 259 
separate, of plants and animals 567, 
692 
see also conserved DNA 
exocytosis 
balanced by endocytosis 21, 523 
constitutive exocytosis pathway 519, 
522 
regulated exocytosis pathway 522 
secretory pathways 515 
in vesicular transport 511 
exon shuffling 298, 299F, 306-307 
exons (expressed sequences) 
introns and 239 
mobile genetic elements and 317F 


Index 


expression vectors 361 
extracellular matrix 
fibrous proteins in 32, 135, 696 
light microscopy 8, 25F, 692F 
in plants and animals 692-701 
protein cross-linking 136 
space-filling gels 577, 595 
see also basal lamina; cell walls 
extracellular signal molecules 
actin filaments and 598 
binding 537F 
contact-dependent cell signaling 
535-536 
hormones as 536 
local mediators 535-536, 545, 550T, 
555, 557 
mitogens as 620, 643 
plasma membrane crossing 565-567 
range 534-536 
role in cell signaling 534 
see also neurotransmitters 
extracellular signals 
apoptosis induction 538, 642 
and cancer 645 
necessary for survival 642-643 
speed of response 538-539 
stem cell populations and 714-715 
Wnt pathway 714-715, 726, 730-731 
extreme environments 16 
Ey gene/transcription regulator 
284-286 
eye cup/optic cup 705, 706F, 717-718 


F 


Factor VIII gene 239F, 317 
FADH, (flavin adenine dinucleotide, 
reduced form) 108, 109T, 438, 439F, 
440-443F, 445F, 446, 462, 463-464F, 
473 
energy yield 468-469 
familial hypertrophic cardiomyopathy 
605 
Fas receptor/ligand 642 
fat droplets, chloroplasts 450, 487 
fat droplets, cytoplasmic 54, 75F, 
450-45 1F 
fats 
as an energy source 438 
breakdown and utilization 428 
brown fat cells 476-477 
storage 450 
fatty acids 
acetyl CoA from 438, 439F 
biosynthesis 108 
as lipids 55, 74F-75F 
saturated and unsaturated 54, 
74F 
as subunits 51 
feedback inhibition 145, 150, 274 
feedback loops 4, 5F 
feedback regulation 
cell-cycle control system 612 
intracellular signaling pathways 540, 
541F 
metabolic enzymes 447-448 
in signaling pathways 540, 541F, 556 
fermentation 433-435, 455 
ferredoxins 482, 483-485F, 489F 
fertilization 
diploid genomes from 663 
egg development following 709-710 
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fetal hemoglobin 305 
fibroblasts 
actin in 597F 
in connective tissues 696F, 697-698, 
699F 
micrographs of 12F, 33F, 638F 
fibroin 126 
fibronectin 698, 699-700F, 702 
fibrous proteins 134-135, 136F, 574, 
596, 631, 696 
see also protein filaments 
filopodia 593F, 596-599 
filtration 60-61 
first-cousin marriages 654, 669 
first law of thermodynamics 84 
Fischer, Emil 60 
FISH (fluorescence in situ hybridization) 
352F 
fission yeasts 2, 3F, 616 
see also Schizosaccharomyces pombe 
fixing specimens 9, 12F 
flagella 
bacterial 467 
microtubules in 580-583, 590-591 
sperm 590-592 
flies see Drosophila 
“flip-flop” operation 371, 372F, 373 
flippases 373-374 
fluidity of lipid bilayers 370-373, 
384-385 
fluorescein 382F 
fluorescence dyes 
visualizing DNA in cells 174F 
visualizing DNA in electrophoresis 
336-337 
visualizing kinesin motion 589F 
fluorescence microscopy 9, 12F 
confocal fluorescence microscopy 
OF, 12F 
endoplasmic reticulum 507F 
immunofluorescence microscopy 
575F 
microtubules 580F, 632F 
mitotic spindle 632-633F 
stained DNA 174F, 184F 
use of GFP 353-354, 384 
fluorescent antibodies 141F 
fluorescent proteins 118F, 382F, 384, 
507F, 520, 550F 
see also GFP 
FMRI (fragile X mental retardation 
gene) 359 
folding see 8 sheets (at beta); 
conformations; helices 
food breakdown see catabolic 
pathways; fats; glucose 
food storage 449-451 
food testing 343 
forensic science 
DNA fingerprinting 343-344, 345F 
PCR use 343-344, 345F 
formins 595, 598-599 
founder effects 682 
fragile X syndrome 359 
Franklin, Rosalind 175 
FRAP (fluorescence recovery after 
photobleaching) technique 384 
free energy, G 
in biological reactions 91, 94F-95F 
in catalysis 88-100 
from phosphoanhydride hydrolysis 
57 
and protein conformations 122 


free-energy change, AG 
equilibrium reactions 92, 94F 
favorable and unfavorable reactions 
89-92 
redox reactions 470-471 
sequential reactions 98-99 
see also standard free-energy change 
free ribosomes 508 
frogs 
egg cell 2, 636F 
embryo 610, 611-612, 710 
pigment cells 9F 
Xenopus 157F, 306, 615, 616F 
fructose 1,6-bisphosphate(ase) 436F, 
448 
fructose 6-phosphate 436F, 448 
fruit fly see Drosophila 
Fugu rubripes 313 
fumarate 442F-444, 488 
fungi and antibiotics 256 
see also yeasts 
fusion proteins 353, 521, 724F 


G 


G proteins 
effects mediated by phospholipase C 
549, 551-553, 562F 
subunits activated by GPCRs 545-546 
as trimeric GTP-binding proteins 
542 
G, phase, cell cycle 612, 620, 622 
G,-Cdks 617, 621 
G, cyclins 617, 620 
G, phase, cell cycle 611, 614, 618, 
620-623, 644 
G,/S-Cdks 614, 617, 620-621, 630 
G,/S cyclin 614, 617, 620-621, 622F, 
630 
G, to S transitions 612-614, 617-621, 
623, 644 
G, phase, cell cycle 611-612, 615 
G, to M transitions 612, 614, 619, 623 
GABA (y-aminobutyric acid) 418, 420 
GABA-gated Cl- channels 419 
GAGs (glycosaminoglycans) 700-701 
gain-of-function mutations 672F, 673, 
723-725 
galactocerebroside 368F 
B-galactosidase 281F, 353F 
gametes 
distinction from somatic cells 
299-300 
as haploid 652 
gap-junction proteins 674 
gap junctions 403, 707-709 
GAPs (GTPase-activating proteins) 
505F, 542, 559 
gating 
ion channels 408-409 
ligand-gated ion channels 408, 
417-418 
light-gated ion channels 408, 
417-418, 421, 422F 
mechanically-gated ion channels 
408, 409F, 419T 
transmitter-gated ion channels 
417-419, 543F, 544 
voltage-gated ion channels 408-411, 
414-415, 416, 417F, 419T, 604F 
GEFs (guanine nucleotide exchange 
factors) 154F, 542 
Ras-GEF 505F, 559 


gel electrophoresis 
agarose gel 336 
antigen separation 141F 
DNA fragments 335-336 
isoelectric focusing and PAGE 167F 
polyacrylamide-gel electrophoresis 
(PAGE) 161F, 167F, 336 
protein purification 141F, 159, 160T, 
161F, 166F-167F 
two-dimensional 161, 167F, 270 
gel-filtration chromatography 166F 
gels, extracellular matrix 577, 595 
genes 
carried in DNA 3, 193-195 
defined 180 
duplication and divergence 298, 
299F 
encoding functional RNA 322F, 323 
evidence for chemical nature 
173-175 
gene function studies and 
applications 350-361 
mutation driving evolution 5 
number in human genome 284 
numbers in model organisms 678 
numbers in vertebrates 313 
oncogenes and tumor suppressor 
genes 560, 723-725, 726-730 
see also protein-coding genes 
gene cloning see DNA cloning 
gene duplication 
and divergence 302-304 
and gene families 302-305 
whole-genome duplication 298, 306 
gene editing 358-359 
gene expression 
and cell differentiation 6, 267-268 
control during transcription 270-278 
effects of antibiotics 256 
enzyme-coupled receptors and 557, 
560 
extracellular signals and 270 
localization of 340 
MRNA analysis and 270, 351-352 
overview 268-271 
post-transcriptional controls 288 
proportion of protein-coding genes 
expressed 189, 270 
rates 228 
regulation of 270-271, 279, 282, 302 
regulation of protein activity via 149 
as transcription and translation 178, 
228, 232 
gene families 302-305, 313 
gene inactivation/silencing 
interphase X chromosomes 191, 
192F, 277, 291 
using RNAi 354-355 
“gene knockdown” 355 
“gene knockout” 356-357, 731 
general transcription factors 235-237, 
271, 276-277, 279F 
genetic change 
instability and cancer 721, 722F, 728 
origins of variation 298-308 
sexual reproduction and 660-662 
genetic code 178, 244-248, 249F 
genetic disorders 33, 215, 517, 672, 717 
cancer as 334, 720 
chromosomal loops and 278 
detecting responsible mutations 333 
epidermolysis bullosa simplex 577, 
579 


familial hypertrophic 
cardiomyopathy 605 
hemophilia 239F, 307, 317, 334, 672, 
680 
Kartagener’s syndrome 592 
muscular dystrophy 579 
progeria 579 
role of environment and mutations 
680, 682-683 
stem cell studies 717 
Tay-Sachs disease 681-682 
Timothy syndrome 717 
xeroderma pigmentosum 215 
Zellweger syndrome 506-507 
genetic engineering 
bacterial enzymes 145 
GFP tagging as 520 
optogenetics as 421 
protein production using 158, 
161-162, 361 
selective breeding as 334 
signal sequence investigations 502 
transgenic organisms 355, 357F, 
358-360, 577 
see also DNA cloning 
genetic linkage 672, 684-685 
genetic maps 672, 675F 
genetic screens 676, 677F, 678 
genetic variation, generating 298-308, 
643-644 
genetics 
complementation tests 675F, 678 
Drosophila melanogaster model 29 
experiments in classical genetics 
674-676 
human genetics 678-687 
mouse model 32 
genomes 
Caenorhabditis elegans 29 
comparing 297-298 
genomic DNA libraries 339 
of mitochondria and chloroplasts 17, 
19, 458 
phylogenetic trees 310, 314 
role in the cell 6 
whole-genome duplication 298, 
306 
See also genome size; human 
genome 
genome sequences/sequencing 
automated 346-348 
and common ancestors 33-34, 297 
and genome organization 310 
and protein databases 159, 161F 
as records of DNA replication and 
repair 223 
species sequenced 180, 323 
techniques 346-350 
see also human genome; nucleotide 
sequences/sequencing 
genome size 34-35, 180 
E. coli 334 
viruses 318 
genome-wide association studies 
(GWAS) 677, 683, 685-686 
genomic libraries/DNA libraries 339, 
348 
genotypes defined 666, 675F 
geometry see conformations 
germ cells/germ line 299-300, 652 
GFP fusion proteins 353-354, 521 
GFP (green fluorescent protein) 24T, 
118F, 353-354, 384, 520-521 


G, protein 548 
Giant gene 281F 
Gilbert, Walter 324 
Gleevec® (imatinib) 148, 729 
glioblastoma 725 
globins, a- and g- 133, 304-305 
B-globin gene/protein 181F, 191, 222F, 
239F, 243, 305, 308, 311F 
globular proteins 134, 189F, 581 
glucagon 268, 396, 450, 550T 
gluconeogenesis 448-449 
glucosamine 73F, 700F 
N-acetyl- 53, 73F 
glucose 
configurations and isomers 52-53 
feedback regulation and 447-448 
and glycolysis 430 
and insulin secretion 522 
passive transport 396 
as predominant energy source 428 
see also glycogen 
glucose 6-phosphate 436F, 448-450 
glucose transporter 118F, 396, 398F, 
400-401, 402F 
glucuronic acid 73F, 700F 
glutamate 418, 420 
glutamine synthesis 106, 110 
glyceraldehyde 3-phosphate (and 
dehydrogenase) 431T, 432, 434, 435F, 
436F-437F, 480 
in carbon fixation 486-488 
glycerols, triacyl- 54-55, 74F, 370F, 
439F, 450 
glycine 418, 420 
glycocalyx 383, 516 
glycogen 53, 73F, 102, 396, 445, 
449-450, 496F, 550T, 551, 552T 
breakdown and adrenaline 550T, 551 
glycogen phosphorylase 449 
glycogen synthetase 449 
glycolipids 
as amphipathic 367, 368F 
chemistry 53, 55, 75F 
membrane asymmetry and 374 
glycolysis 
biosynthetic pathways beginning 
with 441 
in cancer cells 723 
coupled reactions 434F 
enzymes involved 431T 
oxidation coupled with energy 
storage 434 
in plants 449F, 450 
reversed as gluconeogenesis 
448-449 
as second stage of catabolism 430 
of sugars 430-431 
ten stages of 431-432, 436F-437F 
glycoproteins 53, 382 
glycosidic links 53, 78F 
glycosylation 516, 527 
GMOs (genetically modified organisms) 
355 
GMP (guanosine monophosphate), 
cyclic 555, 557F 
goblet cells 703, 712-713 
gold-labeled antibodies 141F 
“golden rice” 360 
Golgi apparatus 
appearance 20 
cis and trans networks 519, 522, 527 
cisternae 512F, 513T, 518-519 
COP-coated vesicles 513 
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as ER protein destination 511 
function in eukaryotes 497T, 498 
microtubules and 587, 590 
origin of membrane asymmetry 
373 
protein modification by 511, 
518-519 
gonorrhea 308 
GPCRs (G-protein-coupled receptors) 
545-557 
adrenergic receptors as 550 
as cell-surface receptors 543 
as drug targets 545 
and G protein subunits 545-546 
as GTP-binding 542 
intracellular signaling pathways 
from 555-557 
structure 545 
G, protein 552, 553F, 555 
gradient-driven pumps 397, 399-401, 
402F 
active transport 399-400 
symports, antiports, and uniports 
400 
grana 479, 487F 
green fluorescent protein (GFP) 17F, 
24T, 32F, 118F, 353, 384, 520 
green sulfur bacteria 489 
Griffith, Fred 193-194 
growth factors 643, 644 
GTP-binding proteins 
dynamin 512, 514F 
as molecular switches 154, 542 
trimeric (see G proteins) 
“GTP cap” 583 
GTP (guanosine triphosphate) 
citric acid cycle 440 
GTP hydrolysis 504, 505F, 583-584 
GTP tubulin 583-584 
GTPase-activating proteins (GAPs) 505F, 
542, 559 
GTPases 
G protein a subunit 546 
monomeric GTPases 504, 505F, 514, 
542, 559, 598 
Rab as 514, 515F 
Ran as 504 
Ras as 559 
Rho as 598 
guanine 57 
guanine nucleotide exchange factors 
(GEFs) 154F, 505F, 542, 559 
guanylyl cyclase 555 
guide RNAs 358-359 
GWAS (genome-wide association 
studies) 677, 683, 685-686 


H 


H* ion see hydronium, protons 
Haelll restriction enzyme 335, 337F 
Haemophilus influenzae 348 
hair cells, auditory 13F, 408, 409F, 419T, 
702 
Halobacterium halobium 379, 477 
handedness, of helices 127-128, 177F, 
185 
haploid cells 
distinguished from diploid 652 
germ cells as 652 
haploid nuclei, meiosis II 660 
haplotype blocks 679-680 
Hartwell, Lee 30-31, 616 
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heart muscle 
acetylcholine effects 543 
contraction 605, 707 
heart attacks 55, 526, 716 
mitochondria in 459-460 
heart pacemaker cells 537, 538F, 548, 
556 
heat loss 84 
helices 
actin double helix 134, 593F 
collagen triple helix 135, 136F, 696 
in common folding patterns 
127-128 
DNA/RNA hybrid 231, 319 
DNA double-helix formation 58, 
175-176 
superhelices 696 
see also a helices (at “alpha”) 
heme group 133F, 148F, 149, 305F, 474, 
475F 
hemidesmosomes 578F, 704, 707, 708F 
hemoglobin 
early investigation of 
macromolecules 60-61 
fetal and adult 305 
nonprotein constituent 149 
sickle-cell anemia 222, 680 
subunits 133, 304-305 
see also globins 
hemophilia 239F, 307, 317, 334, 672, 
680 
hemopoietic stem cells 714-715 
hepatocyte growth factor 644 
hereditary diseases see genetic 
disorders 
“hereditary factors” 666 
heredity and DNA structure 176-178, 
327-328 
see also inheritance 
heroin 544 
Hershey, Alfred 195 
heterochromatin 184F, 189-191, 277, 
291, 321-322F 
heterozygous individuals 666-670, 
672-673, 675F, 677, 680-682, 684F 
hexokinase 137, 142, 436F 
hexosaminidase 682 
hibernation 465, 477 
“high-energy” bonds and their 
hydrolysis 67F, 95F, 102-103, 109F, 
111, 112F 
see also ATP; GTP 
“high-energy” electrons 
ATP production from 461-463 
from chlorophyll 463, 479, 482 
in NADH 107 
“high-energy” intermediate 476-477 
Hill, Archibald (A. V.) 102 
HindIII enzyme 335-336F 
histones 184-191 
histone H1 187 
histone H3 185-186, 189F, 190-191 
histone H4 185-186, 189F 
histone H2A 185, 186F, 189F 
histone H2B 185, 186F, 189F 
and the nucleosome 184-186 
histone acetyltransferases/deacetylases 
277 
histone-modifying enzymes 188-190, 
277F 
histone octamers 185-186, 188 
histone tails, modification 186, 189-191, 
277, 287 


historical landmarks 
cell structure determination 24T 
understanding of proteins 160T 
HIV (human immunodeficiency virus) 
318T, 319F, 320, 344F, 526, 680, 
682 
HMG-COA reductase 147 
Hodgkin, Alan 413 
homeodomains 272-273F 
homologous chromosomes 179, 184F, 
304F, 345F 
maternal and paternal 179, 652 
in meiosis 652, 655-659, 661F, 671F, 
679 
homologous genes 
in comparative genomics 309 
and proteins 35 
homologous recombination 
in DNA repair 220-222 
gene duplication and rearrangement 
303, 308 
in meiosis 653, 658 
production of transgenic mice 356F 
homophilic binding 705, 711 
Hooke, Robert 7, 24T, 693 
horizontal gene transfer 
antibiotic resistance through 308, 
338 
genetic change through 298, 299F, 
308 
hormones, in extracellular signaling 
534, 535F, 536T 
see also epinephrine; insulin; steroids 
housekeeping proteins 270 
HPr bacterial protein 124, 125-126F, 129 
human genetics 678-687 
human genome 
African origins and 320F, 326-327F, 
679-680 
animal counterparts 33 
compared with other species 309, 
310-311 
conserved proportion 323 
disease predisposition 679 
exon shuffling contribution 307 
and human individuality 327-328 
mobile genetic elements in 316-317 
Neanderthals and 326, 327F 
noncoding RNAs 289, 291 
size and number of genes 179, 320F, 
322T, 324-325 
tRNA genes 248 
see also genome sequences 
human genome sequencing project 346, 
348-349 
human papillomavirus 720 
human studies 32-34 
Hunchback gene 281F 
Hunt, Tim 31, 616 
Huxley, Andrew 413 
hyaluronan 700, 701F 
hybrid cells, mouse-human 381 
hydra 652 
hydride ions 107-108, 431T, 432 
hydrocarbon tails, lipids 367, 371-372, 
377F 
hydrocarbons, saturated and 
unsaturated 66F 
hydrogels 158 
hydrogen-bonding 
in a helices and g sheets 128-129, 
377-378 
as noncovalent 47 


in nucleic acids 58, 70F, 175-176, 
177F 
in proteins 70F, 121, 122F, 126, 127F 
at replication origins 201 
in RNA structures 230F, 248, 260 
strengths of A-T and C-G 177F, 201 
in water 47, 68F, 70F 
hydrogen gas as a nutrient 491 
hydrogen molecules 44F 
hydrogen peroxide see peroxisomes 
hydrogen sulfide (H,S) 489 
hydrogenations as reductions 88 
hydrolysis 53 
“high-energy” bonds 95F 
proteolysis as 256 
hydronium ions 49, 69 
hydrophilicity 47, 68F, 369, 701 
hydrophobic interactions 
as noncovalent 48, 62, 71F 
in protein conformations 121, 122F 
see also amphipathic molecules 
hydrophobicity 47-48, 68F, 369 
signal sequences 503 
hydrothermal vents 488, 490 
hydroxyl ions 49F, 50-51, 69F 
hypervariable loops 139 
hypothalamus 421, 422F 


Illumina sequencing 347, 351F 
imatinib 148, 729 
immune system 
complement system 685 
and stem cell potential 716 
use against cancers 728 
immunoaffinity chromatography 141F 
Immunofluorescence microscopy 575F 
immunoprecipitation 141F 
co-immunoprecipitation 563, 730 
import signals see signal sequences 
in situ hybridization 352 
in vitro fertilization 161 
in vivo and in vitro studies 32 
inactivation, voltage-gated ion channels 
414 
indels 679 
independent assortment, law of 
669-671 
indigo 145 
induced pluripotent stem cells (iPS 
cells) 285-286, 716-718 
infection diagnosis and PCR 343 
influenza virus 318T, 319F, 526 
inheritance 
laws of 664-674 
uniparental 664F, 665 
see also genetics 
inherited diseases see genetic disorders 
initiator tRNA 253 
inositol 1,4,5-trisphosphate (IP,) 549, 
551-552, 553F 
inositol phospholipid pathway 552-553 
inositol phospholipids 374, 552, 553F, 
559, 561 
Inoué, Shinya 588 
insertion mutations 307 
insulin 
isolation 563 
secretion stimulated by glucose 522 
insulin-like growth factor (IGF) family 
560-561 


integrases 319 
integrins 
in cell adhesion 698-700, 702, 
706-707 
in cell locomotion 597, 699 
in cytokinesis 637 
interaction domains 558-559 
interference-contrast optics 9F, 12F 
intermediate filaments 22F, 23, 135, 
574-579, 630, 693 
desmosome connection 705, 706F, 
708F 
see also keratin filaments 
interphase, cell cycle 181, 611, 628F 
interphase chromosomes 181-184, 
185F, 187, 189-192, 277 
interspersed (“junk”) DNA 180, 181F 
intestinal cell renewal see crypts 
intracellular compartments 19-21 
see also organelles 
intracellular condensates 157, 158F, 242 
intracellular signaling molecules 79F 
intracellular signaling pathways 
elucidation 563-564 
GPCR-triggered 555-557, 562F 
integration by 568, 569F 
response to extracellular signals 
539-540 
RTK-activated 562F 
intracellular signaling proteins 
as molecular switches 541-542 
RTK activation 558-559 
intrinsically disordered sequences 131 
introns (intervening sequences) 
239-241, 244F, 298, 302, 312-313, 
320-322F, 323-325 
iodoacetate 102 
ion-channel-coupled receptors 543-545 
acetylcholine activated 543 
also known as transmitter-gated ion 
channels 417-419, 544 
ion channels 
direct G protein regulation 548 
examples 419T 
nerve signaling and 410-422 
patch-clamp recording 160T, 
407-408, 411F 
selectivity and gating 404-405, 
408-409 
see also gating 
ion concentrations inside and outside 
cells 391, 399 
see also concentration gradients 
ion-exchange chromatography 166F 
ionic chemical bonds 42-43, 46-47, 79F 
ions, impermeability of lipid bilayers 
391 
IP, (inositol 1,4,5-trisphosphate) 549, 
551-552, 553F 
iron 
atmospheric oxygen and 490F 
heme group 133F, 148F, 149, 305F, 
474, 475F 
receptor-mediated endocytosis 526 
iron-sulfur centers 473, 483F, 489F 
isocitrate dehydrogenase 443F 
isoelectric focusing 167F 
isomerases 142T, 431, 436F, 448 
isomers 
monosaccharides 52, 72F 
optical isomers 53, 56, 76F, 142 
isotopes 40-41 
see also radiolabeling 


J 


Jews, Ashkenazi 682 

joules, conversion with calories 45, 94F 

jumping genes see mobile genetic 
elements 

“junk DNA” 180 


K 


K leak channels 375T, 405, 406F, 413, 
415, 419T 
Kartagener’s syndrome 592 
karyotypes 179F, 180, 721, 722F 
keratin/a-keratin 118F, 126, 128, 135, 
575F, 577, 578F, 579 
keratin filaments 135, 577, 578F, 705, 
706F, 707, 714F 
see also intermediate filaments 
a-ketoglutarate (dehydrogenase) 
441-444 
Khorana, Gobind 246-247 
killer lymphocytes 642, 728 
kinesins 118F, 586-587, 589, 634 
kinetics, enzyme 144 
see also reaction rates 
kinetochores 628F-629F, 631-634, 635F, 
660, 661F 
Kit gene 34F 
K,, (Michaelis constant) 143, 144 
“knock-in mice” 355F 
“knockout mice” 356-357, 731 
Krebs, Hans 444-445 
Krebs cycle see citric acid cycle 
Kruppel gene 281F 


L 


LI element (LINE-1) 311F, 316-317, 
322F 
Lac operon 275-276, 279 
lactase gene 302, 303F 
lactate dehydrogenase 131F, 447 
lactic acid cycle 102 
Lactobacillus 2 
lactose 73F, 275, 302, 303F 
lagging strand, DNA replication 
205-210, 211F, 212-213, 214F 
lamellipodia 596-599 
laminin 702, 703F, 707 
lamins 577-579, 635, 636F, 641 
“lariat” structures 240, 241F 
lasers 12F-13F, 159, 384, 633 
last common ancestor 
of plants and animals 567 
latent viruses 319 
latrunculin 594 
laws of inheritance 
Mendel’s first law 666-668 
Mendel’s second law 669-671 
laws of thermodynamics 
first 84 
second 83-84, 90 
LDL (low-density lipoproteins) 525-526 
leading strand, DNA replication 207, 
209, 211F, 212-213 
lectins 383, 386 
Leder, Phil 247 
Leeuwenhoek, Antoni van 7, 24T 
leptin gene/protein 311-312F 
leucocyte adhesion deficiency 699 
leukemia 148, 714F, 715, 729 
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life see living things 
lifestyle factors and cancer 719, 721 
ligand-gated ion channels 408, 
417-418 
transmitter-gated ion channels 
417-419, 543F, 544 
ligands, defined 137 
see also substrate 
light-driven proton pumps 402, 
545 
light-gated ion channels 408, 417-418, 
421, 422F 
light microscopes 
and the discovery of cells 6-7 
and the structures of cells 7-9 
see also fluorescence microscopy 
light reactions (photosynthesis) 479, 
484, 486, 487F 
lignin 694 
LINEs (long interspersed nuclear 
elements) 322F 
LI element 311F, 316-317, 322F 
linker DNA 185, 186F 
linker histones (histone H1) 187 
linker proteins 506F, 579, 580F, 705, 
707 
lipid bilayers 367-374 
essential fluidity 370-373, 384-385 
fatty acid derivatives as 55 
formation in aqueous media 
367-370 
limited permeability 390-391 
and lipid aggregates 75F 
protein association with 366, 376 
as self-sealing 366, 370 
synthetic/artificial bilayers 371, 
385, 390F 
lipids 
dolichol 75F, 516, 517F 
ER as a source 373 
hydrophobic interactions 48 
types 74F-75F 
see also glycolipids; phospholipids 
Lipmann, Fritz 102-103 
liposomes 371, 390F, 477 
Listeria monocytogenes 289F 
living things 
autocatalysis requirement 5F, 
259-260 
cells as fundamental units 1 
characteristics 1, 4, 39 
chemical composition 39 
number of species 2 
origins of life 309-315 
spontaneous generation 7 
ultimate dependence on solar 
energy 86 
local mediators 535-536, 545, 550T, 
555, 557 
logic operations 275 
logos (for DNA sequences) 273F 
Lohmann, Karl 103 
Loligo pealei (squid) 411F, 412 
long noncoding RNAs 289, 291-292 
loss-of-function mutations 672-673, 
682, 685-686, 724 
Lou Gehrig's disease 578 
LoxP recombination sites 357F 
lumen, organelles 374 
Lundsgaard, Einar 102-103 
lysine residues 
acetylation 153, 189-191, 277 
methylation 189F, 190-191 
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lysosomes 
digestion in 527 
endocytosis and 511, 524 
endosomes maturing into 527 
H* transporter 395, 402 
as intracellular compartments 21, 
498 
lysozyme 122F, 125F, 136, 142T, 143, 
146-147, 160T 


M 


M-Cdks 614, 616, 617T, 618-620, 
623-625, 630 
M cyclin 614-616, 617F, 618-619, 625, 
633 
M phase, cell cycle 
animal cell shapes 638F 
five stages of 627, 628F-629F 
mitosis and cytokinesis as 611 
MacLeod, Colin 194-195 
macromolecules 58-63 
biosynthesis 110-111 
evidence for 60-61 
nuclear pore complex and 503-504 
proportion of a cell’s weight 52T, 58 
sequence and conformation 59, 62 
see also polymers 
macrophages 2, 3F, 523-524, 697, 
710-712, 715F 
apoptosis and 641 
magnesium, in porphyrin 481F 
maintenance methyltransferases 287F 
major groove, DNA 177F, 272 
malaria 34F, 222, 302, 680 
malate 442F-444 
malonate 444-445 
manganese 483 
mannose 6-phosphate (receptor) 527 
MAP (mitogen-activated protein) kinase 
module 560, 568F 
margarine 372 
marriages, consanguineous 668, 681 
marriages, first-cousin 654, 669 
mass spectrometry 159-161, 270, 361F 
master transcription regulators 284, 
286 
matrix (of mitochondrion) 430, 432 
Matsui, Kazuo 615 
Matthaei, Heinrich 246 
McCarty, Maclyn 194-195 
mechanical stresses 577 
mechanically-gated ion channels 408, 
409F, 419T 
Mediator complex 276, 277F, 279 
meiosis 654-663 
contrasted with mitosis 655, 
656-657F 
errors 662-663 
homologous recombination role 222 
as reductive division 652 
meiosis I 655, 656F, 657-658, 660-662, 
672F 
meiosis II 655, 656F, 657, 660, 661F 
meiotic cell cycle, Xenopus 615 
meiotic spindle 657F, 660, 670-671 
melanin 667-668 
melanomas 728, 729F, 731 
membranes 
electron microscopy 10 
endomembrane system 499, 500F, 
501, 507, 511, 512F, 519, 522 


mitochondrial 18F, 395, 428, 429F, 
432 
surrounding organelles 495 
see also nuclear envelope; plasma 
membrane; transmembrane 
proteins 
membrane domains 381-383, 383F, 
385F 
see also action potentials 
membrane-enclosed organelles see 
organelles 
membrane fusion 372-373, 515 
membrane potentials 
concentration gradients and 393-394 
contributing to electrochemical 
gradient 393-394 
and ion channels 403-410 
and ion permeability 391-392, 
405-406 
resting membrane potentials 392, 
394, 406, 412, 414F, 419T 
voltage-gated ion channels 409-410 
see also electrochemical H* gradients 
membrane proteins 
ATP synthase as 456 
bilayer association 366, 376 
cell-free studies 378, 379F 
integral and peripheral 376 
main types 375T 
movement of 371, 381-382 
structure and function 375-376, 
379-380 
transporters and channels 389, 392 
see also channels; ion channels; 
transmembrane proteins; 
transporters 
membrane retrieval, Golgi apparatus 
523 
membrane transport 
active and passive 392-393 
diffusion and facilitated transport 
390 
principles 390-395 
see also electron-transport systems 
memory 158, 552, 554 
Mendel, Gregor 354, 664-674 
Mendelian diseases/monogenic 
diseases 681-683 
Mendel’s first law (of segregation) 
666-668 
Mendel’s second law (of independent 
assortment) 669-67 1 
mental illness 420 
MERRF (myoclonic epilepsy and ragged 
red fiber disease) 459 
Meselson, Matt 202-204 
messenger RNAs see MRNAS 
metabolic pathways 
anabolic and catabolic 82 
enzyme sequences 142 
regulation 447-451 
metabolic wheel 103F 
metabolism 
aerobic 438, 476, 478, 489 
anaerobic 476, 488 
defined 82 
metals, protein-bound 
associated with active sites 149, 464 
as electron carriers 471-473 
metamorphosis 640 
metaphase, meiosis 661 
metaphase, mitosis 352F, 627, 629F, 
632-633, 634F 


metaphase plate 
in meiosis 657, 660, 677F 
in mitosis 632-633 
metastases 719, 722, 725, 727-728, 
729F 
Methanococcus jannaschii 314F, 490-491 
methionine role in translation 253 
methotrexate 147-148 
methylation 
adenine in bacteria 218 
DNA 287 
lysine residues 189F, 190-191 
Meyerhof, Otto 102 
Mg” ions 391T 
micelles 75F, 378-379, 385F 
Michaelis constant (K,,) 143, 144 
Michaelis-Menten equation 144F 
microRNAs (miRNAs) 232, 289, 290F 
microcephaly 718 
microfilaments see actin filaments 
micrometer scale 2, 14, 134F 
microscopy 
interference-contrast microscopy 9F 
and knowledge of cells 6-11 
light and electron microscopes 6, 
8-11 
light microscopes 6-9, 12F-13F 
SPT (single-particle tracking) 
microscopy 385 
video-enhanced microscopy 585F, 
588 
see also electron microscopy; 
fluorescence microscopy 
microtubule-associated proteins 585, 
626-627, 630, 631F 
microtubules 574, 580-592 
appearance 12F-13F, 22, 23F 
in cilia and flagella 590-592 
drug effects on dynamics 584 
dynamic instability 582-586, 627 
formation of cilia and flagella 480 
meiotic spindle 657F, 660, 670-671 
organization of differentiated cells 
584-586 
orientation of cellulose deposition 
and 695 
as protein assemblies 134, 574 
structural polarity 581, 586 
see also mitotic spindle 
microvilli 592, 593F, 595, 663F, 703F 
milk, ability to digest 202F, 302, 328 
Mimosa pudica 409 
minor groove, DNA 177F, 272 
miRNAs (microRNAs) 232, 289, 290F 
miscarriages 663 
mismatch repair system 218-219 
Mitchell, Peter 476-477 
mitochondria 
chloroplast collaboration with 487F 
effects of dysfunction 459 
genetic code variants 245 
maternal inheritance 459 
origins 14, 17-18, 457, 490, 499, 
500F 
oxidative phosphorylation in 428, 
432, 459-469 
protein and lipid imports into 
505-506 
reproduction 458F, 480F 
shape, location, and number 
459-460 
structure and function 17-18, 
460-461 


mitochondrial DNA 17, 245, 458-459 
mitochondrial matrix and glycolysis 
430, 432, 438 
mitochondrial membranes 
electrochemical H* gradient 466-467 
pyruvate transporter 395 
mitochondrial replacement therapy 459 
mitogen-activated protein (MAP) kinase 
module 560, 568F 
mitogens 
cell division and 643, 644 
cyclins and 620-621 
mitosis 
chromosome behavior 181-182 
five stages of 627-635 
meiosis contrasted with 655, 
656-657F 
mitotic spindle 
and cytoplasmic cleavage 636-637 
formation and chromosome 
attachment 182, 183F 
microtubules and 23F, 580 
spindle poles 627F, 629F, 631, 633, 
635F 
staining 12F 
mitotic spindle assembly/attachment 
630-632 
mobile genetic elements 
also called transposons 315-317, 
322F 
Alu and L1 as 310, 311F, 316-317, 
322F 
DNA-only transposons 315, 316F 
genetic change through 298, 299F, 
307, 310-311 
in human genome 316-317 
and viruses 315-320 
model organisms 27-36, 674 
embryonic development 710 
genome sizes 35 
mole (unit) and molar solutions 41 
molecular chaperones 123, 124F, 258, 
505, 506F, 517-518 
molecular machines see motor proteins; 
protein machines; ribosomes 
molecular models 
backbone, ribbon, and wire models 
of proteins 124, 126F 
ball-and-stick models 44F, 52F 
space-filling 52F, 124, 125-126F 
molecular switches 
cell-cycle control system 613 
control by phosphorylation 542 
GTP-binding proteins as 154 
intracellular signaling proteins as 
541-542 
in muscle contraction 604-605 
Ras as 559 
molecular weights 41 
molecules 
defined 39-40 
electron microscopy of 11 
monoclonal antibodies 141F 
monogenic diseases 681-683 
monomeric GTPases 504, 505F, 514, 
542, 559, 598 
monomers see subunits 
monosaccharides 
aldoses and ketoses 72F, 436F 
structures 52-53, 72F-73F 
mosaicism 191F, 202 
motor proteins 
and actin filaments 595 


ATP hydrolysis and 154-155 
in the cytosol 23 
intracellular transport 586 
kinesins and dyneins as 118F, 586 
myosins as 118F 
mouse-human hybrid cells 381 
mouse (Mus musculus) 
conditional knockout mice 357 
embryonic development 640F 
embryonic stem cells 356 
experiments on genetic material 
193-194 
genome, compared with human 35T, 
310-311 
“knock-in” mice 355F 
“knockout” mice 356-357, 731 
as model organism 32 
muscle overdevelopment 646F 
optogenetics 421, 422F 
transgenic 356-357F, 359, 577 
MPF (maturation promoting factor) 
615-616 
mRNAs (messenger RNAs) 
eukaryotic 232, 237-238, 244F 
exported from the nucleus 242, 
243F 
pre-MRNAs 239-242, 244F 
prokaryotic 244F, 254 
revealing gene expression 270, 351 
translation 288 
mucus 53, 522, 590, 703, 713F 
multicellular organisms 
dependence on endosymbionts 459 
as eukaryotes 16 
of plants and animals 567, 692 
sexual reproduction among 299-300, 
652 
multicellular structures, prokaryotes 14, 
15F 
multigenic diseases 682-683 
multiple polypeptide chains 132, 134F 
muscle cells 
differentiation 283, 286 
use of myosin-II 599-600 
see also heart muscle; skeletal 
muscle; smooth muscle 
muscle contraction 600-606 
isolated muscle 102-103 
speed of 603, 605 
muscular dystrophy 579 
mutagens 
cancer and 720 
random mutagenesis 354, 674 
various effects of 674F 
mutations 
cancer-critical mutations 720-721, 
723-726, 728T, 730-731 
complementation tests 675F, 678 
as disease models 359 
disease predisposition and 333, 347 
driver and passenger mutations in 
cancer 720 
elimination of deleterious mutations 
654 
favorable 309, 680 
frequency in E. coli and humans 
301-203 
gain- and loss-of-function 672-673, 
682, 685 
gene inactivation by 354 
germ-line and somatic 222-223 
investigating lethal mutations 
676-678 
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and need for DNA repair 199, 215, 
219, 222 
neutral 302, 309-320, 326 
from nucleotide modification 217 
Ras protein, in cancers 560, 564 
recent, in human genome 327, 686 
as replication failures 5 
screening for mutants 564 
temperature-sensitive yeasts 520, 
521F 
types of genetic change 298-299 
see also SNPs 
Mycobacterium tuberculosis 524 
myofibrils 460F, 601, 602F, 604-605 
myoglobin 24T, 125F, 130, 160T 
myosin-I 599, 600-601F 
myosin-II 599-602, 603F, 605 
myosins 
animal cell cortex 381 
contractile structures with actin 597, 
599-605 
filaments 602-603 
as motor proteins 128, 155 
myostatin 645 


N 


N-linked oligosaccharides 516 
N-termini, polypeptides 56, 120 
Na*-H* exchanger 401, 403T 
Na*-K* ATPase 397, 403T 
Na* pump 
energetics 397-398 
restoring ion gradients 398-399 
NADH/NAD* system 
citric acid cycle and 429F, 432-433, 
438-441 
fermentation 433-435 
spectrophotometry 144 
NADH dehydrogenase (complex) 464, 
469-470, 473-474, 475F 
NADH (nicotinamide adenine 
dinucleotide) 
as an activated carrier 106-108 
in catabolism 429F 
resulting from glycolysis 431-433, 435F 
role 106-108 
NADPH (nicotinamide adenine 
dinucleotide phosphate) 
as an activated carrier 106-108 
in photosynthesis 479-480, 482, 483F 
role 106-108 
Nanog sequence 273F 
natural selection 
cancer cells 721, 722F 
and evolution 5, 8 
mutant hemoglobin 683 
operating on germ-line mutations 
223, 309, 673 
protein sequences 131 
Neanderthal genome 326, 327F 
necrosis 640-641 
negative regulation 151 
Neisseria gonorrhoeae 308 
nematodes see Caenorhabditis elegans 
Nernst equation 406, 413, 465F 
nerve cells (neurons) 
depolarization 411, 414-415, 418 
extent of apoptosis 643, 644F 
Notch receptor role 565 
polarization 585 
shape 2, 3F 
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nerve cells (neurons) (continued) 
structure and function 410-411 
as terminally differentiated 286 
use of ion channels 409, 410-422 
use of scaffold proteins 157 
nerve signaling 411-416 
nerve terminals 410, 416-417, 419T, 
420, 535, 536T, 555, 604 
neural circuits 411, 421 
neural tube 705, 706F 
neuraminidase 133 
neurodegenerative disorders 
intermediate filaments in 578-579 
protein misfolding 129 
neurofilaments 577-578 
neuromuscular junctions 418 
neuronal signaling 535-536 
neurons see nerve cells 
neurotransmitters 
dopamine 707-708 
drug action on receptors 419-420 
as excitatory or inhibitory 418-419 
as extracellular signal molecules 
535, 536T 
function 416-417 
glutamate as 418 
see also acetylcholine; transmitter- 
gated ion channels 
neutral mutations 302, 309-320, 326 
neutrons 40 
neutrophils 386, 596 
next-generation sequencing 347, 
350-351 
NGF (nerve growth factor) 118F, 536T, 
559 
nicks, sealing in DNA 210, 212, 213T, 
218, 337, 338F 
nicotinamide see NADH; NADPH 
Nirenberg, Marshall 246-247 
nitric oxide (NO) 536T, 554-555 
nitrogen fixation 15, 490 
NMR (nuclear magnetic resonance) 
spectroscopy 160T, 161, 168F 
Nobel Prize 31, 61, 102-103, 324, 413, 
444-445, 477, 520 
nomenclature 
nucleotides and bases 79F 
sugars 52-53 
noncoding RNAs 232, 235T, 288-289, 
291F, 325, 352, 675F 
noncovalent bonds/interactions 
electrostatic attraction 48, 62, 71F 
equilibrium constant and 96-97 
hydrophobic interactions 62, 
70F-71F 
in intermediate filaments 576 
in macromolecules 62-63 
membrane proteins 376 
in protein conformations 121 
and substrate binding 100, 137 
transcription regulators 272 
van der Waals attraction 70F 
see also hydrogen-bonding; substrate 
binding 
nondisjunction 662-663 
nonhistone chromosomal proteins 184, 
187, 189-190 
nonhomologous end joining 220 
Notch receptor 565F 
nuclear envelope 
continuous with ER 497 
electron micrograph 16F 
in eukaryotic cells 17 


inner and outer nuclear membranes 
503 
intermediate filaments and 578-579 
interphase chromosomes and 183 
mRNA transport through 237 
possible origins 499 
prometaphase disassembly 630-631 
protein transport through 501, 
503-504 
telophase reassembly 635 
nuclear export receptors 504 
nuclear import receptors 504, 505F 
nuclear lamina 
chromosome attachment 183 
cytoskeleton attachment 503, 575, 
578 
mitosis and 630, 635 
nuclear lamins 577-579, 635 
nuclear localization signals 504 
nuclear magnetic resonance (NMR) 
spectroscopy 160T, 161, 168F 
nuclear pore complexes 242, 497, 501, 
503F 
nuclear receptors 566-567 
nuclear transplantation 268-269 
nucleases 
Dicer 290-291 
in DNA repair 210, 217, 218T, 220F, 
221 
recombination-specific 221 
restriction nucleases 335-339, 349, 
358-359 
ribonucleases 243, 246, 340 
use in nucleosome investigations 
185 
nucleic acids 
3’ and 5’ ends 58F 
hydrogen-bonding in 58, 70F, 
175-176, 177F 
nucleotide subunits 56-58 
phosphodiester bonding in 58, 79F, 
176, 205, 218F 
synthesis 111 
see also DNA; RNA 
nucleolus 9, 25F, 157, 158F, 183, 184F, 
237F, 242 
nucleosides 57 
nucleoside triphosphates see ATP; GTP 
nucleosome core particles 184-187, 
189F 
nucleosomes 
chromosome structure and 184-186 
DNA repositioning 188-189 
eukaryotic transcription and 235, 
276-277 
histones in 184-186 
nucleotides 
chain-terminating ddNTPs 346, 347 
CTP (cytidine triphosphate) 151F, 231 
functions and nomenclature 
78F-79F, 176 
repetitive sequences 348, 349F 
ribonucleotides and 
deoxyribonucleotides 57 
as subunits 51, 56-58 
see also ATP; GTP 
nucleotide sequences/sequencing 
Alu sequence 310, 311F, 317, 322F 
conservation of 312-313, 350 
encoding hereditary information 177 
falling costs 321 
in human genome 179, 231, 
310-312, 321-323 


restriction enzyme targeting 335 
signaling introns 240 
techniques for DNA sequencing 
346-350 
see also conserved DNA; exons; 
genome sequences; human 
genome; introns; mobile genetic 
elements; regulatory DNA 
nucleus (atomic) 40 
nucleus (cell) 
as defining eukaryotes 11 
division in meiosis 655 
location of eukaryotic transcription 
238-239 
membranes 497 
Notch receptor access 565 
structure and function 16-17 
transplantation experiments 268-269 
Nurse, Paul 30-31, 616 


O 


obesity 682-685, 721 
occludins 704 
Okazaki fragments 207, 210, 211F, 212, 
213T, 218 
“oligo-” prefix 53 
oligosaccharides 53, 73F 
in glycoproteins 382 
in glycosylation 516, 517F 
oncogenes 560, 723-725, 728-729 
proto-oncogenes 724-725, 727, 728T 
oocytes 157F, 513F, 615-616, 659, 663 
operons 254F, 273, 279 
Lac operon 275, 279 
tryptophan operon 273-274, 278 
optic cup/eye cup 705, 706F, 717-718 
optical isomers 53, 56, 76F 
optical microscopes see light 
microscopes 
optogenetics 421, 422F 
ORC (origin recognition complex) 623 
ORFs (open reading frames) 324-325 
organ size and apoptosis 639-642 
organelles 
in cell division 500, 638-639 
electron microscopy 10, 16F, 496F 
eukaryotic cells 16-21 
evolutionary origins 457-458 
internal membranes 366 
location and transport 585, 587, 590 
main functions 497T 
membrane-enclosed 496-500 
motor proteins and 586 
protein sorting and import 500-511 
volumes and numbers 498T 
see also chloroplasts; mitochondria 
organic chemistry 
chemical groups 51 
defined 39 
organoids 33, 717-718 
origins of life 309-315 
role of RNA 259-262 
origins of replication see replication 
origins 
osmosis 394-395, 693 
see also chemiosmotic coupling 
osteoblasts 697, 712 
osteoclasts 712, 715F 
ouabain 397, 398F, 399 
oviduct 590, 663 
oxalic acid poisoning 145 


oxaloacetate 110F, 142, 430 438-439, 
440F, 441-445, 447 
oxidation 
energy derivation from food 86-87 
viewed as electron removal 87-88 
oxidation-reduction reactions see redox 
reactions 
oxidative phosphorylation 
efficiency 446 
electron-transport systems in 445, 
456 
evolution 488-489 
as a membrane-based mechanism 
456, 461 
in mitochondria 428, 461, 463 
role of mitochondria 428, 432, 459-469 
see also chemiosmotic coupling 
oxygen 
atmospheric, in catabolism 441 
in cell respiration 464 
photosynthetic origin 483 
origin in Earth's atmosphere 439, 
455, 490F 
water as source in citric acid cycle 
440 


P 


p21 Cdk inhibitor 621 
p27 Cdk inhibitor 619F 
p53 gene/p53 protein 153, 621, 723, 
728T 
P site, ribosomes 251, 252F, 253-254 
PAGE (polyacrylamide-gel 
electrophoresis) 161F, 167F, 336 
palmitic acid 54, 74F, 153 
pancreatic B cells 267-268, 523F, 
536T 
Paneth cells 713F, 715F 
paracrine signaling 534-536 
Paramecium 2, 3F, 26F 
Parkinson's disease 716 
parthenogenesis 652 
Pasteur, Louis 8, 476 
patch-clamp recording 160T, 407-408, 
411F 
paternity testing 345F 
pathogenicity restoration, pneumococci 
193-194, 338 
Pauling, Linus 45, 160T, 202 
PCR (polymerase chain reaction) 
diagnostic use 341, 343-344 
DNA cloning by 341-345 
forensic science use 341, 343-344 
second-generation sequencing 347, 
350 
PDGF (platelet-derived growth factor) 
375T, 536T, 559, 644-645 
peas, genetics of 664-669, 672-673 
pectin 141F, 693-694F 
pedigrees 668, 669F 
penile erection 555 
penile spines 326 
peptide bonds 60, 67F, 70F, 76F, 92, 126 
in proteins and polypeptides 56, 
119-120 
proteolysis 256 
peptidyl transferases 253-254, 256T 
periodic table 43 
peroxisomes 21, 24T, 165F, 497F, 497T, 
498, 500 
protein import 506 


pertussis 547 
pH, organelles 527 
pH gradients 
across mitochondrial membranes 
466-467 
in isoelectric focusing 167F 
see also electrochemical H* 
gradients; proton gradients 
PH scale 49, 69F 
phagocytic cells 140F, 523-524, 
640-641, 712 
macrophages and neutrophils as 
523, 524F 
phagocytosis 523, 528F, 592 
phalloidin 593F, 594, 599F 
pharmaceuticals see drugs 
phase-contrast optics 12F, 33F 
phenotypes 666-669, 670F, 672-678, 
680, 684 
Phormidium laminosum 15F 
phosphatases see protein phosphatases 
phosphates 
energy of phosphate bond 103 
inorganic, and phosphodiesters 67F 
see also nucleotides 
phosphatidylcholine 55F, 74F, 367-368, 
373F 
phosphatidylethanolamine 373-374F 
phosphatidylinositols 374F 
phosphatidylserine 368F, 373-374F 
phosphoanhydride bonds 57, 67F, 
79F, 105F 
phosphodiester bonds 58, 79F, 176, 205, 
218F 
phosphoenolpyruvate 434, 437F, 476 
phosphofructokinase 436F, 448 
phosphoglucose isomerase 436F 
2-phosphoglycerate 437F 
3-phosphoglycerate 434, 435F, 437F, 
441F, 485, 486F 
phosphoinositide 3-kinase (PI 3-kinase) 
560-561, 562F 
phospholipase C 
diacylglycerol and inositol 
trisphosphate from 549, 551-553 
RTK use 559 
phospholipids in cell membranes 55, 
74F, 367-370 
phosphorylation 
as a condensation reaction 105 
conformation changes following 
151-153 
control of molecular switches 
542 
of cyclin-Cdk complexes 616 
lamins 578 
mutual, of enzyme-coupled 
receptors 558 
of RNA polymerases 236-237, 238F 
substrate-level phosphorylation 
432, 434F, 435, 476 
phosphorylation cascades 542, 560 
photobleaching 384 
photoreceptor cells 
GPCR response speeds 556 
rods and cones 556, 557F 
switching between 708 
photosynthesis 
chloroplasts in 18-19, 478-488 
electron-transport systems in 456F 
energetics of 85-86 
evolution of 489 
in prokaryotes 15 


Index 


photosystems (I and II) 481, 482-484, 
485F, 489 
phragmoplasts 638, 639F 
phylogenetic trees 310, 314 
PI-3-kinase-Akt signaling pathway 561, 
562F 
pinocytosis 523-526 
PKA (protein kinase A) 550-551, 552F 
PKB (protein kinase B, Akt) 561, 562F 
PKC (protein kinase C) 553 
plants 
Arabidopsis as model 28 
carnivorous plants 405 
cell signaling in animals and 567 
cell walls 692-695 
cytokinesis 638 
flowering, evolution 28 
glycolysis in 449F, 450 
growth and microfibril orientation 
694 
metaphase and anaphase 
chromosomes 633F 
mobile genetic elements 307 
regeneration potential 268 
RNAi in 291 
separate evolution of multicellularity 
from animals 567, 692 
touch-sensitive 405, 409F 
transgenic 359-361 
whole-genome duplication 306 
plasma membrane 
cell cortex underlying 380-381 
cell wall production 694 
in cytokinesis 636-637 
depolarization 408, 421 422F 
in endocytosis 523 
in exocytosis 515, 522 
extracellular signal molecules 
crossing 565-567 
glucose transport 400-401 
microscopic appearance 9-10 
Na* concentration gradient 398-400 
as self-sealing 366, 370 
structure 365-366 
as sugar-coated 382-382, 386 
surface-to-volume ratios 499 
see also lipid bilayers; membrane 
potentials 
plasmids 337-338 
plasmodesmata 708, 709F 
Plasmodium vivax 302 
plastocyanin 484, 485F 
plastoquinone 482, 483F 
plectin 579 
pluripotency, ES cells 283, 715 
pluripotency, induced 285 
pneumococcus (Streptococcus 
pneumoniae) 193-194, 338 
point mutations (single nucleotide) 
300-302 
see also SNPs 
poisoning 
curare and strychnine 419 
cyanide 468, 475 
ethylene glycol 145 
malonate, on the citric acid cycle 
444-445 
see also toxins 
polar covalent bonds 43F, 45, 47-49, 88 
polar molecules 
acids and bases from 49-50 
among amino acids 76F, 120F, 121 
water 68F 
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polyacrylamide-gel electrophoresis 
(PAGE) 161F, 167F, 336 
polyadenylation/poly-A tails 237-239, 
242, 243-244F, 253F 
polycistronic RNAs 254 
polyisoprenoids 75F 
polymers 
energy in synthesis 110-111 
importance in living things 39 
see also macromolecules; subunits 
polymorphisms 
disease predisposition and 
679 
formation of haplotype blocks 
679-680 
see also SNPs 
polynucleotides see nucleic acids 
polynucleotide phosphorylase 
246 
polypeptide backbones 119-121, 122F, 
124, 126, 127F, 128, 377 
polypeptide side chains see amino 
acid side chains 
polypeptides 
C- and N-termini 56, 120 
theoretically possible number 
131-132 
proteins as 119-120 
ubiquitin as 153 
polyps 726-727, 731 
polyribosomes (polysomes) 255, 508, 
SO9F 
polysaccharides 53, 73F 
cellulose as 694 
lysozyme effects 143, 146F 
protein gels with 700-701 
in proteoglycans 700 
starch 53, 449F, 450-451, 487, 
672-673 
see also glycogen 
porins 125F, 378, 403, 452, 461-462, 
468F 
aquaporins 394, 404 
porphyrin rings 481F 
see also chlorophyll; heme 
positive feedback 284, 286-287, 540, 
541F, 625F 
positive regulation 151 
post-transcriptional controls 287-292 
post-translational modifications 258, 
288 
see also covalent modification 
postsynaptic cells 416-418, 420 
pre-mRNAs (precursor messenger 
RNAS) 239-242, 244F 
precursor cells 712-714, 715F, 718, 722, 
731 
prereplicative complexes 623 
presynaptic cells 416-417, 420 
primary structures, protein 159 
primases 208-209 
primate phylogenetic trees 310 
primers, DNA for PCR 342, 343F 
primers, RNA for DNA synthesis 
208, 210 
prions 129, 130F 
probability 83 
procaspases 641-642, 643F 
procollagen 697, 698F 
profilin 595 
progeria 579 
programmed cell death 640 
see also apoptosis 


prokaryotes 
antibiotics and 255-256 
comprising bacteria and archaea 14, 
15-16, 314 
constituent domains 15 
genome simplicity 34 
prokaryotic cell 11-16 
see also bacteria 
prometaphase 627-628F, 632, 635, 
636F 
promoters 
in transcription control 233-237 
and transcription regulators 271, 
273-277, 278F, 279 
pronuclei 355F, 663 
proofreading 207-208, 209F 
prophase 
meiosis 655, 657-659, 661F, 672F 
mitosis 627, 628F, 630-631 
proteases 
controlled breakdown by 256-257 
matrix proteases 697 
protein sequencing 158 
proteasomes 169F, 257 
proteins 
as amino acid polymers 4, 56 
denaturing and renaturing 123, 
342F 
ER modification 516 
fusion proteins 353, 521, 724F 
identification through 
co-immunoprecipitation 563, 730 
interchangeability between species 
30-31 
large-scale production 361 
lifetime and breakdown 256-257 
misfolding 129, 130F, 257, 517-518, 
673, 682 
multiplicity of functions 117-118F, 
137-149 
nature of genes and 227 
phosphorylation 152-153 
primary, secondary, tertiary, and 
quaternary structures 129 
production using genetic 
engineering 158, 161-162, 361 
purification and analysis 158-162 
regulation of activity 149-158 
RNA translation into 4 
shape and structure 119-136 
specific to cell types 269-270 
synthesis 255-256, 257-259 
unfolding to cross membranes 501, 
505-506 
see also pump proteins; protein 
machines 
protein-coding genes 
DNA cloning 340 
human genome 270, 322-323, 324, 
325F 
other species 35 
reporter gene studies 280-281, 
352-353 
protein complexes 
noncovalent bonding 63 
as protein machines 155-156 
protein-DNA interaction, transcription 
regulators 271-272 
protein domains 130, 131F, 139, 153F, 
306-307 
protein families 132, 162 
protein filaments 9F, 22, 24T, 134, 503, 
573-574 


protein kinases 
Akt kinase 561, 562F 
CaM-kinases 554 
and complex cell behaviors 562, 567 
in intracellular signaling 542 
and protein phosphatases 152, 542 
tyrosine and serine/threonine 542 
Weel inhibitory kinase 618F, 625 
see also Cdks; PKA; PKC 
protein machines 155-156 
replication machines 200-201, 205, 
210-211, 218 
see also motor proteins; ribosomes 
protein phosphatases 
Cdc6 623, 624F 
Cdc25 618F, 623, 625 
and protein kinases 152, 542 
protein sorting 496, 500-511, 519 
protein structures see amino acid 
sequences; conformations; domains; 
helices 
protein translocators 467, 501, 506, 509, 
516 
proteoglycans 382-383, 700-701 
proteolytic cascades 640-642 
proto-oncogenes 724-725, 727, 728T 
protons 
in atomic structure 40-41 
hydronium ion formation 49 
mobility and availability 469 
proton gradients, electron-transport 
chain 446 
see also pH gradients 
proton-motive force see electrochemical 
H+ gradients 
proton pumps 
ATP-dependent 397, 526-527 
cytochrome b,-f complex as 484 
electron-transport chain 464 
endosomes 526 
molecular mechanisms 469-475 
probable evolution 488 
protozoans 
as eukaryotes 26 
flagella 590 
ingestion of other cells 26 
Paramecium 2, 3F, 26F 
variety 27 
proviruses 319 
Prozac 419 
pseudogenes 305, 322T 
pseudopods 524 
psychoactive drugs 419-420 
PTC (phenylthiocarbamide) 664 
pufferfish 313 
pump proteins 
in active transport 380, 396-397 
ATP-driven pumps 397, 399, 400F, 
488F, 526-527 
Ca?* pumps 118F, 399, 400F, 403T, 
554, 605 
chemiosmotic coupling and 462 
gradient-driven pumps 397, 
399-401, 402F 
light-driven pumps 397, 402, 545 
transmembrane pumps 403F, 403T 
see also Na* pumps; proton pumps 
purification techniques, protein 141F, 
158-162 
purifying selection 311F, 312-314 
purines and pyrimidines 57, 176 
see also bases (nucleotide) 
pyrimidines 57, 67F, 151F, 176, 215 


pyrophosphate (PP) 111, 112F, 206 
pyruvate 
fermentation 433 
in gluconeogenesis 448 
in glycolysis 430-431, 437F 
wider role 447 
pyruvate dehydrogenase 438, 447 


Q 


quality control, DNA see proofreading 
quality control, gametes 663 

quality control, proteins 516, 517 
quaternary structures, protein 129, 132 
quinones see plastoquinone; ubiquinone 


R 


Rab GTPases 514, 515F 
Rac protein 599F 
Racker, Efraim 477 
radiation damage 215, 712, 715, 720 
radiolabeling 
amino acids 246-247, 520 
T2 bacteriophage 195 
radiotherapy 728 
random walks 99 
rapamycin 562 
Ras-GAP 559 
Ras GTPase/Ras gene 562F, 564, 673, 
722 
activation by RTKs 559-560 
Rb protein 620-621 
reaction centers (chlorophyll) 481-483, 
484-485F, 489 
reaction rates 
enzyme effects 95F, 142-144 
maximum (Vaa) 143-145 
reactions see chemical reactions 
reading frames 245, 252F, 253, 324 
ORFs (open reading frames) 324 
receptor-mediated endocytosis 525-526 
receptor proteins 118F 
receptor serine/threonine kinases 567 
receptor tyrosine kinases see RTKs 
receptors 
insulin receptor 563 
mutant receptor studies 563 
role in signal transduction 534, 537 
receptors, cell-surface 
death receptors 642 
effector proteins and 537, 540, 552, 
560 
enzyme-coupled receptors 543, 545, 
557-569 
interfering substances 544T 
main classes 543-544 
see also GPCRs; transmembrane 
proteins 
recessive alleles 665-666 
recessive mutations 
complementation tests 678 
loss-of-function 672 
persistence in the human genome 
673-674 
recombinant DNA technology 
DNA cloning 337, 361 
see also genetic engineering 
recombinases 357 
red blood cells 367, 380-381, 524, 710, 
712 
redox pairs 471 


redox potentials 
effects of concentration 472F 
electron affinities and 470-471 
standard redox potential, E’, 472F 
water and NADPH 484 
redox reactions 107, 470-472F 
reductions 87-88 
redundancy, in the genetic code 244 
Reese, Thomas 588 
refractive indices 8-9 
regulated exocytosis pathway 522 
regulatory DNA 
as conserved 313 
in human genome 35 
point mutations and 302, 303F 
reporter gene studies 280-281, 
352-353 
and species differences 312, 326 
and transcription regulators 235, 
271-272 
regulatory ligands 151-152, 156 
regulatory pathways targeted by 
cancers 725-726 
regulatory RNAs 288-289, 322T, 325, 
708, 710 
regulatory sites 145, 150-151, 566 
renaturation, DNA see DNA 
hybridization 
repair DNA polymerases 210F, 215, 
217-218, 221 
replication forks 201, 205-207, 209-213, 
219, 623, 624F 
asymmetry 206-207 
replication machines 200-201, 205, 
210-211, 218 
replication origins 
in bacteria 201 
in eukaryotic chromosomes 181, 
183F 
initiation of DNA synthesis 201 
origin recognition complex 623 
replication forks at 201, 205 
reporter genes 280-281, 352-353 
repressor proteins 274, 277, 280, 288 
resolution 
electron microscopy 13F, 169F 
fluorescence microscopes 9, 13F 
light and electron microscopes 9-10 
resonance (alternating double bonds) 
66F 
respiratory chain see electron-transport 
systems 
respiratory enzyme complexes 464-465 
restriction nucleases 335-339, 349, 
358-359 
retinal 148, 379-380 
retinoblastoma 620 
retrotransposons 316-317 
see also Alu sequences; L1 element 
retroviruses 318-319, 320F 
reverse genetics 354 
reverse transcriptase 316-317, 319-320 
and cDNA libraries 339, 340F 
and PCR 344F 
Rho GTPase 598-599 
rhodamine 382F 
rhodopsin 
bacteriorhodopsin 118F, 160T, 
379-380, 397, 402, 403T, 477, 545 
channelrhodopsin 421, 422F 
as a GPCR 545, 556 
retinal and 148-149 
ribbon models 124, 126-127F, 130 
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ribonucleases (RNases) 243, 246, 340 
ribose, ready formation of 261 
ribosomal RNAs see rRNAS 
ribosome-binding sequences 288, 289F 
ribosomes 
electron microscopy 10-11F 
free and membrane-bound 508 
inside organelles 501 
location in prokaryotes and 
eukaryotes 20F 
mRNA decoding by 249-252 
polyribosomes (polysomes) 255, 508, 
509F 
as protein complexes 63 
as ribozymes 252-253 
speed of operation 251 
structure 169F, 250-251 
see also endoplasmic reticulum 
ribozymes 109, 252-253, 259-260, 261T 
ribulose 1,5-bisphosphate 485-486 
ribulose bisphosphate carboxylase 
(Rubisco) 59, 118F, 168F, 485-486 
ring closure, sugars 72F 
RISC (RNA-induced silencing complex) 
289-290 
RITS (RNA-induced transcriptional 
silencing) 291 
RNAs 
chemical differences from DNA 58, 
229 
as DNA replication primers 208 
double-stranded (dsRNA) 318 
essentially single-stranded 229-230 
genetic information storage 260 
genetic information transcription 4, 
228 
as indicators of gene expression 270, 
351-352 
intermediate for retrotransposons 
317F 
and life's origins 259-262 
miRNAs (microRNAs) 232, 289, 290F 
noncoding 232, 288-289, 291-292, 
325, 352 
siRNAs (small interfering RNAs) 
290-291, 355 
snRNAs (small nuclear RNAs) 240 
synthesis in eukaryotes 242, 244F 
tRNAs (transfer RNAs) 232, 235T, 
245-249, 251, 252F, 253-254, 256T, 
288 
types of RNA 232 
see also MRNAS; rRNAs 
RNA capping 238, 254, 258F 
RNA interference (RNAi) 290-291, 
354-355, 359, 564, 676, 677F 
RNA polymerases 
compared to DNA polymerase 232 
primase as 209 
RNA polymerase | 235 
RNA polymerase II 235-239, 242 
RNA polymerase III 235 
in transcription 230-234 
RNA processing, nuclear 237-238 
RNA scaffolds 157 
RNA-Seq technique 325, 352 
RNA splicing 228, 230F, 232T, 239-241, 
244F, 259, 261T, 288, 298 
RNA viruses 319F 
“RNA world” 109, 259 
rod domains, intermediate filaments 
575-576 
rods (photoreceptor cells) 556, 557F 
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rRNAs (ribosomal RNAs) 169F, 231F, 
232, 250, 251F, 252-253, 288-289 
genes encoding 184F, 235T, 248, 
314 
sequence in molecular phylogenetics 
314 
RTKs (receptor tyrosine kinases) 
as enzyme-coupled receptors 153, 
557 
growth factor binding 644, 645F 
insulin-like growth factor (IGF) 
family 560-561 
intracellular signaling pathways 
from 562F 
MAP kinase and 560 
PI 3-kinase and 561, 562F 
Rubisco (ribulose bisphosphate 
carboxylase) 59, 118F, 168F, 485-486 


S 


S-Cdks 614, 617, 620-621, 622F, 623, 
624F, 630 
S cyclin 614, 617 
S phase, cell cycle 611-614, 617-621, 
623-625, 630F, 644 
Saccharomyces cerevisiae 
cell-cycle control 616 
cell size and shape 3F, 16F 
gene density 180F 
genome size 35T, 181 
mating factors 534 
as model organism 28 
protein substitution 30-31 
saltatory movements 586 
Sanger sequencing 346, 347F 
sarcomeres 601-603, 605 
sarcoplasmic reticulum 400F, 403T, 
604-605 
saturated hydrocarbons/fatty acids 54, 
66F, 74F, 371-372 
scaffolds 
long noncoding RNAs 291 
scaffold proteins 156-157, 158F, 
541F, 558 
scanning electron microscopy (SEM) 
11, 13F 
Schizosaccharomyces pombe 30-31, 
616 
Schleiden, Matthias 7, 24T 
Schwann, Theodor 7, 24T 
Schwann cells 710 
scramblases 373 
SDS (sodium dodecyl sulfate) and 
SDS-PAGE 167F, 379F 
second-generation sequencing 
techniques 347, 350 
second law of thermodynamics 83-84, 
90 
second messengers (small messengers) 
541, 549-552, 554-555, 557F 
secretory cells/vesicles 522 
secretory pathways 511-512, 515-523, 
678 
see also exocytosis 
securin 633 
segregation, law of independent 
666-668 
self-organizing structures 134 
semiconservative model, DNA 
replication 201-202, 204 
separase 633 


sequence and conformation, 
macromolecules 59, 62 
see also amino acid sequences; 
nucleotide sequences 
sequence selectivity, nucleases 335 
sequential reactions 
citric acid cycle 444 
free-energy change, AG 98-99 
see also coupled reactions 
serine proteases 132 
serine/threonine kinases 
Akt as 561 
PKA as 550 
as switch proteins 542 
Tor as 562 
serotonin 
as an excitatory neurotransmitter 
419-420, 550F 
muscle contraction 605 
sex chromosomes 320, 657 
X chromosomes 179, 191, 192F, 277, 
291 
Y chromosomes 179, 567, 652 
sex-determination genes 223F 
sex hormones 
estradiol 536T, 565, 566F 
testosterone 75F, 536T, 565-567 
sex pili 308F 
sexual reproduction 
benefits 652-654 
genetic change and 5, 299-300, 
653 
in unicellular organisms 654 
shapes of cells 2 
Sheetz, Michael 588 
Shimomura, Osamu 520 
short tandem repeats (STRs) 345F, 
364F 
shotgun sequencing 348-349 
sickle-cell anemia 160T, 222, 680 
sigma factor 233F, 234, 236, 271 
signal conversion, electrical and 
chemical 416-417 
signal molecules see extracellular signal 
molecules; intracellular signaling 
molecules 
signal proteins 118F 
signal-recognition particles (SRPs) and 
SRP receptors 508-509, 511F 
signal sequences (sorting signals) 
501-502, 504-505, 507, 509 
ER retention signals 517, 519 
in vitro investigations 520 
signal transduction 534, 539, 545, 558, 
568 
intracellular signaling pathways 
539-540 
SINEs (short interspersed nuclear 
elements) 322F 
Alu sequence 310, 311F, 317, 322F 
Single Molecule Real Time sequencing 
350 
single-nucleotide polymorphisms see 
SNPs 
single-strand DNA-binding proteins 211 
single-strand DNA viruses 318 
single-stranded DNA 623 
siRNAs (small interfering RNAs) 
290-291, 355 
sister chromatids 183, 625, 626F, 
627-629F, 631, 657-658, 660, 661F 
anaphase separation 633-634, 635F 
in meiosis 657-658, 659F, 661F 


sizes 
of atoms 41 
of bacteria 14F 
of cells and their components 2, 11F 
skeletal muscle 
acetylcholine effects 418F, 539, 543 
adrenaline effects 551 
muscle contraction 601-603 
myostatin and muscle mass 645, 
646F 
skin 
cell types 710 
electron micrograph of 698F 
epidermis, as epithelial sheet 
701-702, 711F, 713 
epidermis, renewal 712 
in epidermolysis bullosa simplex 
577, 579 
hyperextensible 697, 698F 
sliding clamps (on DNA) 211-212, 213T 
small contractile bundles 593 
small interfering RNAs (siRNAs) 289, 
290-291, 355 
small messengers (Second messengers) 
541, 549-552, 554-555, 557F 
small molecules 
in cells 50-58 
enhancing protein function 148-149 
small nuclear RNAs (SnRNAs) 240 
smoking tobacco 685, 720 
smooth muscle 286, 536T, 555, 575, 
600, 605, 692F 
SNAREs 514-515 
SNPs (single-nucleotide 
polymorphisms) 327-328 
human disease and 683, 684-685 
snRNAs (small nuclear RNAs) 240 
snRNPs (small nuclear ribonuclear 
proteins) 240, 241F 
sodium chloride 46-47, 68F 
solutes 
active and passive membrane 
transport 392-393 
lipid bilayer impermeability 390 
transport in plants, bacteria, and 
fungi 402 
somatic cells 
distinguished from germ cells 299, 
652 
mutation in cancer 720, 726 
sorting signals see signal sequences 
space-filling models 
DNA double helix 177F 
phosphatidylcholine 368F 
protein structure 124, 125-126F 
“spacer” DNA 279F, 323, 340 
special pair (chlorophyll dimer) 481, 
482F, 483-484, 485F 
specialization (of cells in multicellular 
organisms) 3 
species 
chosen as model organisms 27-36 
estimated numbers 2 
with genomes sequenced 180, 323 
spectrin protein 380-381 
spectrophotometry 144 
sperm 
as gametes 652 
mitochondria in 459-460 
propulsion 590-591 
sphingomyelin 373-374F 
spindle assembly checkpoint 635 
spindle equator 632 


spindle poles 627F, 629F, 631, 633, 635F 
spliceosome 235T, 240, 241F 
splicing machines/sequences see RNA 
splicing 
spontaneous reactions 
depurination and deamination 215 
energy barriers 91-92, 94F-95F 
genome duplication 306 
protein renaturation 123, 134, 258 
virus reassembly 319F 
SPT (single-particle tracking) 
microscopy 385 
squid giant axons 411-413, 588 
squiggle P (“high-energy” phosphate 
bond) 103. 
SRPs (signal-recognition particles) and 
SRP receptors 508-509, 511F 
Stahl, Frank 202-204 
staining 
chromosome painting 179, 180F, 
184F, 722F 
for microscopy 3F, 8-10, 12F 
standard free-energy change, AG° 
defined 92-93 
equilibrium constant and 95F-96F 
example reactions 94F 
hydrolysis of phosphates 105, 434 
redox potential difference and 
470-471, 472F 
standard redox potential, E’, 472F 
starch 53, 449F, 450-451, 487, 672-673 
Start, G, to S transition as 620 
start codons 254, 255F 
start-transfer sequences 509 
statins 147 
stem cells 712-718 
differentiated cells from 712-714, 
716 
embryonic stem cells (ES) 283, 
285-286, 356, 715-717 
induced pluripotent stem cells (iPS) 
285-286, 716-718 
stereocilia 13F, 409F 
steroids/sterols 
crossing the plasma membrane 
565-566 
as lipids 55, 75F 
synthesis in smooth ER 497 
see also cholesterol; cortisol; sex 
hormones 
“sticky ends” of DNA 335F 
Stoeckenius, Walther 477 
stop codons 245F, 251, 254, 255F, 324 
stop-transfer sequences 510 
storage proteins 118F 
Streptococcus pneumoniae 
(pneumococcus) 193-194, 338 
stresses, mechanical 577 
striated muscle see skeletal muscle 
stroma (chloroplasts) 478, 487 
STRs (short tandem repeats) 344F 
structural formulas see molecular 
models 
structural proteins 118F 
strychnine 419, 544T 
substrate binding 
and activation energy 89-90 
affinity and Ku 143 
electrostatic component 71F 
noncovalent interactions and 100, 
137 
specificity 63, 139, 142 
see also ligands 


substrate concentrations 
and equilibrium reactions 99 
and Michaelis constant 143, 144 
substrate-level phosphorylation 432, 
434F, 435, 476 
subunits 
cytoskeletal filament types 574 
polypeptide chains as 132 
see also macromolecules; polymers 
succinate/succinyl-CoA 442F-443F 
succinate dehydrogenase 444-445 
sugars 
a and B links 73F 
breakdown and utilization 428-446 
as carbohydrates 52 
from carbon fixation 484-487 
in cell respiration 427 
chemistry of 73F 
numbering system 78F, 177F 
prime mark numbering 78F 
ring closure 72F 
as subunits 51, 52-54 
sucrose biosynthesis 488 
types 72F-73F 
see also fructose; glucose; inositol 
sulfur bacteria 15F 
superoxide radicals 474-475 
surgery, for cancer 728 
survival factors/signals (and apoptosis) 
643-645, 723 
SV40 virus 135F 
Svedberg, Theodor 60-61 
switch proteins see molecular switches 
symbiosis 
chloroplasts 458 
mitochondria 18 
symports 400-402, 403F, 403T, 419 
synapses 158F, 359, 416, 417F, 418, 
419-421, 535, 536T, 544, 554 
synaptic cell signaling 419T, 420-421 
synaptic cleft 416-417, 420 
synaptic vesicles 416, 417F, 420F 
synaptonemal complexes 658-659 
Szent-Györgyi, Albert 444 
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T tubules (transverse tubules) 604-605 
T2 virus 195 
tandem mass spectrometry 160-161 
taste, of PTC 664 
TATA boxes 236, 237F, 276-277F, 281F 
Taxol 584 
taxonomy and comparative genomics 
314 
Tay-Sachs disease 681-682 
TCF transcription regulator 730-731 
telomerases 213-214, 289, 291, 723 
telomeres 181, 183F, 213-215, 232T, 
723 
telophase 627, 629F, 635-636, 638, 639F 
temperature-sensitive mutants 520, 
521F, 677-678 
template strands 
DNA replication 200, 205-207 
transcription 230-231, 234, 260 
templates, RNA use by retroviruses 318 
teosinte 307, 308F 
terminal differentiation 286, 622, 644, 
712-714 
terminator sequences 233 
testosterone 75F, 536T, 565-567 
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tethering of transcription regulators by 
DNA 276 
tethering proteins 156, 157F, 382, 385, 
514-515, 699, 705 
GPCR subunits 545 
microtubules 583 
TFIB/D/E/F/H 236-237 
thermodynamics, laws of 83-85 
third-generation sequencing techniques 
350 
“three-parent families” 459 
thrombin 142, 552T 
thylakoid membrane 458F, 479-483, 
485F, 487F, 505 
thymine 
dimers 215-216 
as a pyrimidine base 57, 78F 
thymosin 595 
tight junctions 382, 383F, 401, 527F, 
703-705, 708F 
timelines, RNA world 259 
see also historical landmarks 
Timothy syndrome 717 
tissue-specific knockouts 357 
tissue types 8F, 695 
tissues 
maintenance and renewal 709-718 
as mix of cell types 710 
stem cells in repair 715-716 
tobacco smoking 685, 720 
Tor protein (target of rapamycin) 562, 
645F 
touch-sensitive plants 405, 409F 
toxins 
acting on actin filaments 594 
curare 419, 544T 
G proteins and 547-548 
ouabain 397, 398F, 399 
strychnine 419, 544T 
see also poisoning 
Tradescantia 7F 
transcription 
in central dogma 4, 228 
and control of gene expression 
270-278 
cyclic AMP and 552F 
eukaryotic, location in nucleus 238 
general transcription factors 
235-237, 271, 276—277, 279F 
initiation of 233, 235 
mechanism of 224-243 
post-transcriptional controls 
287-292 
rates of 231 
transcription initiation sites/complexes 
233, 236, 271, 276 
transcription regulators/transcription 
factors 
bacterial 271-275 
combinatorial control 279, 282-285 
conserved structures 132 
in embryonic development 
280-281 
eukaryotic 235-238, 276-277, 279 
fusion with Cas9 359 
Lac repressor as 118F 
master transcription regulators 284, 
286 
numbers, in humans 284 
p53 153, 621 
and regulatory DNA sequences 
271-272 
repressors and activators 274-275 
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transcription regulators/transcription 
factors (continued) 
in stem cells and precursor cells 712, 
716-717 
TCF in cancer 730-731 
see also DNA-binding proteins 
transcriptional repressors 274 
transcriptional switches 273-274 
transcriptomes 352 
transcytosis 527 
transducin 556, 557F 
transfer RNAs (tRNAs) 232, 235T, 
245-249, 251, 252F, 253-254, 256T, 
288 
transformation, bacterial 338 
“transforming principle” (Avery et al.) 194 
transgenic organisms 355, 357F, 
358-360, 577 
transgenic plants 359-361 
transition states 146-147 
translation 243-259 
in the central dogma 4 
ribosome involvement in 251-252 
translation initiation factors 253 
translocation see protein translocators 
transmembrane proteins/receptors 
as amphipathic 376 
B barrels in 378 
cadherins as 705 
connexons as 707 
enzyme-coupled receptors as 557 
GPCRs as seven-pass 545 
insertion process 505, 507, 509-510 
integrins as 698-699 
lectins as 383, 386 
plants 567 
reaction centers as 481 
signal sequences 509-510 
in signal transduction 539 
single-pass and multipass 377, 391, 
510 
SNAREs as 514-515 
see also membrane proteins; pump 
proteins 
transmembrane pumps see pump 
proteins 
transmission electron microscopy 
(TEM) 10, 13F, 23F, 134, 169F 
transmitter-gated ion channels 417-419 
as ion-channel-coupled receptors 
543F, 544 
transport proteins 118F 
HPr bacterial protein 124, 125-126F, 
129 
transport signals 513 
transport vesicles see vesicular 
transport 
transporters 395-402 
distinguished from channels 389, 392 
glucose transporter 118F, 396, 398F, 
400-401, 402F 
pumps as 393 
scramblases as 373 
transposable elements/transposons 
see mobile genetic elements 
transposases 315 
transverse tubules (T tubules) 604-605 
treadmilling (in actin filaments) 594 
tree of life 309-310, 314-315 
triacylglycerols 54-55, 74F, 370F, 439F, 
450 
tricarboxylic acid cycle see citric acid 
cycle 


trichothiodystrophy 357F 
triose phosphate isomerase 436F 
tRNAs (transfer RNAs) 232, 235T, 
245-249, 251, 252F, 253-254, 256T, 
288 
tropomyosin/troponin 605 
trypsin 132, 142T, 159-160, 161F 
tryptophan operon/tryptophan 
repressor 273-275, 278 
Tsien, Roger 520 
tuberculosis 524 
tubulins 
a-tubulin 581 
6-tubulin 581-583 
y-tubulin 582 
see also microtubules 
tumor suppressor genes 723-725, 
726-728, 730 
tumors, benign 719, 731 
tumors, malignant see cancers 
turgor pressure 395, 405F, 693, 695F 
turnover numbers 143 
turnover times 711-712 
two-dimensional gel electrophoresis 
161, 167F, 270 
tyrosine aminotransferase 270 
tyrosine kinases 542 
see also RTKs 
tyrosine phosporylation, RTKs 558-559 
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ubiquinone 464, 469, 472F-474, 482 
ubiquitin 153, 257, 528, 617, 618F, 634F 
ultracentrifuge 60-61, 164F-165F, 
203-204, 252F 
ultraviolet radiation 
formation of thymine dimers 216F 
use in visualizing labeled DNA 
fragments 336 
uncoupling agents 476-477 
undifferentiated cells 283, 359, 692, 712 
see also stem cells 
unfolded protein response (UPR) 518 
uniparental inheritance 664F, 665 
uniports 400-401, 402F 
units of measurement 11F 
molecular weight 41 
unsaturated hydrocarbons/fatty acids 
54, 66F, 74F, 371-372 
unstructured regions, proteins 130-131 
UPR (unfolded protein response) 518 
UTR 3’ and 5 untranslated region 
(3’ UTR and 5’ UTR) 238F, 243, 288, 
322T 
uracil 
as characteristic of RNA 209, 229 
poly-U as coding for phenylalanine 
246-247 
as a pyrimidine base 57, 78F 
resulting from cytosine deamination 
262 
urea 68F, 123F 
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vaccines 361, 547, 720 

Vale, Ron 588-589 

van der Waals attractions 48, 70F 
vectors, plasmids as 337-339, 360-361F 
velocity sedimentation 165F 
vertebrates 


cyclins and Cdks 617T 
DNA loss and gain during evolution 
and gene numbers 313 
whole-genome duplication 306 
vesicles 
artificial 385 
coated vesicles 512-513, 514F, 
524-525, 526F 
communication between organelles 
499 
formation by epithelial cells 705, 
706F 
Golgi apparatus 20F, 374F 
membrane renewal via 370-371 
vesicular transport 
of proteins from ER 21, 501-502, 
506-507, 511-515 
of proteins within and from Golgi 
apparatus 519 
tethers and SNAREs in 514-515 
Viagra 555 
video-enhanced microscopy 585F, 588 
vimentin (and vimentin-related) 
filaments 577 
vincristine and vinblastine 584T 
viral capsids 135F 
Virchow, Rudolf 609 
viroids 261F 
viruses 
detection in blood samples 344F 
disease states caused by 318T 
hosts and genomes 317-318 
and mobile genetic elements 
315-320 
protein coats 134 
reproduction 317-318 
retroviruses 318-319, 320F 
RNAi as protection from 291 
spontaneous assembly 319 
T2 virus 195 
whether living 5 
vitamin A 149, 360-361 
vitamin B,, and uptake by endocytosis 
526 
vitamins, biotin as 149 
V ax (Maximum reaction rate) 143, 
144-145 
voltage-gated Ca™ channels 416, 417F, 
604F 
voltage-gated K* channels 414-415, 
419T 
voltage-gated Na* channels 411, 414, 
415F 
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water 

acid and base formation in 69F 

biologically significant properties 48, 
68F-69F 

diffusion across membranes 
390-391 

electrostatic attractions in 71F 

hydrogen-bonding in 47 

osmosis 394-395 

proton mobility and availability 469 

redox potential 484 

solubility of ionic compounds 69F 

as source of oxygen atoms in citric 
acid cycle 440 

as source of oxygen gas in 
photosynthesis 483 


water-splitting enzyme (photosystem II) 
483-484, 489 
Watson, James 174-175, 202, 204 
wavelength and microscope resolution 6 
weak acids 69F 
weak interactions 47-48 
see also noncovalent bonds 
Weel inhibitory kinase 618F, 625 
whales 223F 
whole-genome duplication 298, 306 
whooping cough 547 
Wilkins, Maurice 175 
Wingless gene 730-731 
wire models 124, 126 
Wnt pathway and cell proliferation 
714-715, 726, 730-731 
wobble base-pairing 248 
worms, nematode see Caenorhabditis 
wound healing 535, 559, 573, 644, 698 
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X chromosomes 179, 191, 192F, 277, 
291 
X-inactivation 191, 192F, 277, 291 


X-ray diffraction/crystallography 
DNA structure elucidation 175 
lysozyme structure 143 
nucleosomes 186F 
principles of 128F, 161, 168F 
protein structure elucidation 61 
Rubisco 168F 

Xenopus spp. 157F, 306, 615, 616F 

xeroderma pigmentosum 215 

Xist noncoding RNA 291 

Xpd gene 357F 
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Y chromosomes 179, 567, 652 
yeasts 
Candida albicans 324F 
cell-cycle control system 616 
cell shapes and sizes 2 
as eukaryotes 16 
mating factor 533, 534F, 545 
occasional sexual reproduction 
654 
protein similarity experiments 
30-31 
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Schizosaccharomyces pombe 30-31, 
616 

sexual reproduction in 654 

temperature-sensitive mutants 520, 
521F 

see also budding yeasts; fission 
yeasts, Saccharomyces cerevisiae 
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zebrafish 
CRISPR system 359 
embryonic development 32, 710 
gene multiplication 306 
as model organism 32, 354 
Zellweger syndrome 506-507 
zinc ions 149 
zona pellucida 663 
zygotes 652, 663 
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